Abstract
The study discusses the importance of summarization in dealing with a large amount of data available on the internet. The study used a deep-learning algorithm based on functions from the spacy library in Python to summarize news articles and evaluated the impact of named entity recognition on the summarization process. The study assessed different datasets from CNN-DailyMail and the BBC (entertainment articles) and found that the proposed method based on named entity recognition showed significant improvement in recall, precision, and F-score compared to the word frequency method. The study also observed that the articles from CNN- DailyMail were longer, with an average of 551 words and 28 sentences, compared to the BBC (entertainment articles), which had an average of 190 words and 12 sentences. The evaluation results showed that the proposed method based onnamed entity recognition performed better on the shorter articles from the BBC, indicating that the method was more effective in summarizing shorter texts. In summary, the study highlighted the importance of summarization in dealing with a large amount of data available on the internet. It showed that named entity recognition can significantly improve the effectiveness of the summarization process. The study alsoobserved that the proposed method was more effective in summarizing shorter texts Keywords: Spacy library, entity recognition, Summarization, Deep-Learning, CNN-DailyMail
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.