Abstract

Objective: To develop a query-based document summarizer for the Afaan Oromo language that summarizes a document according to the query entered by the user.

Methods: This study follows the design science research approach, which treats problem-solving and the creation of knowledge as a thoughtful, intellectual, and creative activity. The query-based framework uses TF-IDF term weighting. HornMorpho is used for morphological analysis, and the Natural Language Processing Toolkit is used for text processing. The system was tested at extraction rates of 10%, 20%, and 30%. Objective evaluation used recall, precision, and F-measure; subjective evaluation was carried out by language experts.

Findings: The proposed system achieved F-measures of 90%, 91%, and 93% at summary extraction rates of 10%, 20%, and 30%, respectively. For informativeness and coherence, the system's best results were average scores of 51.67%, 56.67%, and 54.17% on a five-point scale at extraction rates of 10%, 20%, and 30%, respectively, when both methods were used together.

Novelty: Using a morphological analysis tool improved the system's F-measure from 80.67% in the previous work to 91.3%, although further research is still needed to improve Afaan Oromo text summarization.

Keywords: Document; Summary; Natural Language Processing; Morphological Analysis; Text Ranking

Summary

Introduction

Because of the huge amount of information available, users need new technology that can process it. Document summarization that is based on a query is known as query-based summarization. According to Michael (2), in query-based document summarization the resulting summary is about the query that was asked. The task is, given a user query, to create a summary of the document that delivers information relevant to the user's information needs. Text summarization is an activity meant to produce a clear and simple summary containing only the key ideas of the documents by shortening long pieces of text. Being able to recognize this information quickly, in an organized, short, and precise way, gives the reader an overview of the ideas in the texts.
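To make the task concrete, the following is a minimal sketch of query-based extractive summarization with TF-IDF weighting at a chosen extraction rate. It is not the authors' implementation: the regex tokenizer, the smoothed IDF formula, the cosine-similarity ranking, and the names `tokenize` and `summarize` are all assumptions made for illustration, the morphological analysis step is omitted, and the English sentences are placeholders rather than Afaan Oromo data.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumption: plain lowercase word tokens. A real system would apply
    # morphological analysis or stemming at this point.
    return re.findall(r"\w+", text.lower())


def summarize(sentences, query, rate=0.2):
    """Return the `rate` fraction of sentences most similar to the query."""
    docs = [tokenize(s) for s in sentences]
    n = len(docs)
    # Document frequency: number of sentences in which each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumed variant) so very common terms keep a small weight.
    idf = {t: math.log((1 + n) / (1 + c)) + 1 for t, c in df.items()}

    def vector(tokens):
        # TF-IDF vector; terms never seen in the document are ignored.
        tf = Counter(tokens)
        return {t: tf[t] * idf[t] for t in tf if t in idf}

    def cosine(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        na = math.sqrt(sum(w * w for w in a.values()))
        nb = math.sqrt(sum(w * w for w in b.values()))
        return dot / (na * nb) if na and nb else 0.0

    q = vector(tokenize(query))
    scores = [cosine(vector(doc), q) for doc in docs]
    # Keep at least one sentence, then restore original document order.
    k = max(1, round(rate * n))
    top = sorted(range(n), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]


# Placeholder document (hypothetical English sentences, one per list item).
SENTENCES = [
    "The river floods the valley every spring.",
    "Farmers plant coffee on the hillsides.",
    "Coffee exports support the regional economy.",
    "The market opens early in the morning.",
    "Children walk to school along the road.",
    "Rain is expected later this week.",
    "The coffee harvest begins in October.",
    "A new bridge was built last year.",
    "Teachers met to discuss the curriculum.",
    "The clinic offers free vaccinations.",
]

if __name__ == "__main__":
    for rate in (0.1, 0.2, 0.3):
        print(rate, summarize(SENTENCES, "coffee harvest", rate=rate))
```

Returning the selected sentences in their original order, rather than in score order, is a design choice intended to keep the extract readable as a short version of the document.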

