Abstract

News search engines are a class of search engines which professionally monitor the web news. These engines usually provide their contents through extraction of news feeds. But news feeds are not fully supported by all news sources, especially the Persian ones. Another way is indexing the content of news pages where the results are less adequately accurate due to the misrecognition of news structure. In this article we offer the architecture of a news search engine which extracts, archives structured news content and then performs complementary processes such as indexing and classifying of news which has been optimized for Persian language. Using the structured text of news, we reached higher precision in complementary processes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call