Abstract

In recent years, an increasing amount of information has appeared on the web. Extracting this information and converting it into a regular format has become significantly important work. After observing a number of web sites, we found that most useful information is contained in web sources that have a large number of similarly structured web documents. In this paper, we therefore present an approach for discovering useful information sources on the web and extracting information from them. We propose a method for discovering useful web information sources and a novel information extraction method. We also develop a prototype system, WIEAS (Web Information Extraction, Analysis and Services), to implement our idea, and use the information extracted by WIEAS to provide a variety of services.
