Abstract
The paper aims for the application of focused crawler in the petroleum news topic crawling, studies the related technologies of the focused crawler, and put forward a crawling engine strategy and review strategy on the petroleum news topics, adopt different extracting methods for different types of pages through web page classification, and design a corresponding link topic correlation calculating method for the crawling engine strategy; test and verify the above-mentioned crawling engine strategies through experiments, and the experimental results show that the strategy can greatly balance its accuracy and width for focused crawler.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.