Abstract

AbstractThe credibility evaluation of web pages is an important part of current network information mining. Generally, content‐based evaluation and link relationship‐based evaluation are used to detect web spam. This paper proposes an algorithm based on link relationship which adjusts the weight of the link object according to the credibility of the link object to calculate the credibility of the web page. Considering the different linking conditions, (i) low‐scoring web pages actively linking to high‐scoring web pages, resulting in the reduction of high‐scoring web pages score; (ii) in order to endorse the low‐score web page, high‐score web pages actively linking to the low‐score web pages, adjust the score transmission. Obviously, the proposed algorithm is a recursive model. To avoid the amplification of the error caused by the loop and different evaluation results caused by different initial nodes, this paper presents a novel method to adaptively adjust the weights of the links according to the link conditions, then proves the convergence of the algorithm. Finally, we validate the proposed algorithm on the public datasets, which is compared with other up‐to‐date algorithms under different experimental indexes. The results show that the proposed algorithm can effectively reduce the ranks of spam pages in all web pages and improve the detection efficiency of spam pages.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.