Abstract

An algebraic method is proposed for finding PageRank estimates for pages of websites. The amount of calculation in the proposed method does not depend on the value of the damping coefficient, which allows obtaining more accurate estimates of the rankings of PageRank in comparison with analogues. A distinctive feature of the proposed method is a step-by-step performance of calculations simultaneously with the work of the graph traversal algorithm. The comparative analysis of algorithms for traversing graphs has shown that, in contrast to the depth search algorithm, the breadth search algorithm gives a more orderly arranged matrix of transitions, which has the blockwise Hessenberg form. The use of this circumstance makes it possible to reduce significantly the amount of calculations by the proposed method. The resulting equations describing the proposed method have a block structure that allows efficient distribution of the entire volume of operations to parallel computational threads . Proceeding from the fact that the bulk of the calculations can be performed while the graph traversal algorithm is running, the study has determined the conditions under which the proposed method makes it possible to obtain PageRank values faster than the known iterative algorithms. The applicability area of the developed method is, first of all, its use in direct verification of the reliability of posting advertising materials on a relevant web resource; therefore, it is limited to specific Internet sites or segments with no more than 10 4 or 10 5 pages.

Highlights

  • The constantly increasing use of the Internet in all areas of human activity has made it one of the most important places for the development of the advertising business [1]

  • When the vertices of the graph are ordered in layers by the breadth-first search (BFS) algorithm, the transition matrix P will be of a lower blockwise Hessenberg matrix type [29]

  • We introduce the notation for the matrix beeing inverted, obtained at the k-th step of the operation of the BFS

Read more

Summary

Introduction

The constantly increasing use of the Internet in all areas of human activity has made it one of the most important places for the development of the advertising business [1]. As the structure of web resources dynamically changes, one of the primary tasks is to analyse the current structure of a site and the current ranks (importance) of its pages. This can help determine the effectiveness of advertisement posting in the steps. It is important to develop algorithms for determining the ranks of site pages, the main amount of which can be done in parallel with the work of the algorithm for analysing the structure of the site This will make it possible to use the calculator more fully and get the result faster. It is important to develop algorithms for finding PageRank estimates that differ from those known by the best ratio of the volume of necessary computations and the level of regularization

Literature review and problem statement
The aim and objectives of the study
Analysis of the features of graphs traversal algorithms
A81 A82 A83 A84 A85 A86 A87 A88
Development of a modified algorithm for estimating PageRank
Discussion of the developed algebraic method for calculating PageRank values
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call