Abstract

Over the last 30 years, the web has evolved from simple informational HTML pages into complex applications supporting business, television, newspapers, entertainment, and more. While there are many articles on website popularity, little work has been done on understanding the complexity of individual web pages. In this article, we present a measurement-driven study of the complexity of web pages today. We measured 426 866 web pages over about 12 weeks. Our study addresses two problems. The first was to describe the complexity of a web page with metrics based on the content it includes and the kind of service it offers. The second was to build probabilistic models of the observed distributions. Such models can be used in HTTP request generators that model the work of modern web systems. Separate models are proposed for each category of web pages and for all pages together.
