Abstract

In the Dark and Stormy Archives (DSA) project, we focus on storytelling techniques to summarize collections of archived web pages. Because a collection can have hundreds or even thousands of seeds (initial URLs), and each seed can be recrawled many times with each version maintained separately, techniques that include information about every member of the collection can be overwhelming. The premise of storytelling is to sample exemplar pages from the collection and present them in a social media interface familiar to users. We present Hypercane, the tool in the DSA suite responsible for selecting exemplar pages. Hypercane offers eight action statements that can be combined in various ways to customize the sample that is produced. Because of its modular design, Hypercane can also be used to analyze large web archive collections outside of the DSA suite.
