A Systematic Approach Towards Web Preservation

Muzammil Khan,Arif Ur Rahman

doi:10.6017/ital.v38i1.10181

Abstract

The main purpose of the article is to divide the web preservation process into small explicable stages and design a step-by-step web preservation process that leads to creating a well-organized web archive. A number of research articles are studied about web preservation projects and web archives, and designed a step-by-step systematic approach for web preservation. The proposed comprehensive web preservation process describes and combines strengths of different techniques observed during the study for preserving digital web contents into a digital web archive. For each web preservation step, different approaches and possible implementation techniques have been identified that can be adopted in digital archiving. The potential value of the proposed model is to guide the archivist, related personnel, and organizations to effectively preserved their intellectual digital contents for future use. Moreover, the model can help to initiate a web preservation process and create a well-organized web archive to efficiently manage the archived web contents. A section briefly describes the implementation of the proposed approach in a digital news stories preservation framework for archiving news published online from different sources.

Highlights

The amount of information generated by institutions is increasing with the passage of time
Though the World Wide Web (WWW) is a rapidly growing source of information, it is fragile in nature
According to the available statistics, 80 percent of pages become unavailable after one year and 13 percent of links in scholarly articles are broken after 27 months.[2]

Summary

Introduction

The amount of information generated by institutions is increasing with the passage of time. One of the mediums that uses this information is the World Wide Web (WWW). The WWW has become a tool to share information quickly with everyone regardless of their physical location. Google and Bing each index approximately 4.8 billion.[1]. Though the WWW is a rapidly growing source of information, it is fragile in nature. According to the available statistics, 80 percent of pages become unavailable after one year and 13 percent of links (mostly web references) in scholarly articles are broken after 27 months.[2] 11 percent of posts and comments on websites for various purposes are lost within a year

Objectives

Findings

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information Technology and Libraries	Publication Date: Mar 18, 2019
Citations: 10	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

A Systematic Approach Towards Web Preservation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information Technology and Libraries

Lead the way for us

Similar Papers

The Past Web: Exploring Web Archives
Amanda Greenwood
The American Archivist | VOL. 85
Amanda GreenwoodAmanda Greenwood
01 Sep 2022
The American Archivist | VOL. 85

Digital Resources: LLILAS Benson Latin American Studies and Collections, University of Texas at Austin
Kent Norsworthy
-
Kent NorsworthyKent Norsworthy
29 Sep 2016
29 Sep 2016

Climate change and web archives: an Ibero-American study based on the Portuguese and Brazilian contexts
Moisés Rockembach ... Anabela Serrano
Records Management Journal | VOL. 31
Moisés Rockembach, et. al.Moisés Rockembach ... Anabela Serrano
04 Oct 2021
Records Management Journal | VOL. 31

ВЕБ-АРХИВЫ В РЕКОНСТРУКЦИИ ИСТОРИИ ВИРТУАЛЬНЫХ МУЗЕЕВ: ПОТЕНЦИАЛ И ОГРАНИЧЕНИЯ
N G Povroznik
Вестник Пермского университета. История | VOL. -
N G PovroznikN G Povroznik
01 Jan 2020
Вестник Пермского университета. История | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Systematic Approach Towards Web Preservation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information Technology and Libraries