Abstract

The growing amount of data on the Internet has led to a situation in which it is essential to process these data to generate new services with the specific aim of improving people's daily living conditions. Transport data is of the utmost importance, since everyday people have to move around to perform some daily tasks, such as going to work, studying and shopping, and this means that the number of journeys by public transport grows daily. People with special needs make a large number of these trips, but they do not have sufficient information about the accessibility of the routes they want to take. Although there are numerous websites and applications that provide information on public transport services, most do not provide detailed information on the accessibility of the routes. We are, therefore, developing a technological framework for the processing, management, and exploitation of open data to promote accessibility to urban public transport. This is taking place within the framework of the Access@City project. This paper specifically focuses on the data extraction and processing of the existing information on the web concerning public transport and its accessibility for the generation of an open data repository in which to store this information. We, therefore, propose a method for the semi-automatic generation of a data scraper for the public transport domain. This method allows the extraction of public transport data and the existing accessibility information from a selected website. We have additionally developed a web tool that employs the aforementioned method to generate a data scraper for the public transport domain.

Highlights

  • The volume of data on the Internet has exploded in the last few years, during which time more data has appeared than in the entire previous history of the human race

  • This paper focuses on the first step, and on extracting public transport data from existing Internet sources using web scraping techniques

  • We pay particular attention to mobile applications, which could allow public transport users to obtain accessible routes between two points in a city in real time, or even combine different transport networks. These apps could translate the information concerning a smart city into an accessibility context, and the result would be the definition of an accessible city

Read more

Summary

INTRODUCTION

The volume of data on the Internet has exploded in the last few years, during which time more data has appeared than in the entire previous history of the human race. The first step in the process of developing this repository consists of extracting the data from public transport web sources The format of this information is, in most cases, unstructured, which is the main reason why extracting it from non-semantic data sources is difficult.

RELATED WORK
PUBLIC TRANSPORT SOURCE ANALYSIS
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.