Researchers and planners require ridership data to study factors that influence people’s choice to use transit. However, the data can be challenging to obtain directly from transit agencies. Crowdsourced big data platforms such as StreetLight promise easily accessible ridership-related data in standard formats. It is important to assess the reliability of these data, particularly for transit agencies serving small- to medium-sized cities, which are less likely than agencies in large cities to have ridership data in standard formats. In this study, hourly ridership data from 2019 were collected from four bus transit agencies and one rail agency in Virginia and compared with StreetLight data. Comparisons for rail data were made on a station-to-station basis. Bus data comparisons were made at the city-limit level and at an aggregated-route level for each agency. In sum, StreetLight could not provide 2019 bus activity data for more than half of the localities in Virginia. Comparisons between transit agency and StreetLight data showed smaller root mean square errors when longer periods were analyzed (e.g., 4 versus 2 months). Although order of magnitude of ridership may indicate whether StreetLight can provide bus activity data, the former was not found to be correlated with the accuracy of the latter. Using data from StreetLight’s current algorithm might not be appropriate without verification against agency data, especially for agencies in small- to medium-sized cities.
Read full abstract