Potential Data Sources Research Articles

India accommodates a huge diversity of plant and animal life across a variety of biomes. However, research, funding, and attention are distributed asymmetrically and focus largely on its charismatic vertebrates. Invertebrates, despite their megadiversity, are generally overlooked, with some exceptions (for example, lepidopterans). One species-rich group, spiders, exemplifies this knowledge gap. More than 1,800 species from 63 families have been reported in the country (Mondal et al. 2020), although the true number may be much higher, in part owing to the need for taxonomic revisions. Several spider systematics and biogeographic studies have pointed out that spatial distribution, seasonality, and natural history data are lacking from India. This lack of foundational biodiversity information has limited opportunities to share knowledge between researchers and community scientists. Mining occurrences from scientific publications or databases (in some cases behind paywalls) may require specialized training. This restricted access prevents more people from using the abundance of information. Social media posts showcasing photographs of species have increased significantly, with many users sharing wildlife photographs on sites such as Instagram®, Facebook®, and Flickr®. These observations on social media are a potential source of primary biodiversity data when curated and validated (Barman and Barve 2022, Barman et al. 2022a). Such data can enhance biodiversity monitoring by expanding spatial and temporal coverage and involving a global network of citizen scientists in conservation research (Barve et al. 2023, Roy 2022, Kulkarni 2023). The India Biodiversity Portal (IBP, Vattakaven et al. 2016) has been cataloguing species diversity and spider data through citizen science. Meanwhile, major social media sites report many more species sightings than citizen science platforms.
SpiderIndia, a popular Facebook group, has collected over 20,000 observations from 8,500 spider enthusiasts. However, these data remain unorganised and inaccessible to academic researchers and the public. To tackle these problems, we implemented a methodical process to enhance the occurrence data on spiders from well-known social media platforms, such as the SpiderIndia project. The procedure involved retrieving the relevant data and ingesting them through a custom pipeline that parses and extracts scientific names and spatial and temporal data to generate occurrence records on the IBP. The data were then presented on a curation interface, enabling taxonomic experts to verify the records, and the verified records were finally published on the Global Biodiversity Information Facility (GBIF) (Global Biodiversity Information Facility 2022, Barman et al. 2022b) to improve the records of spider species occurrence in India. This project showcases the capacity of citizen science via social media to involve citizen scientists in generating extensive datasets that make a significant contribution to scientific knowledge and improve our comprehension of invertebrate biodiversity. The final dataset encompasses over 15,000 observations, providing valuable insights into spider diversity and distribution across India (Fig. 1). These data are publicly available on GBIF, facilitating further research on Indian spider populations.
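
The parse-and-extract step described above can be illustrated with a minimal Python sketch. The abstract does not describe the pipeline's internals, so everything here is an assumption made for illustration: the genus checklist, the regular expressions, the `parse_post` function, and the use of the post date as a stand-in for the observation date are all hypothetical. Only the field names (`scientificName`, `eventDate`, `decimalLatitude`, `decimalLongitude`) follow Darwin Core terms, the vocabulary GBIF uses for occurrence records.

```python
import re
from dataclasses import dataclass
from datetime import date
from typing import Optional

# Hypothetical checklist of genera; a real pipeline would need a full
# taxonomic backbone rather than a hand-written list.
KNOWN_GENERA = ["Plexippus", "Nephila"]

# Match "Genus species" only when the genus is on the checklist, so that
# ordinary capitalised words are not mistaken for scientific names.
BINOMIAL = re.compile(r"\b(?:" + "|".join(KNOWN_GENERA) + r") [a-z]+\b")

# Assumed convention: coordinates written as "lat, lon" in decimal degrees.
COORDS = re.compile(r"(-?\d{1,2}\.\d+)\s*,\s*(-?\d{1,3}\.\d+)")


@dataclass
class Occurrence:
    scientificName: Optional[str]
    eventDate: str
    decimalLatitude: Optional[float]
    decimalLongitude: Optional[float]
    verified: bool = False  # set to True only after expert curation


def parse_post(text: str, posted_on: date) -> Occurrence:
    """Build an unverified occurrence record from one post's text."""
    name = BINOMIAL.search(text)
    coords = COORDS.search(text)
    return Occurrence(
        scientificName=name.group(0) if name else None,
        # Assumption: the post date approximates the observation date.
        eventDate=posted_on.isoformat(),
        decimalLatitude=float(coords.group(1)) if coords else None,
        decimalLongitude=float(coords.group(2)) if coords else None,
    )


post = "Plexippus paykulli on my balcony wall, Pune. 18.5204, 73.8567"
record = parse_post(post, date(2021, 7, 14))
print(record)
```

Records produced this way start out with `verified=False`, mirroring the workflow in which taxonomic experts confirm each record on a curation interface before publication. Fields that cannot be extracted are left as `None` rather than guessed.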

Human mobility data have been used as a potential novel data source to guide policies and response planning during the COVID-19 global pandemic. The COVID-19 Mobility Data Network (CMDN) facilitated the use of human mobility data around the world. Both researchers and policy makers assumed that mobility data would provide insights to help policy makers and response planners. However, whether human mobility data were operationally useful and provided added value for public health response planners remains largely unknown. This exploratory study focuses on advancing the understanding of the use of human mobility data during the early phase of the COVID-19 pandemic. The study explored how researchers and practitioners around the world used these data in response planning and policy making, focusing on data processing and on the human factors enabling or hindering use of the data. Our project was based on phenomenology and used an inductive approach to thematic analysis. Transcripts were open-coded to create the codebook, which was then applied by 2 team members who blind-coded all transcripts. Consensus coding was used to resolve coding discrepancies. Interviews were conducted with 45 individuals during the early period of the COVID-19 pandemic. Although some teams used mobility data for response planning, few were able to describe their uses in policy making, and there were no standardized ways that teams used mobility data. Mobility data played a larger role in providing situational awareness for government partners, helping them understand where people were moving in relation to the spread of COVID-19 variants and how people reacted to stay-at-home orders. Interviewees who felt they were more successful using mobility data often cited an individual who was able to answer general questions about mobility data; provide interactive feedback on results; and enable a 2-way communication exchange about data, meaning, value, and potential use.
Human mobility data were used as a novel data source in the COVID-19 pandemic by a network of academic researchers and practitioners using privacy-preserving and anonymized mobility data. This study reflects the processes involved in analyzing and communicating human mobility data, as well as how these data were used in response planning and how they were intended for use in policy making. The study reveals several valuable use cases. Ultimately, the role of a data translator was crucial in understanding the complexities of this novel data source. With this role, teams were able to adapt workflows, visualizations, and reports to align with end users and decision makers while communicating this information meaningfully to address the goals of responders and policy makers.

Articles published on Potential Data Sources

A Comprehensive Study of OpenTelemetry Collector: Architecture, Use Cases, and Performance

A novel data source for human-caused wildfires in China: extracting information from judgment documents

The Uncharted Waters of International Trade

Pharmacoequity measurement framework: A tool to reduce health disparities.

A Systematic Review of Features Forecasting Patient Arrival Numbers.

Citizen Science for Invertebrate Biodiversity: Mobilizing Spider Occurrence Data through Facebook in India

Too rare to dare? Leveraging household surveys to boost research on climate migration

Applications of Google Trends as a Data Source for Statistical Models

Comprehensive evaluation of satellite-based precipitation products at hourly scale in Beijing

Understanding the Use of Mobility Data in Disasters: Exploratory Qualitative Study of COVID-19 User Feedback.

An evaluation of the All of Us Research Program database to examine cumulative stress.

Automated analysis and assignment of maintenance work orders using natural language processing

Tell your story: Metrics of success for academic data science collaboration and consulting programs

The Retrieval of Ground NDVI (Normalized Difference Vegetation Index) Data Consistent with Remote-Sensing Observations

Background and Rationale - CDC Guidance for Communities Assessing, Investigating, and Responding to Suicide Clusters, United States, 2024.

CDC Guidance for Community Response to Suicide Clusters, United States, 2024.

A dataset of global tropical cyclone wind and surface wave measurements from buoy and satellite platforms

Enhanced Heart Disease Prediction Using Machine Learning Techniques

A new electronic medical record database linked to claims data and discharge abstract data (the RWD database) in Japan: Study design and profile.

Food and nutrition surveillance actions in Brazil and Portugal: a comparative documentary analysis
