Where can virtual programming labs data sets be downloaded?

Answer from top 10 papers

Virtual programming labs data sets can be downloaded from various sources depending on the specific field and type of data required. For instance, in the biomedical field, large and complex data sets are often shared via public data repositories, which can be accessed and visualized through web browsers, facilitating interaction with the data (Huttner et al., 2020). In the context of digital health research, the Shareable Data Publishing and Access Service for Living Labs allows external researchers to download anonymised or synthetic data versions for local development and testing, while also providing a mechanism for remote execution against real internal data (Skluzacek et al., 2016).
In the realm of geospatial data, the National Snow and Ice Data Center has utilized Virtual Globes to display scientific data, with the production of Keyhole Markup Language (KML) files that are interoperable with various mapping software packages (Hernandez et al., 2024). Additionally, Klimatic provides a system for discovering and integrating geospatial data, offering an expansive virtual data lake from which users can request and download data sets that meet specific criteria (Fine et al., 2020).
In summary, virtual programming labs data sets can be sourced from specialized public data repositories, services like the Shareable Data Publishing and Access Service for Living Labs, and systems like Klimatic that facilitate the discovery and integration of geospatial data. These platforms provide access to a wide range of data sets that can be used for various research and educational purposes, enhancing the accessibility and utility of virtual labs (Fine et al., 2020; Hernandez et al., 2024; Huttner et al., 2020; Skluzacek et al., 2016).

Source Papers

Klimatic: a virtual data lake for harvesting and distribution of geospatial data

Many interesting geospatial datasets are publicly accessible on web sites and other online repositories. However, the sheer number of datasets and locations, plus a lack of support for cross-repository search, makes it difficult for researchers to discover and integrate relevant data. We describe here early results from a system, Klimatic, that aims to overcome these barriers to discovery and use by automating the tasks of crawling, indexing, integrating, and distributing geospatial data. Klimatic implements a scalable crawling and processing architecture that uses an elastic container-based model to locate and retrieve relevant datasets and to extract metadata from headers and within files to build a global index of known geospatial data. In so doing, we create an expansive geospatial virtual data lake that records the location, formats, and other characteristics of large numbers of geospatial datasets while also caching popular data subsets for rapid access. A flexible query interface allows users to request data that satisfy supplied type, spatial, temporal, and provider specifications; in processing such queries, the system uses interpolation and aggregation to combine data of different types, data formats, resolutions, and bounds. Klimatic has so far incorporated more than 10,000 datasets from over 120 sources and has been demonstrated to scale well with data size and query complexity.

A Secure Data Publishing and Access Service for Sensitive Data from Living Labs: Enabling Collaboration with External Researchers via Shareable Data

Intending to enable a broader collaboration with the scientific community while maintaining privacy of the data stored and generated in Living Labs, this paper presents the Shareable Data Publishing and Access Service for Living Labs, implemented within the framework of the H2020 VITALISE project. Building upon previous work, significant enhancements and improvements are presented in the architecture enabling Living Labs to securely publish collected data in an internal and isolated node for external use. External researchers can access a portal to discover and download shareable data versions (anonymised or synthetic data) derived from the data stored across different Living Labs that they can use to develop, test, and debug their processing scripts locally, adhering to legal and ethical data handling practices. Subsequently, they may request remote execution of the same algorithms against the real internal data in Living Lab nodes, comparing the outcomes with those obtained using shareable data. The paper details the architecture, data flows, technical details and validation of the service with real-world usage examples, demonstrating its efficacy in promoting data-driven research in digital health while preserving privacy. The presented service can be used as an intermediary between Living Labs and external researchers for secure data exchange and to accelerate research on data analytics paradigms in digital health, ensuring compliance with data protection laws.

Open Access
633-P: Optimizing Workflow for Remote Diabetes Data Download: University of Pittsburgh Experience

Background: As CMS started to reimburse videovisits during public health emergency, virtual diabetes care became routine for diabetes clinics. Since better diabetes care depends on review of data, at UPMC, we created a workflow to facilitate diabetes data download before virtual visits. Best practices for virtual diabetes data download is not known. Objective: To analyze the diabetes technology data download workflow efficiency in our clinic for virtual visits. Methods: At University of Pittsburgh endocrine clinic, staff reaches out to patients with diabetes who are scheduled for videovisits one week before their appointment with instructions on how to download glucometer, insulin pump and/or CGM data. We completed a retrospective chart review to understand the effectiveness of this workflow in obtaining data before visits. Results: August 2020 to January 2020, our clinic staff reached out to 1197 patients with diabetes via secure portal before their videovisits to obtain diabetes data download. Patients used Tidepool software to download their pumps and glucometers or cloud technology for CGM data sharing. 164(14%) patients were on insulin pump(with or without CGM), 159(13%) patients had a CGM alone and 874(73%) patients had a glucometer alone. 190(15%) patients uploaded their data to Tidepool, 150 (12%) patients shared their CGM data via cloud using their cell phone apps.873(73%) patients did not download their blood sugar data before appointment, most of these patients had glucometer only. The barriers to sharing data virtually were reported by 104 patients. 25/104 (25%) of patients had no access to a computer. 6 patients (5%) reported having a glucometer that does not have compatibility with Tidepool software. 35/104 (32%) of patients reported not having cable to download their data. Conclusion: The most common barrier for patients to share their diabetes technology data was lack of cables. Diabetes clinics should analyze their remote diabetes data workflows to troubleshoot barriers. Disclosure E. Karslioglu-french: None. D. Sistla: None. A. C. Meyer: None.

Instruction in 802.11 Technology in Online Virtual Labs

Contribution: A novel approach to remote instruction in 802.11 technology is described using the virtual lab technology. Background: Lab-based education has been a staple of computing education for decades. By interacting with the technology, students are able to gain a much greater understanding of the subject through hands-on activities. Recently, virtual labs have provided a mechanism to allow both co-located and online students access to these lab environments without the time, space, and monetary constraints of traditional labs. Computer networking and security is one area where virtual labs provide a highly useful testbed for learning security concepts. One problem is implementing virtual educational labs pertaining to 802.11 technologies given the inherently physical location-based nature of the wireless medium. Intended Outcomes: This article builds on work by others by developing and testing a new hybrid physical/virtual lab to enable students to learn concepts pertaining to 802.11 networking technologies remotely in a virtual lab setting. Application Design: The virtual lab environment uses the ESXi virtualization technology coupled with arrays of USB 802.11 adapters to enable the use of 802.11 technologies remotely. Findings: The experimental results show the proposed system to be as effective as a traditional physical lab environment with regards to usefulness, ease of use, satisfaction, self-efficacy, and exercise completion time.