Make Data Access Research Articles

BackgroundMost people receive most of their health care in in Australia in primary care, yet researchers and policymakers have limited access to resulting clinical data. Widening access to primary care data and linking it with hospital or other data can contribute to research informing policy and provision of services and care; however, limitations of primary care data and barriers to access curtail its use. The Australian Health Research Alliance (AHRA) is seeking to build capacity in data-driven healthcare improvement; this study formed part of its workplan.MethodsThe study aimed to build capacity for data driven healthcare improvement through identifying primary care datasets in Australia available for secondary use and understand data quality frameworks being applied to them, and factors affecting national capacity for secondary use of primary care data from the perspectives of data custodians and users. Purposive and snowball sampling were used to disseminate a questionnaire and respondents were invited to contribute additional information via semi-structured interviews.ResultsSixty-two respondents collectively named 106 datasets from eclectic sources, indicating a broad conceptualisation of what a primary care dataset available for secondary use is. The datasets were generated from multiple clinical software systems, using different data extraction tools, resulting in non-standardised data structures. Use of non-standard data quality frameworks were described by two-thirds of data custodians. Building trust between citizens, clinicians, third party data custodians and data end-users was considered by many to be a key enabler to improve primary care data quality and efficiencies related to secondary use. Trust building qualities included meaningful stakeholder engagement, transparency, strong leadership, shared vision, robust data security and data privacy protection. Resources to improve capacity for primary care data access and use were sought for data collection tool improvements, workforce upskilling and education, incentivising data collection and making data access more affordable.ConclusionsThe large number of identified Australian primary care related datasets suggests duplication of labour related to data collection, preparation and utilisation. Benefits of secondary use of primary care data were many, and strong national leadership is required to reach consensus on how to address limitations and barriers, for example accreditation of EMR clinical software systems and the adoption of agreed data and quality standards at all stages of the clinical and research data-use lifecycle. The study informed the workplan of AHRA’s Transformational Data Collaboration to improve partner engagement and use of clinical data for research.

The upstream industry’s pervasive struggle to account for and make use of its data has become almost cliché. But it is a reality—even though the industry is now multiple years into its digitalization phase. Among the many entrepreneurs and researchers coming up with solutions, two entities have leveraged data donations from big operators in an effort to make data access as quick and easy as a Google search. A big operator, which may have interests in basins all over the world, adds terabytes upon terabytes of data from its wells each day to the generations of data it has already accumulated over decades. These large, disparate sets of information come both structured and unstructured from a variety of sources. Interpretations of those records can vary depending on the terminology used by the faceless person or program that put them together. Externally, a large portion of operator data is still stowed in a disorganized manner on servers owned by multiple electronic drilling recorder providers, observed Pradeep Ashok, senior research scientist in the drilling and rig automation group at the University of Texas’s Hildebrand Department of Petroleum and Geosystems Engineering, one of the entities tackling this problem. Operators can download the data, view the data through a web interface, and perform analysis locally. But the more ideal option would be to have all the data readily accessible in a data store within the company. Internally, in many cases, data sit in different silos within a company, spread across different physical locations. And the people who work with that data are not static entities: When the downturn hit a few years ago and layoffs occurred, gobs of data were left stranded. Opera-tors are still trying to account for that lost information. Unless companies implement “a system that can deal with the volume of data created every day, they will continue to be challenged,” said Frank Perez, chief executive officer of Sfile, a software firm that has set out to help operators make the most of their data. Perez noted an instance when an operator asked Sfile to train its system to identify and collect all of the company’s hydraulic fracturing pump curves and transform them into a normalized data feature set. The operator initially estimated that it had 10,000 pump curves, but, after Sfile got a hold of its data, it determined that the company actually had 65,000. “We were able to facilitate a massive database of pressure information that was used to basically build a new reservoir model,” he said. “It was just something that they didn’t even expect they had.” It is a common trend observed by Perez: Companies have way more data than they realize, and the possibilities that come with leveraging that data are endless. But those companies first need to be able to harness the data they know they have.

Make Data Access Research Articles

Related Topics

Articles published on Make Data Access

A survey on semantic data management as intersection of ontology-based data access, semantic modeling and data lakes

A customizable secure DIY web application for accessing, sharing, and browsing aggregate experimental results and metadata.

Fake News in Virtual Community, Virtual Society, and Metaverse: A Survey

CDS-DB, an omnibus for patient-derived gene expression signatures induced by cancer treatment.

Randomly-based Stepwise Multi-Level Distributed Medical Image Steganography

An Attribute-Based Keyword Search Scheme for Multiple Data Owners in Cloud-Assisted Industrial Internet of Things

A Context-Aware Empowering Business with AI: Case of Chatbots in Business Intelligence Systems

Identifying primary care datasets and perspectives on their secondary use: a survey of Australian data users and custodians

Multi-Authority Criteria-Based Encryption Scheme for IoT

Social-minded Measures of Data Quality

Lessons Learned from the NOAA CoastWatch Ocean Satellite Course Developed for Integrating Oceanographic Satellite Data into Operational Use

Souped-Up Search Engines Wrangle Drilling, Completions Data

Evaluating Surveillance for Excessive Alcohol Use in New Mexico.

Frequent item set mining using normalized FP-growth algorithm

ProtozoaDB 2.0: A Trypanosoma Brucei Case Study.

Types from data: making structured data first-class citizens in F#

Utilizing cloud storage architecture for long-pulse fusion experiment data storage

Adoption of health information technologies: the case of a wireless monitor for diabetes and obesity patients

Using Data Accessibility for Resource Selection in Large-Scale Distributed Systems

Blue: a database for high-fold γ-ray coincidence data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Make Data Access Research Articles

Related Topics

Articles published on Make Data Access

A survey on semantic data management as intersection of ontology-based data access, semantic modeling and data lakes

A customizable secure DIY web application for accessing, sharing, and browsing aggregate experimental results and metadata.

Fake News in Virtual Community, Virtual Society, and Metaverse: A Survey

CDS-DB, an omnibus for patient-derived gene expression signatures induced by cancer treatment.

Randomly-based Stepwise Multi-Level Distributed Medical Image Steganography

An Attribute-Based Keyword Search Scheme for Multiple Data Owners in Cloud-Assisted Industrial Internet of Things

A Context-Aware Empowering Business with AI: Case of Chatbots in Business Intelligence Systems

Identifying primary care datasets and perspectives on their secondary use: a survey of Australian data users and custodians

Multi-Authority Criteria-Based Encryption Scheme for IoT

Social-minded Measures of Data Quality

Lessons Learned from the NOAA CoastWatch Ocean Satellite Course Developed for Integrating Oceanographic Satellite Data into Operational Use

Souped-Up Search Engines Wrangle Drilling, Completions Data

Evaluating Surveillance for Excessive Alcohol Use in New Mexico.

Frequent item set mining using normalized FP-growth algorithm

ProtozoaDB 2.0: A Trypanosoma Brucei Case Study.

Types from data: making structured data first-class citizens in F#

Utilizing cloud storage architecture for long-pulse fusion experiment data storage

Adoption of health information technologies: the case of a wireless monitor for diabetes and obesity patients

Using Data Accessibility for Resource Selection in Large-Scale Distributed Systems

Blue: a database for high-fold γ-ray coincidence data