Data Lifecycle Model Research Articles

Poor lifestyle leads potentially to chronic diseases and low-grade physical and mental fitness. However, ahead of time, we can measure and analyze multiple aspects of physical and mental health, such as body parameters, health risk factors, degrees of motivation, and the overall willingness to change the current lifestyle. In conjunction with data representing human brain activity, we can obtain and identify human health problems resulting from a long-term lifestyle more precisely and, where appropriate, improve the quality and length of human life. Currently, brain and physical health-related data are not commonly collected and evaluated together. However, doing that is supposed to be an interesting and viable concept, especially when followed by a more detailed definition and description of their whole processing lifecycle. Moreover, when best practices are used to store, annotate, analyze, and evaluate such data collections, the necessary infrastructure development and more intense cooperation among scientific teams and laboratories are facilitated. This approach also improves the reproducibility of experimental work. As a result, large collections of physical and brain health-related data could provide a robust basis for better interpretation of a person's overall health. This work aims to overview and reflect some best practices used within global communities to ensure the reproducibility of experiments, collected datasets and related workflows. These best practices concern, e.g., data lifecycle models, FAIR principles, and definitions and implementations of terminologies and ontologies. Then, an example of how an automated workflow system could be created to support the collection, annotation, storage, analysis, and publication of findings is shown. The Body in Numbers pilot system, also utilizing software engineering best practices, was developed to implement the concept of such an automated workflow system. It is unique just due to the combination of the processing and evaluation of physical and brain (electrophysiological) data. Its implementation is explored in greater detail, and opportunities to use the gained findings and results throughout various application domains are discussed.

Read full abstract

As science becomes more data-intensive and collaborative, researchers increasingly use larger and more complex data to answer research questions. The capacity of storage infrastructure, the increased sophistication and deployment of sensors, the ubiquitous availability of computer clusters, the development of new analysis techniques, and larger collaborations allow researchers to address grand societal challenges in a way that is unprecedented. In parallel, research data repositories have been built to host research data in response to the requirements of sponsors that research data be publicly available. Libraries are re-inventing themselves to respond to a growing demand to manage, store, curate and preserve the data produced in the course of publicly funded research. As librarians and data managers are developing the tools and knowledge they need to meet these new expectations, they inevitably encounter conversations around Big Data. This paper explores definitions of Big Data that have coalesced in the last decade around four commonly mentioned characteristics: volume, variety, velocity, and veracity. We highlight the issues associated with each characteristic, particularly their impact on data management and curation. We use the methodological framework of the data life cycle model, assessing two models developed in the context of Big Data projects and find them lacking. We propose a Big Data life cycle model that includes activities focused on Big Data and more closely integrates curation with the research life cycle. These activities include planning, acquiring, preparing, analyzing, preserving, and discovering, with describing the data and assuring quality being an integral part of each activity. We discuss the relationship between institutional data curation repositories and new long-term data resources associated with high performance computing centers, and reproducibility in computational science. We apply this model by mapping the four characteristics of Big Data outlined above to each of the activities in the model. This mapping produces a set of questions that practitioners should be asking in a Big Data project

Read full abstract

Data Lifecycle Model Research Articles

Related Topics

Articles published on Data Lifecycle Model

The ripple effect of dataset reuse: Contextualising the data lifecycle for machine learning data sets and social impact

Quality Assessment for Research Data Management in Research Projects

Secure Data Management Life Cycle for Government Big-Data Ecosystem: Design and Development Perspective

Workflow for health-related and brain data lifecycle.

Research on Personal Data Privacy Security in the Era of Big Data

How is Big Data Changing Economic Research Paradigms?

Process-driven quality improvement for scientific data based on information product map

Construction of Shipping Linked Data Lifecycle Model and Its Application in Semantic Navigation

Data Lifecycle Management in Precision Agriculture Supported by Information and Communication Technology

The scientific approach as a transparency enabler throughout the data life-cycle1

Towards an Enhanced Data- and Knowledge Management Capability: A Data Life Cycle Model Proposition for Integrated Vehicle Health Management

Research Data Management Practices: Synergies and Discords between Researchers and Institutions

Developing a Knowledge Management System for Integrated Vehicle Health Management Using a Data Life Cycle Model

A study of e-Research and its relation with research data life cycle: a literature perspective

Towards improving existing online social networks' privacy policies

The Evolution, Approval and Implementation of the U.S. Geological Survey Science Data Lifecycle Model

Revisiting the Data Lifecycle with Big Data Curation

Lifecycle models of data-centric systems and domains

Managing Organizational Data Resources

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Data Lifecycle Model Research Articles

Related Topics

Articles published on Data Lifecycle Model

The ripple effect of dataset reuse: Contextualising the data lifecycle for machine learning data sets and social impact

Quality Assessment for Research Data Management in Research Projects

Secure Data Management Life Cycle for Government Big-Data Ecosystem: Design and Development Perspective

Workflow for health-related and brain data lifecycle.

Research on Personal Data Privacy Security in the Era of Big Data

How is Big Data Changing Economic Research Paradigms?

Process-driven quality improvement for scientific data based on information product map

Construction of Shipping Linked Data Lifecycle Model and Its Application in Semantic Navigation

Data Lifecycle Management in Precision Agriculture Supported by Information and Communication Technology

The scientific approach as a transparency enabler throughout the data life-cycle1

Towards an Enhanced Data- and Knowledge Management Capability: A Data Life Cycle Model Proposition for Integrated Vehicle Health Management

Research Data Management Practices: Synergies and Discords between Researchers and Institutions

Developing a Knowledge Management System for Integrated Vehicle Health Management Using a Data Life Cycle Model

A study of e-Research and its relation with research data life cycle: a literature perspective

Towards improving existing online social networks' privacy policies

The Evolution, Approval and Implementation of the U.S. Geological Survey Science Data Lifecycle Model

Revisiting the Data Lifecycle with Big Data Curation

Lifecycle models of data-centric systems and domains

Managing Organizational Data Resources