Data hazards in synthetic biology.

Natalie R Zelenka, Jeff Nivala, Fabio Parmeggiani, Seeralan Sarvaharman, Zahraa S Abdallah, Lucia Marucci, Jasdeep S Ghataora, Kieren Sharma, Nina Di Cara, Thomas E Gorochowski

doi:10.1093/synbio/ysae010


Open Access

https://doi.org/10.1093/synbio/ysae010


Journal: Synthetic biology (Oxford, England)
Publication Date: Jun 21, 2024
License type: CC BY 4.0

Abstract

Data science is playing an increasingly important role in the design and analysis of engineered biology. This has been fueled by the development of high-throughput methods like massively parallel reporter assays, data-rich microscopy techniques, computational protein structure prediction and design, and the development of whole-cell models able to generate huge volumes of data. Although the ability to apply data-centric analyses in these contexts is appealing and increasingly simple to do, it comes with potential risks. For example, how might biases in the underlying data affect the validity of a result and what might the environmental impact of large-scale data analyses be? Here, we present a community-developed framework for assessing data hazards to help address these concerns and demonstrate its application to two synthetic biology case studies. We show the diversity of considerations that arise in common types of bioengineering projects and provide some guidelines and mitigating steps. Understanding potential issues and dangers when working with data and proactively addressing them will be essential for ensuring the appropriate use of emerging data-intensive AI methods and help increase the trustworthiness of their applications in synthetic biology.


