Abstract

Biopreservation and BiobankingVol. 21, No. 2 EditorialFree AccessReadiness for Artificial Intelligence in BiobankingGregory H. Grossman and Marianne K. HendersonGregory H. GrossmanAdvancing Sight Network, Birmingham, Alabama, USA.Search for more papers by this author and Marianne K. HendersonAddress correspondence to: Marianne K. Henderson, MS, CPC, National Cancer Institute, NIH, 9609 Medical Center Drive, 7E442, Rockville, MD 20850, USA E-mail Address: hendersm@mail.nih.govNational Cancer Institute, NIH, Bethesda, Maryland, USA.Search for more papers by this authorPublished Online:17 Apr 2023https://doi.org/10.1089/bio.2023.29121.editorialAboutSectionsPDF/EPUB Permissions & CitationsPermissionsDownload CitationsTrack CitationsAdd to favorites Back To Publication ShareShare onFacebookTwitterLinked InRedditEmail The recent launch of the language and text processing artificial intelligence (AI) platform ChatGPT from OpenAI has markedly raised interest in AI and signaled a new era of machine and deep learning. AI can be defined as an automated software system that can ingest, compile, and analyze data and domain knowledge to present informed, context-specific outputs.As contributors to the information management (data, systems, and networks) section of the International Society for Biological and Environmental Repositories (ISBER) Best Practices 5th edition, we have been following the advances and implementation of AI in health care, research, and biobanking with great anticipation. We also see the same excitement from the ISBER membership, as illustrated by the active discussions on the ISBER Open Forum.There are many different types and categories of deep and machine learning AI platforms based on their engineering and function. Although AI was described in computer science as early as the 1950s and the earliest NIH AI in Medicine conference was held in 1975,1 it is only recently that some of the technological limitations of its use have been overcome, including data availability, to allow it to be more widely applied. We routinely encounter AI-driven programs in our daily lives, although we may not be fully aware of their presence over the past few years.Examples range from chatbots to voice assistants such as Amazon's Alexa or Apple's Siri, to screening-based algorithms, such as those used in mammography, that detect patterns that may signal abnormalities to be more closely examined. Imaging algorithms benefit from learning through continued use, to refine performance. Recently, AI has become integrated more widely into medical and scientific practices. An AI program to screen for diabetic retinopathy by interpreting clinical images was successfully modeled in India and the United States,2 with screening implemented in a country-wide health system in Thailand.3Furthermore and importantly, some approved AI screening programs have matured enough that they are autonomous, meaning that human oversight of clinical diagnosis is not required.4 These examples highlight AI's unique ability to decipher hidden patterns in data, build predictive models, and deliver actionable recommendations.Much of our field anticipates that the inevitable implementation of AI in biobanking will have profound and positive impacts, with the overall outcome of increasing specimen/data utilization, greater repository sustainability, and increased speed of scientific discovery. AI has the potential to impact all areas of biobanking, from recruiting donors, improving biobank operations, developing educational and training materials for new biobankers, increasing sample use, improving data annotation, and faster deposition of ethically-protected data. Industries across the globe are looking toward AI to help optimize operations, and biobanking will be no different.As examples, we can imagine using natural language processors to help our biobanks and their resources to market more broadly to the scientific community; or interpreting and converting free text comments or voice notes into standard categorical data fields that can facilitate data interoperability. AI in biobanking could create the draft text for consenting, standard operating procedures or operational documentation, write analytical code for samples and data, and write scripts for data deposition. We can also see its impact on accelerating the availability and analysis of large data sets, such as in the case of whole genome sequencing.Although the beneficial aspects and opportunities hold much promise, integrating AI within the biobanking continuum poses challenges and risks. What looks to be the most salient risk of implementing AI in biobanking is not the malicious actions of a self-aware superintelligence portrayed in dystopian science fiction, but something more closely aligned to an inherent human trait—bias. Data bias in AI parallels the traditional computer science concept of “garbage in, garbage out,” in which inaccurate, erroneous, or absent input leads to false or “garbage” output data (including actionable reports or decisions). It is here that we can make the most impact at this time so that biobanking is “ready” for AI applications.The continued push for incorporating and utilizing principles such as Open Science, FAIR (Findability, Accessibility, Interoperability, and Reuse), or DwC (Darwin Core), data standards such as SPREC (Standard PREanalytical Code), and guidelines such as ISO 8000 (data quality), will help to ensure robust, high-quality, and ethical data.5–8 There is also a greater need for repositories to plan for the interoperability of their data and systems to be able to integrate with AI applications in the cloud, across networks, or on virtual platforms. Another form of bias that presents a challenge is algorithmic bias.AI algorithmic bias tends to highly reflect human bias, whereby the system delivers erroneous conclusions or predictions, without showing ambiguity or confidence limits in its delivery, which can be ethically problematic and ultimately damaging to groups of people or sensitive species. This threat underscores the need for AI programmers to develop and incorporate robust ethical guidelines/guardrails to protect from harm to environments, species, and populations. The unintended harm to populations and society was called out by an Open Letter to AI developers to take a pause in their training and release of more powerful AI systems than GPT-4.9Our biobanking subject matter expert role will be needed to influence the development of these guardrails as AI methods and algorithms evolve. Two additional pillars that must be forefront in AI development include the reliability that outputs provided can be trusted and that security measures from hacking and breaches of personal or sensitive data are implemented.This most exciting and challenging era of biobanking awaits our community. We need to educate ourselves on the possible uses and impacts of deep learning, machine learning, and natural language processing applications through engagements with AI experts. Through these engagements and the formation of biobanking workgroups, we will contribute to ethically-sound AI tools and practices for biobanking and data sharing. Perhaps the most exciting aspect of AI in biobanking is that this nascent technology will have emergent and exponential benefits, accelerating our impact on the discovery and implementation of new knowledge to protect the health of our populations and the environment.To kick off further discussions, we are developing a webinar global cast on AI's impact on biobanking as part of ISBER's Webinar Planning Task Force. We look forward to continuing this dialogue, through the active engagement of members of the biobanking community and the implementation of AI in biobanking in the very near future.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call