The changing landscape of text mining: a review of approaches for ecology and evolution.

Maxwell J Farrell, Nicolas Le Guillarme, Liam Brierley, Bronwen Hunter, Daan Scheepens, Anna Willoughby, Andrew Yates, Nicole Mideo

doi:10.1098/rspb.2024.0423

Abstract

In ecology and evolutionary biology, the synthesis and modelling of data from published literature are commonly used to generate insights and test theories across systems. However, the tasks of searching, screening, and extracting data from literature are often arduous. Researchers may manually process hundreds to thousands of articles for systematic reviews, meta-analyses, and compiling synthetic datasets. As relevant articles expand to tens or hundreds of thousands, computer-based approaches can increase the efficiency, transparency, and reproducibility of literature-based research. Methods available for text mining are rapidly changing owing to developments in machine learning-based language models. We review the growing landscape of approaches, mapping them onto three broad paradigms (frequency-based approaches, traditional Natural Language Processing, and deep learning-based language models). This serves as an entry point to learn foundational and cutting-edge concepts, vocabularies, and methods to foster integration of these tools into ecological and evolutionary research. We cover approaches for modelling ecological texts, generating training data, developing custom models, and interacting with large language models, and we discuss challenges and possible solutions to implementing these methods in ecology and evolution.
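
As a rough illustration of the first paradigm named above (frequency-based approaches), the sketch below weights terms in a handful of abstracts with plain TF-IDF using only the Python standard library. It is not taken from the paper: the three abstracts are invented, the tokenizer is deliberately crude, and the helper names `tokenize` and `tf_idf` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of letters (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf(docs):
    """Return one {term: weight} dict per document using unsmoothed TF-IDF."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: how many documents contain each term at least once.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # guard against empty documents
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical abstracts, invented for illustration only.
abstracts = [
    "Host parasite interactions shape parasite diversity in bats",
    "Plant pollinator networks respond to climate warming",
    "Climate warming alters host ranges of avian parasite lineages",
]

weights = tf_idf(abstracts)
for doc_weights in weights:
    # Show the three highest-weighted terms of each abstract.
    print(sorted(doc_weights, key=doc_weights.get, reverse=True)[:3])
```

Terms confined to a single abstract receive a larger inverse-document-frequency factor than terms shared across abstracts, which is the basic signal that frequency-based screening relies on. In practice a library implementation with stop-word handling and smoothing would usually be preferred over this hand-rolled version.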

Journal: Proceedings. Biological sciences
Publication Date: Jul 1, 2024
Citations: 1
License type: CC BY 4.0

Similar Papers

#2924 Comparison of large language models and traditional natural language processing techniques in predicting arteriovenous fistula failure
Suman Lama ... Luca Neri
Nephrology Dialysis Transplantation | VOL. 39
23 May 2024

Large language models outperform traditional natural language processing methods in extracting patient-reported outcomes in IBD.
Perseus V Patel ... Vivek Rudrapatna
medRxiv : the preprint server for health sciences
06 Sep 2024

Large Language Models Outperform Traditional Natural Language Processing Methods in Extracting Patient-Reported Outcomes in Inflammatory Bowel Disease
Perseus V Patel ... Vivek Rudrapatna
Gastro Hep Advances
01 Oct 2024

Automated anonymization of radiology reports: comparison of publicly available natural language processing and large language models.
Marcel C Langenbach ... Julius C Heemelaar
European radiology
31 Oct 2024
