Pre-defined Ontologies Research Articles

Traditional gene set enrichment analyses are typically limited to a few ontologies and do not account for the interdependence of gene sets or terms, resulting in overcorrected p-values. To address these challenges, we introduce mulea, an R package offering comprehensive overrepresentation and functional enrichment analysis. mulea employs a progressive empirical false discovery rate (eFDR) method, specifically designed for interconnected biological data, to accurately identify significant terms within diverse ontologies. mulea expands beyond traditional tools by incorporating a wide range of ontologies, encompassing Gene Ontology, pathways, regulatory elements, genomic locations, and protein domains. This flexibility enables researchers to tailor enrichment analysis to their specific questions, such as identifying enriched transcriptional regulators in gene expression data or overrepresented protein domains in protein sets. To facilitate seamless analysis, mulea provides gene sets (in standardised GMT format) for 27 model organisms, covering 22 ontology types from 16 databases and various identifiers resulting in almost 900 files. Additionally, the muleaData ExperimentData Bioconductor package simplifies access to these pre-defined ontologies. Finally, mulea's architecture allows for easy integration of user-defined ontologies, or GMT files from external sources (e.g., MSigDB or Enrichr), expanding its applicability across diverse research areas. mulea is distributed as a CRAN R package downloadable from https://cran.r-project.org/web/packages/mulea/ and https://github.com/ELTEbioinformatics/mulea. It offers researchers a powerful and flexible toolkit for functional enrichment analysis, addressing limitations of traditional tools with its progressive eFDR and by supporting a variety of ontologies. Overall, mulea fosters the exploration of diverse biological questions across various model organisms.

Read full abstract

Recent regenerative medicine studies have emphasized the need for increased standardization, harmonization and sharing of information related to stem cell product characterization, to help drive these innovative interventions toward public availability and to increase collaboration in the scientific community. Although numerous attempts and numerous databases have been made to manage these data, a platform that incorporates all the heterogeneous data collected from stem cell projects into a harmonized project-based framework is still lacking. The aim of the database, which is described in this study, is to provide an intelligent informatics solution that integrates comprehensive characterization of diverse stem cell product characteristics with research subject and project outcome information. In the resulting platform, heterogeneous data are validated using predefined ontologies and stored in a relational database, to ensure data quality and ease of access. Testing was performed using 51 published, publically available induced pluripotent stem cell projects conducted in clinical, preclinical and in-vitro evaluations. Future aims of this project include further increasing the database size to include all published stem cell trials and develop additional data visualization tools to improve usability. Our testing demonstrated the robustness of the proposed platform, by seamlessly harmonizing diverse common data elements, and the potential of this platform for driving knowledge generation from the aggregation and harmonization of these diverse data.Database URL https://remedy.mssm.edu/

Read full abstract

Pre-defined Ontologies Research Articles

Related Topics

Articles published on Pre-defined Ontologies

Mulea: An R package for enrichment analysis using multiple ontologies and empirical false discovery rate

Extraction, labelling, clustering, and semantic mapping of segments from clinical notes.

Intelligent Integrative Platform for Sharing Heterogenuous Stem Cell Research Data.

ReMeDy: a platform for integrating and sharing published stem cell research data with a focus on iPSC trials.

Introducing a Platform for Integrating and Sharing Stem Cell Research Data.

Building a New Semantic Social Network Using Semantic Web-Based Techniques

Towards Intelligent Integration and Sharing of Stem Cell Research Data.

What If Colorful Images Become More Important than Words? Visual Representations as the Basic Building Blocks of Human Communication and Dynamic Storytelling

The tissue microarray OWL schema: An open-source tool for sharing tissue microarray data

Text-based over-representation analysis of microarray gene lists with annotation bias

OntoVote: a scalable distributed vote-collecting mechanism for ontology drift on a P2P platform

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Pre-defined Ontologies Research Articles

Related Topics

Articles published on Pre-defined Ontologies

Mulea: An R package for enrichment analysis using multiple ontologies and empirical false discovery rate

Extraction, labelling, clustering, and semantic mapping of segments from clinical notes.

Intelligent Integrative Platform for Sharing Heterogenuous Stem Cell Research Data.

ReMeDy: a platform for integrating and sharing published stem cell research data with a focus on iPSC trials.

Introducing a Platform for Integrating and Sharing Stem Cell Research Data.

Building a New Semantic Social Network Using Semantic Web-Based Techniques

Towards Intelligent Integration and Sharing of Stem Cell Research Data.

What If Colorful Images Become More Important than Words? Visual Representations as the Basic Building Blocks of Human Communication and Dynamic Storytelling

The tissue microarray OWL schema: An open-source tool for sharing tissue microarray data

Text-based over-representation analysis of microarray gene lists with annotation bias

OntoVote: a scalable distributed vote-collecting mechanism for ontology drift on a P2P platform