- New
- Research Article
- 10.1515/jib-2025-0023
- Jan 8, 2026
- Journal of integrative bioinformatics
- Elena Ignatieva + 5 more
Appetite is an instinct that has been formed through evolution. Appetite promotes normal growth and development in humans. However, under conditions of food abundance, appetite can become excessive, posing significant health risks. In this study we have identified 80 human genes whose orthologs regulated food intake in model animal species. More than 80 % of these genes encode G-protein-coupled receptors and 29 % were found to be involved in developmental processes. Using phylostratigraphic age index (PAI), which specifies the evolutionary age of a gene, we found that this set of 80 genes contains an increased proportion of genes with the same phylostratigraphic age (PAI = 6, the stage of Vertebrata divergence) indicating the coordinated evolution of this group of genes. Using divergence index (DI), which indicates the type of selection to which the gene is subjected, we observed significant enrichment for genes with DI ≤ 0.25, i.e., those that are subject to strong stabilizing selection. The subgroup of genes having DI ≤ 0.25 included 45 genes and was enriched with genes that are associated with developmental processes. This finding supports the hypothesis that developmental disturbances generally impose strong constraints on viability due to purifying selection.
- New
- Research Article
- 10.1515/jib-2025-0047
- Dec 24, 2025
- Journal of integrative bioinformatics
- Gianfranco Frigerio
Metabolomics studies require complex data processing pipelines to ensure data quality and extract meaningful biological insights. GetFeatistics is an R-package developed to streamline the elaboration and statistical analysis of metabolomics data. For targeted analyses, the package enables calibration curve-based quantification with different data weighting options. For untargeted studies, it includes dedicated functions to import feature tables from tools like patRoon and MS-DIAL, assign annotation confidence levels, and filter features based on pooled quality control (QC) criteria, including options for group-specific pooled QCs. The package also provides functions for univariate and multivariate statistical analyses, notably streamlined regression modelling with fixed effects, mixed-effects models for longitudinal data, and Tobit regression for censoring values exceeding the limits of detection. Output tables are concise and informative, facilitating interpretation and reporting, while output visualisations are fully customisable via the ggplot grammar. Additional functionalities include automated retrieval of chemical properties from PubChem, ontology classification via ClassyFire, and pathway enrichment analysis using the FELLA package. GetFeatistics is publicly available on GitHub, with comprehensive documentation and a step-by-step vignette. By integrating key steps of the metabolomics workflow, the package aims to facilitate both exploratory studies and large-scale epidemiological applications in metabolomics research.
- Front Matter
- 10.1515/jib-2025-frontmatter2
- Oct 30, 2025
- Journal of Integrative Bioinformatics
- Front Matter
- 10.1515/jib-2025-frontmatter1
- Aug 6, 2025
- Journal of Integrative Bioinformatics
- Research Article
- 10.1515/jib-2024-0052
- Jul 14, 2025
- Journal of Integrative Bioinformatics
- Rawan Gedeon + 1 more
This study investigates the application of deep learning techniques for segmenting glands in histopathological images of colorectal cancer. We trained two convolutional neural network models, U-Net and DCAN, on a combination of the GlaS and CRAG datasets to enhance generalization across diverse histological appearances, selecting DCAN for its superior accuracy in delineating gland boundaries. The goal was to achieve robust gland segmentation applicable to whole slide images (WSIs) from The Cancer Genome Atlas (TCGA). Using the segmented glands, we extracted patient-level morphological features and used them to predict survival outcomes. A Cox proportional hazards model was trained on these features and achieved a high concordance index, indicating strong predictive performance. Patients were then stratified into high- and low-risk groups, with significant differences in survival distributions (log-rank p-value: 0.01317). In addition, we benchmarked our models against state-of-the-art gland segmentation methods on GlaS and CRAG, highlighting the trade-off between domain-specific accuracy and cross-dataset robustness.
- Front Matter
- 10.1515/jib-2025-0034
- Jul 9, 2025
- Journal of Integrative Bioinformatics
- Ralf Hofestädt
- Research Article
- 10.1515/jib-2025-0007
- Jul 1, 2025
- Journal of Integrative Bioinformatics
- Falk Schreiber + 7 more
Sustainable software development requires the software to remain accessible and maintainable over long time. This is particularly challenging in a scientific context. For example, fewer than one third of tools and platforms for biological network representation, analysis, and visualisation have been available and supported over a period of 15 years. One of those tools is Vanted, which has been developed and actively supported over the past 20 years. In this work, we discuss sustainable software development in science and investigate which software tools for biological network representation, analysis, and visualisation are maintained over a period of at least 15 years. With Vanted as a case study, we highlight five key insights that we consider crucial for sustainable, long-term software development and software maintenance in science.
- Research Article
- 10.1515/jib-2024-0062
- Jun 24, 2025
- Journal of integrative bioinformatics
- Igor Goryanin + 6 more
This study investigates the gut microbiome and metabolome of asthma patients treated with inhaled corticosteroids (ICS), some of whom experience adverse side effects. We analyzed stool samples from 24 participants, divided into three cohorts: asthma patients with side effects, those without, and healthy controls. Using next-generation sequencing and LC-MS/MS metabolomics, we identified significant differences in bacterial species and metabolites. Multi-Omics Factor Analysis (MOFA) and Global Sensitivity Analysis-Partial Rank Correlation Coefficient (GSA-PRCC) provided insights into key contributors to side effects, such as tryptophan depletion and altered linolenate and glucose-1-phosphate levels. The study proposes dietary or probiotic interventions to mitigate side effects. Despite the limited sample size, these findings provide a basis for personalized asthma management approaches. Further studies are required to confirm initial fundings.
- Research Article
- 10.1515/jib-2024-0037
- Jun 23, 2025
- Journal of Integrative Bioinformatics
- Aruana F F Hansel-Fröse + 4 more
Stem cells are capable of self-renewal and differentiation into various cell types, showing significant potential for cellular therapies and regenerative medicine, particularly in cardiovascular diseases. The differentiation to cardiomyocytes replicates the embryonic heart development, potentially supporting cardiac regeneration. Cardiomyogenesis is controlled by complex post-transcriptional regulation that affects the construction of gene regulatory networks (GRNs), such as: alternative polyadenylation (APA), length changes in untranslated regulatory regions (3′UTRs), and microRNA (miRNA) regulation. To deepen our understanding of the cardiomyogenesis process, we have modeled a GRN for each day of cardiomyocyte differentiation. Then, each GRN was automatically transformed by four transformation rules to a Petri net and simulated using the software VANESA. The Petri nets highlighted the relationship between genes and alternative isoforms, emphasizing the inhibition of miRNA on APA isoforms with varying 3′UTR lengths. Moreover, in silico simulation of miRNA knockout enabled the visualization of the consequential effects on isoform expression. Our Petri net models provide a resourceful tool and holistic perspective to investigate the functional orchestra of transcript regulation that differentiate hESCs to cardiomyocytes. Additionally, the models can be adapted to investigate post-transcriptional GRN in other biological contexts.
- Research Article
- 10.1515/jib-2024-0047
- Jun 23, 2025
- Journal of Integrative Bioinformatics
- Pablo Enrique Guillem + 7 more
The rapid advancement of Next-Generation Sequencing (NGS) technologies has revolutionized the field of genomics, producing large volumes of data that necessitate sophisticated analytical techniques. This paper introduces a Deep Learning model designed to predict the pathogenicity of genetic variants, a vital component in advancing personalized medicine. The model is trained on a dataset derived from the analysis of NGS outputs, containing a combination of well-defined and ambiguous genetic variants. By employing a semi-supervised learning approach, the model efficiently utilizes both confidently labeled and less certain data. At the core of the methodology is the Feature Tokenizer Transformer architecture, which processes both numerical and categorical genomic information. The preprocessing pipeline includes key steps such as data imputation, scaling, and encoding to ensure high data quality. The results highlight the model’s impressive accuracy, particularly in detecting confidently labeled variants, while also addressing the impact of its predictions on less certain (soft-labeled) data.