Year Year arrow
arrow-active-down-0
Publisher Publisher arrow
arrow-active-down-1
Journal
1
Journal arrow
arrow-active-down-2
Institution Institution arrow
arrow-active-down-3
Institution Country Institution Country arrow
arrow-active-down-4
Publication Type Publication Type arrow
arrow-active-down-5
Field Of Study Field Of Study arrow
arrow-active-down-6
Topics Topics arrow
arrow-active-down-7
Open Access Open Access arrow
arrow-active-down-8
Language Language arrow
arrow-active-down-9
Filter Icon Filter 1
Year Year arrow
arrow-active-down-0
Publisher Publisher arrow
arrow-active-down-1
Journal
1
Journal arrow
arrow-active-down-2
Institution Institution arrow
arrow-active-down-3
Institution Country Institution Country arrow
arrow-active-down-4
Publication Type Publication Type arrow
arrow-active-down-5
Field Of Study Field Of Study arrow
arrow-active-down-6
Topics Topics arrow
arrow-active-down-7
Open Access Open Access arrow
arrow-active-down-8
Language Language arrow
arrow-active-down-9
Filter Icon Filter 1
Export
Sort by: Relevance
  • Open Access Icon
  • PDF Download Icon
  • Addendum
  • 10.1186/s12918-019-0708-9
Correction to: A quantitative systems pharmacology (QSP) model for Pneumocystis treatment in mice
  • Aug 12, 2019
  • BMC Systems Biology
  • Guan-Sheng Liu + 6 more

It was highlighted that the original article [1] contained errors in the figures and their legends and by extension the in-text figure citations. This Corrections article shows the correct figures and correct figure legends.

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 17
  • 10.1186/s12918-019-0691-1
Network-based characterization of drug-protein interaction signatures with a space-efficient approach
  • Apr 1, 2019
  • BMC Systems Biology
  • Yasuo Tabei + 3 more

BackgroundCharacterization of drug-protein interaction networks with biological features has recently become challenging in recent pharmaceutical science toward a better understanding of polypharmacology.ResultsWe present a novel method for systematic analyses of the underlying features characteristic of drug-protein interaction networks, which we call “drug-protein interaction signatures” from the integration of large-scale heterogeneous data of drugs and proteins. We develop a new efficient algorithm for extracting informative drug-protein interaction signatures from the integration of large-scale heterogeneous data of drugs and proteins, which is made possible by space-efficient representations for fingerprints of drug-protein pairs and sparsity-induced classifiers.ConclusionsOur method infers a set of drug-protein interaction signatures consisting of the associations between drug chemical substructures, adverse drug reactions, protein domains, biological pathways, and pathway modules. We argue the these signatures are biologically meaningful and useful for predicting unknown drug-protein interactions and are expected to contribute to rational drug design.

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 19
  • 10.1186/s12918-019-0699-6
A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data
  • Apr 1, 2019
  • BMC Systems Biology
  • Shiquan Sun + 3 more

BackgroundSingle-cell RNA sequencing (scRNAseq) data always involves various unwanted variables, which would be able to mask the true signal to identify cell-types. More efficient way of dealing with this issue is to extract low dimension information from high dimensional gene expression data to represent cell-type structure. In the past two years, several powerful matrix factorization tools were developed for scRNAseq data, such as NMF, ZIFA, pCMF and ZINB-WaVE. But the existing approaches either are unable to directly model the raw count of scRNAseq data or are really time-consuming when handling a large number of cells (e.g. n>500).ResultsIn this paper, we developed a fast and efficient count-based matrix factorization method (single-cell negative binomial matrix factorization, scNBMF) based on the TensorFlow framework to infer the low dimensional structure of cell types. To make our method scalable, we conducted a series of experiments on three public scRNAseq data sets, brain, embryonic stem, and pancreatic islet. The experimental results show that scNBMF is more powerful to detect cell types and 10 - 100 folds faster than the scRNAseq bespoke tools.ConclusionsIn this paper, we proposed a fast and efficient count-based matrix factorization method, scNBMF, which is more powerful for detecting cell type purposes. A series of experiments were performed on three public scRNAseq data sets. The results show that scNBMF is a more powerful tool in large-scale scRNAseq data analysis. scNBMF was implemented in R and Python, and the source code are freely available at https://github.com/sqsun.

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 13
  • 10.1186/s12918-019-0695-x
Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks
  • Apr 1, 2019
  • BMC systems biology
  • Wenting Liu + 1 more

BackgroundSystematic fusion of multiple data sources for Gene Regulatory Networks (GRN) inference remains a key challenge in systems biology. We incorporate information from protein-protein interaction networks (PPIN) into the process of GRN inference from gene expression (GE) data. However, existing PPIN remain sparse and transitive protein interactions can help predict missing protein interactions. We therefore propose a systematic probabilistic framework on fusing GE data and transitive protein interaction data to coherently build GRN.ResultsWe use a Gaussian Mixture Model (GMM) to soft-cluster GE data, allowing overlapping cluster memberships. Next, a heuristic method is proposed to extend sparse PPIN by incorporating transitive linkages. We then propose a novel way to score extended protein interactions by combining topological properties of PPIN and correlations of GE. Following this, GE data and extended PPIN are fused using a Gaussian Hidden Markov Model (GHMM) in order to identify gene regulatory pathways and refine interaction scores that are then used to constrain the GRN structure. We employ a Bayesian Gaussian Mixture (BGM) model to refine the GRN derived from GE data by using the structural priors derived from GHMM. Experiments on real yeast regulatory networks demonstrate both the feasibility of the extended PPIN in predicting transitive protein interactions and its effectiveness on improving the coverage and accuracy the proposed method of fusing PPIN and GE to build GRN.ConclusionThe GE and PPIN fusion model outperforms both the state-of-the-art single data source models (CLR, GENIE3, TIGRESS) as well as existing fusion models under various constraints.

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 20
  • 10.1186/s12918-019-0698-7
Anti-TNF- \u03b1treatment-related pathways and biomarkers revealed by transcriptome analysis in Chinese psoriasis patients
  • Apr 1, 2019
  • BMC Systems Biology
  • Lunfei Liu + 9 more

BackgroundAnti-tumor necrosis factor alpha (TNF- α) therapy has made a significant impact on treating psoriasis. Despite these agents being designed to block TNF- α activity, their mechanism of action in the remission of psoriasis is still not fully understood at the molecular level.ResultsTo better understand the molecular mechanisms of Anti-TNF- α therapy, we analysed the global gene expression profile (using mRNA microarray) in peripheral blood mononuclear cells (PBMCs) that were collected from 6 psoriasis patients before and 12 weeks after the treatment of etanercept. First, we identified 176 differentially expressed genes (DEGs) before and after treatment by using paired t-test. Then, we constructed the gene co-expression modules by weighted correlation network analysis (WGCNA), and 22 co-expression modules were found to be significantly correlated with treatment response. Of these 176 DEGs, 79 DEGs (M_DEGs) were the members of these 22 co-expression modules. Of the 287 GO functional processes and pathways that were enriched for these 79 M_DEGs, we identified 30 pathways whose overall gene expression activities were significantly correlated with treatment response. Of the original 176 DEGs, 19 (GO_DEGs) were found to be the members of these 30 pathways, whose expression profiles showed clear discrimination before and after treatment. As expected, of the biological processes and functionalities implicated by these 30 treatment response-related pathways, the inflammation and immune response was the top pathway in response to etanercept treatment, and some known TNF- α related pathways, such as molting cycle process, hair cycle process, skin epidermis development, regulation of hair follicle development, were implicated. Furthermore, additional novel pathways were also suggested, such as heparan sulfate proteoglycan metabolic process, vascular endothelial growth factor production, whose transcriptional regulation may mediate the response to etanercept treatment.ConclusionThrough global gene expression analysis in PBMC of psoriasis patient and subsequent co-expression module based pathway analyses, we have identified a group of functionally coherent and differentially expressed genes (DEGs) and related pathways, which has not only provided new biological insight about the molecular mechanism of anti-TNF- α treatment, but also identified several genes whose expression profiles can be used as potential biomarkers for anti-TNF- α treatment response in psoriasis.

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 91
  • 10.1186/s12918-019-0694-y
GNE: a deep learning framework for gene network inference by aggregating biological information
  • Apr 1, 2019
  • BMC Systems Biology
  • Kishan Kc + 4 more

BackgroundThe topological landscape of gene interaction networks provides a rich source of information for inferring functional patterns of genes or proteins. However, it is still a challenging task to aggregate heterogeneous biological information such as gene expression and gene interactions to achieve more accurate inference for prediction and discovery of new gene interactions. In particular, how to generate a unified vector representation to integrate diverse input data is a key challenge addressed here.ResultsWe propose a scalable and robust deep learning framework to learn embedded representations to unify known gene interactions and gene expression for gene interaction predictions. These low- dimensional embeddings derive deeper insights into the structure of rapidly accumulating and diverse gene interaction networks and greatly simplify downstream modeling. We compare the predictive power of our deep embeddings to the strong baselines. The results suggest that our deep embeddings achieve significantly more accurate predictions. Moreover, a set of novel gene interaction predictions are validated by up-to-date literature-based database entries.ConclusionThe proposed model demonstrates the importance of integrating heterogeneous information about genes for gene network inference. GNE is freely available under the GNU General Public License and can be downloaded from GitHub (https://github.com/kckishan/GNE).

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 39
  • 10.1186/s12918-019-0690-2
Ultrafast clustering of single-cell flow cytometry data using FlowGrid
  • Apr 1, 2019
  • BMC Systems Biology
  • Xiaoxin Ye + 1 more

BackgroundFlow cytometry is a popular technology for quantitative single-cell profiling of cell surface markers. It enables expression measurement of tens of cell surface protein markers in millions of single cells. It is a powerful tool for discovering cell sub-populations and quantifying cell population heterogeneity. Traditionally, scientists use manual gating to identify cell types, but the process is subjective and is not effective for large multidimensional data. Many clustering algorithms have been developed to analyse these data but most of them are not scalable to very large data sets with more than ten million cells.ResultsHere, we present a new clustering algorithm that combines the advantages of density-based clustering algorithm DBSCAN with the scalability of grid-based clustering. This new clustering algorithm is implemented in python as an open source package, FlowGrid. FlowGrid is memory efficient and scales linearly with respect to the number of cells. We have evaluated the performance of FlowGrid against other state-of-the-art clustering programs and found that FlowGrid produces similar clustering results but with substantially less time. For example, FlowGrid is able to complete a clustering task on a data set of 23.6 million cells in less than 12 seconds, while other algorithms take more than 500 seconds or get into error.ConclusionsFlowGrid is an ultrafast clustering algorithm for large single-cell flow cytometry data. The source code is available at https://github.com/VCCRI/FlowGrid.

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 18
  • 10.1186/s12918-019-0692-0
Boolean network modeling of \u03b2-cell apoptosis and insulin resistance in type 2 diabetes mellitus
  • Apr 1, 2019
  • BMC Systems Biology
  • Pritha Dutta + 4 more

BackgroundMajor alteration in lifestyle of human population has promoted Type 2 diabetes mellitus (T2DM) to the level of an epidemic. This metabolic disorder is characterized by insulin resistance and pancreatic β-cell dysfunction and apoptosis, triggered by endoplasmic reticulum (ER) stress, oxidative stress and cytokines. Computational modeling is necessary to consolidate information from various sources in order to obtain a comprehensive understanding of the pathogenesis of T2DM and to investigate possible interventions by performing in silico simulations.ResultsIn this paper, we propose a Boolean network model integrating the insulin resistance pathway with pancreatic β-cell apoptosis pathway which are responsible for T2DM. The model has five input signals, i.e. ER stress, oxidative stress, tumor necrosis factor α (TNF α), Fas ligand (FasL), and interleukin-6 (IL-6). We performed dynamical simulations using random order asynchronous update and with different combinations of the input signals. From the results, we observed that the proposed model made predictions that closely resemble the expression levels of genes in T2DM as reported in the literature.ConclusionThe proposed model can make predictions about expression levels of genes in T2DM that are in concordance with literature. Although experimental validation of the model is beyond the scope of this study, the model can be useful for understanding the aetiology of T2DM and discovery of therapeutic intervention for this prevalent complex disease. The files of our model and results are available at https://github.com/JieZheng-ShanghaiTech/boolean-t2dm.

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 22
  • 10.1186/s12918-019-0697-8
Predicting disease-related phenotypes using an integrated phenotype similarity measurement based on HPO
  • Apr 1, 2019
  • BMC Systems Biology
  • Hansheng Xue + 2 more

BackgroundImproving efficiency of disease diagnosis based on phenotype ontology is a critical yet challenging research area. Recently, Human Phenotype Ontology (HPO)-based semantic similarity has been affectively and widely used to identify causative genes and diseases. However, current phenotype similarity measurements just consider the annotations and hierarchy structure of HPO, neglecting the definition description of phenotype terms.ResultsIn this paper, we propose a novel phenotype similarity measurement, termed as DisPheno, which adequately incorporates the definition of phenotype terms in addition to HPO structure and annotations to measure the similarity between phenotype terms. DisPheno also integrates phenotype term associations into phenotype-set similarity measurement using gene and disease annotations of phenotype terms.ConclusionsCompared with five existing state-of-the-art methods, DisPheno shows great performance in HPO-based phenotype semantic similarity measurement and improves the efficiency of disease identification, especially on noisy patients dataset.

  • Open Access Icon
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 23
  • 10.1186/s12918-019-0696-9
FCMDAP: using miRNA family and cluster information to improve the prediction accuracy of disease related miRNAs
  • Apr 1, 2019
  • BMC Systems Biology
  • Xiaoying Li + 3 more

BackgroundBiological experiments have confirmed the association between miRNAs and various diseases. However, such experiments are costly and time consuming. Computational methods help select potential disease-related miRNAs to improve the efficiency of biological experiments.MethodsIn this work, we develop a novel method using multiple types of data to calculate miRNA and disease similarity based on mutual information, and add miRNA family and cluster information to predict human disease-related miRNAs (FCMDAP). This method not only depends on known miRNA-diseases associations but also accurately measures miRNA and disease similarity and resolves the problem of overestimation. FCMDAP uses the k most similar neighbor recommendation algorithm to predict the association score between miRNA and disease. Information about miRNA cluster is also used to improve prediction accuracy.ResultFCMDAP achieves an average AUC of 0.9165 based on leave-one-out cross validation. Results confirm the 100, 98 and 96% of the top 50 predicted miRNAs reported in case studies on colorectal, lung, and pancreatic neoplasms. FCMDAP also exhibits satisfactory performance in predicting diseases without any related miRNAs and miRNAs without any related diseases.ConclusionsIn this study, we present a computational method FCMDAP to improve the prediction accuracy of disease related miRNAs. FCMDAP could be an effective tool for further biological experiments.