Heterogeneous Data Resources Research Articles

Background and objective: With the advent of bioinformatics, biological databases have been constructed to computerize data. Biological systems can be described as interactions and relationships between elements constituting the systems, and they are organized in various biomedical open databases. These open databases have been used in approaches to predict functional interactions such as protein-protein interactions (PPI), drug-drug interactions (DDI) and disease-disease relationships (DDR). However, just combining interaction data has limited effectiveness in predicting the complex relationships occurring in a whole context. Each contributing source contains information on each element in a specific field of knowledge but there is a lack of inter-disciplinary insight in combining them.Methods: In this study, we propose the RWD Integrated platform for Discovering Associations in Biomedical research (RIDAB) to predict interactions between biomedical entities. RIDAB is established as a graph network to construct a platform that predicts the interactions of target entities. Biomedical open database is combined with EMRs each representing a biomedical network and a real-world data. To integrate databases from different domains to build the platform, mapping of the vocabularies was required. In addition, the appropriate structure of the network and the graph embedding method to be used were needed to be selected to fit the tasks.Results: The feasibility of the platform was evaluated using node similarity and link prediction for drug repositioning task, a commonly used task for biomedical network. In addition, we compared the US Food and Drug Administration (FDA)-approved repositioned drugs with the predicted result. By integrating EMR database with biomedical networks, the platform showed increased f1 score in predicting repositioned drugs, from 45.62% to 57.26%, compared to platforms based on biomedical networks alone.Conclusions: This study demonstrates that the elements of biomedical research findings can be reflected by integrating EMR data with open-source biomedical networks. In addition, showed the feasibility of using the established platform to represent the integration of biomedical networks and reflected the relationship between real world networks.

Read full abstract

We sought to explore, via a systematic review of the literature, the state of the art of knowledge discovery in biomedical databases as it existed in 1992, and then now, 25 years later, mainly focused on supervised learning. We performed a rigorous systematic search of PubMed and latent Dirichlet allocation to identify themes in the literature and trends in the science of knowledge discovery in and between time periods and compare these trends. We restricted the result set using a bracket of five years previous, such that the 1992 result set was restricted to articles published between 1987 and 1992, and the 2015 set between 2011 and 2015. This was to reflect the current literature available at the time to researchers and others at the target dates of 1992 and 2015. The search term was framed as: Knowledge Discovery OR Data Mining OR Pattern Discovery OR Pattern Recognition, Automated. A total 538 and 18,172 documents were retrieved for 1992 and 2015, respectively. The number and type of data sources increased dramatically over the observation period, primarily due to the advent of electronic clinical systems. The period 1992- 2015 saw the emergence of new areas of research in knowledge discovery, and the refinement and application of machine learning approaches that were nascent or unknown in 1992. Over the 25 years of the observation period, we identified numerous developments that impacted the science of knowledge discovery, including the availability of new forms of data, new machine learning algorithms, and new application domains. Through a bibliometric analysis we examine the striking changes in the availability of highly heterogeneous data resources, the evolution of new algorithmic approaches to knowledge discovery, and we consider from legal, social, and political perspectives possible explanations of the growth of the field. Finally, we reflect on the achievements of the past 25 years to consider what the next 25 years will bring with regard to the availability of even more complex data and to the methods that could be, and are being now developed for the discovery of new knowledge in biomedical data.

Read full abstract

Heterogeneous Data Resources Research Articles

Related Topics

Articles published on Heterogeneous Data Resources

Predicting protein and pathway associations for understudied dark kinases using pattern-constrained knowledge graph embedding.

Construction and application of Chinese breast cancer knowledge graph based on multi-source heterogeneous data.

Design of an English Web-based Teaching Resource Sharing Platform based on Mobile Web Technology

RIDAB: Electronic medical record-integrated real world data platform for predicting and summarizing interactions in biomedical research from heterogeneous data resources

High-quality gene/disease embedding in a multi-relational heterogeneous graph after a joint matrix/tensor decomposition

Boosting Climate Analysis With Semantically Uplifted Knowledge Graphs

HKGB: An Inclusive, Extensible, Intelligent, Semi-auto-constructed Knowledge Graph Framework for Healthcare with Clinicians’ Expertise Incorporated

Research on Construction Technology of Multi Heterogeneous Data Resource Graph of Power Grid Corporation

Knowledge Service Model of Port Supply Chain Enterprise Based on Ontology

Multimodal Analytics to Understand Self-Regulation Process of Cognitive and Behavioral Strategies in Real-World Learning

A Novel HDF‐Based Data Compression and Integration Approach to Support BIM‐GIS Practical Applications

Research on Cluster Analysis Method of Heterogeneous Data Resources Based on FCM

Virtual Research Environment Integrating Heterogeneous Data Resources for Materials Science and Engineering

FACTORBASE: multi-relational structure learning with SQL all the way

Generative Adversarial Networks Based Heterogeneous Data Integration and Its Application for Intelligent Power Distribution and Utilization

A Spatio-Temporal Enhanced Metadata Model for Interdisciplinary Instant Point Observations in Smart Cities

Analysis of Heterogeneous Data Integration and Document Clustering method in Digital Library

Progress in Biomedical Knowledge Discovery: A 25-year Retrospective.

Process Materials Scientific Data for Intelligent Service Using a Dataspace Model

Deception Detection in Cyber Conflicts

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Heterogeneous Data Resources Research Articles

Related Topics

Articles published on Heterogeneous Data Resources

Predicting protein and pathway associations for understudied dark kinases using pattern-constrained knowledge graph embedding.

Construction and application of Chinese breast cancer knowledge graph based on multi-source heterogeneous data.

Design of an English Web-based Teaching Resource Sharing Platform based on Mobile Web Technology

RIDAB: Electronic medical record-integrated real world data platform for predicting and summarizing interactions in biomedical research from heterogeneous data resources

High-quality gene/disease embedding in a multi-relational heterogeneous graph after a joint matrix/tensor decomposition

Boosting Climate Analysis With Semantically Uplifted Knowledge Graphs

HKGB: An Inclusive, Extensible, Intelligent, Semi-auto-constructed Knowledge Graph Framework for Healthcare with Clinicians’ Expertise Incorporated

Research on Construction Technology of Multi Heterogeneous Data Resource Graph of Power Grid Corporation

Knowledge Service Model of Port Supply Chain Enterprise Based on Ontology

Multimodal Analytics to Understand Self-Regulation Process of Cognitive and Behavioral Strategies in Real-World Learning

A Novel HDF‐Based Data Compression and Integration Approach to Support BIM‐GIS Practical Applications

Research on Cluster Analysis Method of Heterogeneous Data Resources Based on FCM

Virtual Research Environment Integrating Heterogeneous Data Resources for Materials Science and Engineering

FACTORBASE: multi-relational structure learning with SQL all the way

Generative Adversarial Networks Based Heterogeneous Data Integration and Its Application for Intelligent Power Distribution and Utilization

A Spatio-Temporal Enhanced Metadata Model for Interdisciplinary Instant Point Observations in Smart Cities

Analysis of Heterogeneous Data Integration and Document Clustering method in Digital Library

Progress in Biomedical Knowledge Discovery: A 25-year Retrospective.

Process Materials Scientific Data for Intelligent Service Using a Dataspace Model

Deception Detection in Cyber Conflicts