Pairwise Relationships Research Articles

The datasets of large genotyping biobanks and direct-to-consumer genetic testing companies contain many related individuals. Until now, it has been widely accepted that the most distant relationships that can be detected are around fifteen degrees (approximately 8th cousins) and that practical relationship estimates have a ceiling around ten degrees (approximately 5th cousins). However, we show that these assumptions are incorrect and that they are due to a misapplication of relationship estimators. In particular, relationship estimators are applied almost exclusively to putative relatives who have been identified because they share detectable tracts of DNA identically by descent (IBD). However, no existing relationship estimator conditions on the event that two individuals share at least one detectable segment of IBD anywhere in the genome. As a result, the relationship estimates obtained using existing estimators are dramatically biased for distant relationships, inferring all sufficiently distant relationships to be around ten degrees regardless of the depth of the true relationship. Existing relationship estimators are derived under a model that assumes that each pair of related individuals shares a single common ancestor (or mating pair of ancestors). This model breaks down for relationships beyond 10 generations in the past because individuals share many thousands of cryptic common ancestors due to pedigree collapse. We first derive a corrected likelihood that conditions on the event that at least one segment is observed between a pair of putative relatives and we demonstrate that the corrected likelihood largely eliminates the bias in estimates of pairwise relationships and provides a more accurate characterization of the uncertainty in these estimates. We then reformulate the relationship inference problem to account for the fact that individuals share many common ancestors, not just one. We demonstrate that the most distant relationship that can be inferred using IBD may be 200 degrees or more, rather than ten, extending the time-to-common ancestor from approximately 300 years in the past to approximately 3,000 years in the past or more. This dramatic increase in the range of relationship estimators makes it possible to infer relationships whose common ancestors lived before historical events such as European settlement of the Americas, the Transatlantic Slave Trade, and the rise and fall of the Roman Empire.
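
The conditioning idea in the abstract above can be illustrated with a toy model. The sketch below is not the authors' likelihood: it assumes, purely for illustration, that the number of shared IBD segments is Poisson distributed with a mean that follows a commonly used approximation, a(rd + c)/2^(d-1), where d is the number of meioses (treated here as the degree), r is the genome length in Morgans, c is the number of autosomes, and a is the number of common ancestors. It ignores segment lengths and detection thresholds. All function names and constants are hypothetical.

```python
import math

# Illustrative constants (assumptions, not taken from the abstract).
GENOME_MORGANS = 35.3  # approximate autosomal genetic map length
N_CHROMOSOMES = 22     # number of autosomes


def expected_segments(degree, n_ancestors=2):
    """Approximate mean number of shared IBD segments at a given degree."""
    return n_ancestors * (GENOME_MORGANS * degree + N_CHROMOSOMES) / 2 ** (degree - 1)


def log_lik(n_segments, degree):
    """Unconditional Poisson log-likelihood of observing n_segments."""
    lam = expected_segments(degree)
    return n_segments * math.log(lam) - lam - math.lgamma(n_segments + 1)


def log_lik_conditional(n_segments, degree):
    """Log-likelihood conditioned on observing at least one segment."""
    lam = expected_segments(degree)
    # log P(N >= 1) = log(1 - exp(-lam)), computed stably for small lam.
    return log_lik(n_segments, degree) - math.log(-math.expm1(-lam))


def mle_degree(loglik, n_segments, max_degree=30):
    """Grid-search maximum-likelihood degree between 1 and max_degree."""
    return max(range(1, max_degree + 1), key=lambda d: loglik(n_segments, d))


if __name__ == "__main__":
    print("unconditional:", mle_degree(log_lik, 1))
    print("conditional:  ", mle_degree(log_lik_conditional, 1))
```

Under these assumptions, a pair sharing a single segment gets an unconditional estimate close to ten degrees, whereas the conditional likelihood keeps increasing with degree, so the estimate lands on the upper edge of the search grid. In this toy model, the segment count alone gives no upper bound once the analysis conditions on having seen a segment at all.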

The automatic recognition of biomedical relationships is an important step in the semantic understanding of the information contained in the unstructured text of the published literature. The BioRED track at BioCreative VIII aimed to foster the development of such methods by providing participants with the BioRED-BC8 corpus, a collection of 1000 PubMed documents manually curated for diseases, genes/proteins, chemicals, cell lines, gene variants, and species, as well as the pairwise relationships between them: disease-gene, chemical-gene, disease-variant, gene-gene, chemical-disease, chemical-chemical, chemical-variant, and variant-variant. Furthermore, relationships are categorized into the following semantic categories: positive correlation, negative correlation, binding, conversion, drug interaction, comparison, cotreatment, and association. Unlike most previous publicly available corpora, all relationships are expressed at the document level as opposed to the sentence level, and as such, the entities are normalized to the corresponding concept identifiers of the standardized vocabularies: diseases and chemicals are normalized to MeSH, genes (and proteins) to National Center for Biotechnology Information (NCBI) Gene, species to NCBI Taxonomy, cell lines to Cellosaurus, and gene/protein variants to the Single Nucleotide Polymorphism Database. Finally, each annotated relationship is categorized as 'novel' depending on whether it is a novel finding or experimental verification in the publication it is expressed in. This distinction helps differentiate novel findings from other relationships in the same text that provide known facts and/or background knowledge. The BioRED-BC8 corpus uses the previous BioRED corpus of 600 PubMed articles as the training dataset and includes a set of 400 newly published articles to serve as the test data for the challenge. All test articles were manually annotated for the BioCreative VIII challenge by expert biocurators at the National Library of Medicine, using the original annotation guidelines, where each article is doubly annotated in a three-round annotation process until full agreement is reached between all curators. This manuscript details the characteristics of the BioRED-BC8 corpus as a critical resource for biomedical named entity recognition and relation extraction. Using this new resource, we have demonstrated advancements in biomedical text-mining algorithm development. Database URL: https://codalab.lisn.upsaclay.fr/competitions/16381.

Articles published on Pairwise Relationships

BMT independence

A Review of Hypergraph Neural Networks

HAGMN-UQ: Hyper association graph matching network with uncertainty quantification for coronary artery semantic labeling

Diffusion process with structural changes for subspace clustering

Learning accurate neighborhood- and self-information for higher-order relation prediction in Heterogeneous Information Networks

Semisupervised Progressive Representation Learning for Deep Multiview Clustering.

FamLink2 – A comprehensive tool for likelihood computations in pedigrees analyses involving linked DNA markers accounting for genotype uncertainties

Spontaneous Brain Activity Emerges from Pairwise Interactions in the Larval Zebrafish Brain

CORRECTING MODEL MISSPECIFICATION IN RELATIONSHIP ESTIMATES.

Generalized Einstein relations between absorption and emission spectra at thermodynamic equilibrium

Multispecies interactions and the community context of the evolution of virulence.

Improving unsupervised pedestrian re‐identification with enhanced feature representation and robust clustering

Context-embedded hypergraph attention network and self-attention for session recommendation

A comparative analysis of mutual information methods for pairwise relationship detection in metagenomic data

The biomedical relationship corpus of the BioRED track at the BioCreative VIII challenge and workshop.

White matter microstructure mediates the pairwise relationship between childhood maltreatment, microRNA-9, and the severity of major depressive disorder.

Constructing sympatry networks to assess potential introgression pathways within the major oak sections in the contiguous US states

Identifying the hierarchical emotional areas in the human brain through information fusion

Cross-Modal Federated Human Activity Recognition.

Efficient Local Coherent Structure Learning via Self-Evolution Bipartite Graph.
