Merged Data Files Research Articles

Background: Rainbow trout is an economically important fish and a suitable experimental organism in many fields of biology including genome evolution, owing to the occurrence of a salmonid-specific whole-genome duplication (4th WGD). Rainbow trout is among the most studied teleosts and has benefited from substantial efforts to develop genomic resources (e.g., linkage maps). Here, we first generated a synthetic map by merging segregation data files derived from three independent linkage maps. We then used it to evaluate genome conservation between rainbow trout and three teleost models (medaka, stickleback and zebrafish) and to further investigate the extent of the 4th WGD in the trout genome.

Results: The INRA linkage map was updated by adding 211 new markers. After standardization of marker names, the consistency of marker assignment to linkage groups and of marker orders was checked across the three data sets, and only loci showing a consistent location over all or almost all of the data sets were kept. This resulted in a synthetic map consisting of 2226 markers and 29 linkage groups spanning over 3600 cM. Blastn searches against medaka, stickleback and zebrafish genomic databases resulted in 778, 824 and 730 significant hits, respectively, while blastx searches yielded 505, 513 and 510 significant hits. Homology search results revealed that, for most rainbow trout chromosomes, large syntenic regions encompassing nearly whole chromosome arms have been conserved between rainbow trout and its closest models, medaka and stickleback. Large conserved syntenies were also found between the genomes of rainbow trout and the reconstructed teleost ancestor. These syntenies consolidated the known homeologous affinities between rainbow trout chromosomes due to the 4th WGD and suggested new ones.

Conclusions: The synthetic map constructed herein further highlights the stability of the teleost genome over long evolutionary time scales. This map can be easily extended by incorporating new data sets and should help future assembly of the rainbow trout whole-genome sequence. Finally, the persistence of large conserved syntenies across teleosts should facilitate the identification of candidate genes through comparative mapping, even if the occurrence of intra-chromosomal micro-rearrangements may hinder the accurate prediction of their genomic location.

Abstract. Water column data of carbon and carbon-relevant hydrographic and hydrochemical parameters from 188 previously non-publicly available cruise data sets in the Arctic Mediterranean Seas, Atlantic and Southern Ocean have been retrieved and merged into a new database: CARINA (CARbon dioxide IN the Atlantic Ocean). The data have gone through rigorous quality control procedures to assure the highest possible quality and consistency. The data for the pertinent parameters in the CARINA database were objectively examined in order to quantify systematic differences in the reported values, i.e. secondary quality control. Systematic biases found in the data have been corrected in the three data products: merged data files with measured, calculated and interpolated data for each of the three CARINA regions, i.e. the Arctic Mediterranean Seas, the Atlantic and the Southern Ocean. These products have been corrected to be internally consistent. Ninety-eight of the cruises in the CARINA database were conducted in the Atlantic Ocean, defined here as the region south of the Greenland-Iceland-Scotland Ridge and north of about 30° S. Here we present an overview of the Atlantic Ocean synthesis of the CARINA data and the adjustments that were applied to the data product. We also report the details of the secondary quality control (QC) for salinity for this data set. The quality control procedures, including crossover analysis between stations and inversion analysis of all crossover data, are briefly described. Adjustments to salinity measurements were applied to the data from 10 cruises in the Atlantic Ocean region. Based on our analysis we estimate the internal consistency of the CARINA-ATL salinity data to be 4.1 ppm. With these adjustments the CARINA data products are consistent both internally and with GLODAP data, an oceanographic data set based on the World Hydrographic Program in the 1990s, and are now suitable for accurate assessments of, for example, oceanic carbon inventories and uptake rates, and for model validation.

Articles published on Merged Data Files

Multifile Partitioning for Record Linkage and Duplicate Detection

GazeR: A Package for Processing Gaze Position and Pupil Size Data

Taxamat: Automated biodiversity data management tool - Implications for microbiome studies.

A3.24 Distinct profile of cytokine production by th17 cells in systemic sclerosis

Merging Large-Scale Assessment Data for Secondary Analysis: Experiences with EQAO’s Data

Navigating Complex Sample Analysis Using National Survey Data

A synthetic rainbow trout linkage map provides new insights into the salmonid whole genome duplication and the conservation of synteny among teleosts

CARINA TCO₂ data in the Atlantic Ocean

CARINA data synthesis project: pH data scale unification and cruise adjustments

Quality control procedures and methods of the CARINA database

Atlantic Ocean CARINA data: overview and salinity adjustments

CARINA alkalinity data in the Atlantic Ocean

CARINA: nutrient data in the Atlantic Ocean

The polarisation method for merging data files and analysing loyalty to product attributes, prices and brands in revealed preference

Effects on accidents of periodic motor vehicle inspection in Norway

Rising trends in cesarean section rates in Egypt.

Residential vacancy chain models of an urban housing market. Exercises in impact and needs assessment
