IGD: high-performance search for large-scale genomic interval datasets.

Jianglin Feng, Nathan C Sheffield

doi:10.1093/bioinformatics/btaa1062

Open Access

Journal: Bioinformatics (Oxford, England)
Publication Date: Dec 26, 2020
Citations: 4

Affiliation: University of Virginia

#Integrated Genome Database #Large-scale Genome

Abstract

Databases of large-scale genome projects now contain thousands of genomic interval datasets. These data are a critical resource for understanding the function of DNA. However, our ability to examine and integrate interval data at this scale is limited. Here, we introduce the integrated genome database (IGD), a method and tool for searching genome interval datasets more than three orders of magnitude faster than existing approaches while using only one-hundredth of the memory. IGD uses a novel linear binning method that allows us to scale analysis to billions of genomic regions. IGD is available at https://github.com/databio/IGD. Supplementary data are available at Bioinformatics online.
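
The abstract names linear binning but does not describe it. As a rough illustration of the general idea only, the Python sketch below splits each chromosome into fixed-width bins, stores every interval in each bin it touches, and answers a query by scanning only the bins the query spans. It is not IGD's implementation (see the repository for that); the bin width, the in-memory dictionary layout, and the function names `build_index` and `count_overlaps` are all assumptions made for this example.

```python
from collections import defaultdict

BIN_SIZE = 16_384  # hypothetical bin width in base pairs, chosen for illustration


def build_index(intervals, bin_size=BIN_SIZE):
    """Map (chrom, bin number) -> list of (start, end, dataset_id).

    Intervals are half-open [start, end). An interval that spans several
    bins is stored in every bin it touches.
    """
    index = defaultdict(list)
    for chrom, start, end, dataset_id in intervals:
        for b in range(start // bin_size, (end - 1) // bin_size + 1):
            index[(chrom, b)].append((start, end, dataset_id))
    return index


def count_overlaps(index, chrom, q_start, q_end, bin_size=BIN_SIZE):
    """Count, per dataset, stored intervals overlapping [q_start, q_end)."""
    counts = defaultdict(int)
    first_bin = q_start // bin_size
    last_bin = (q_end - 1) // bin_size
    for b in range(first_bin, last_bin + 1):
        bin_start = b * bin_size
        for start, end, dataset_id in index.get((chrom, b), ()):
            if start < q_end and end > q_start:
                # An interval stored in several bins is counted once:
                # in the query's first bin, or in the bin where it starts.
                if b == first_bin or start >= bin_start:
                    counts[dataset_id] += 1
    return dict(counts)


# Toy data with a tiny bin width so intervals cross bin boundaries.
toy = [
    ("chr1", 5, 25, "A"),
    ("chr1", 12, 18, "A"),
    ("chr1", 30, 40, "B"),
    ("chr2", 0, 100, "B"),
]
toy_index = build_index(toy, bin_size=10)
print(count_overlaps(toy_index, "chr1", 15, 35, bin_size=10))  # {'A': 2, 'B': 1}
```

Because a query only visits the bins it overlaps, the work per query depends on the local interval density rather than on the total number of stored regions, which is the property that makes binning attractive at large scale.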
