Abstract

The representation of the gene content of an organism is impacted by several factors, ranging from sampling to sequencing and then the genome assembly task. The genome assembly process can generate errors that are related to insufficient coverage in the data set, an inadequate assembly methodology, and finally, errors related to the limitation of the assembly software used. Thus, some genes remain unidentified both in complete and draft genomes, this incomplete gene knowledge impacts on several organisms, mainly of medical and industrial interest, such as Bifidobacterium breve , a Gram-positive bacterium, found in the gastrointestinal microbiota of mammals, including humans, and has beneficial probiotic activities. Therefore, the objective of this work is to identify the new gene products not represented in the genome of Bifidobacterium breve DS15-17 using the raw reads of this organism. The reads were produced from the sequencing with the Illumina MiSeq platform. PAN2HGENE software was used to identify new gene products. After the analysis, 44 new gene products were identified, 26 with described function and 18 hypothetical proteins. The hypothetical proteins identified were analyzed in the ProtoNet and Superfamily databases.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.