RNA Sequencing Reads Research Articles

RNA transcripts are potential therapeutic targets, yet bacterial transcripts have uncharacterized biodiversity. We developed an algorithm for transcript prediction called tp.py using it to predict transcripts (mRNA and other RNAs) in Escherichia coli K12 and E2348/69 strains (Bacteria:gamma-Proteobacteria), Listeria monocytogenes strains Scott A and RO15 (Bacteria:Firmicute), Pseudomonas aeruginosa strains SG17M and NN2 strains (Bacteria:gamma-Proteobacteria), and Haloferax volcanii (Archaea:Halobacteria). From >5 million E. coli K12 and >3 million E. coli E2348/69 newly generated Oxford Nanopore Technologies direct RNA sequencing reads, 2,487 K12 mRNAs and 1,844 E2348/69 mRNAs were predicted, with the K12 mRNAs containing more than half of the predicted E. coli K12 proteins. While the number of predicted transcripts varied by strain based on the amount of sequence data used, across all strains examined, the predicted average size of the mRNAs was 1.6-1.7 kbp, while the median size of the 5'- and 3'-untranslated regions (UTRs) were 30-90 bp. Given the lack of bacterial and archaeal transcript annotation, most predictions were of novel transcripts, but we also predicted many previously characterized mRNAs and ncRNAs, including post-transcriptionally generated transcripts and small RNAs associated with pathogenesis in the E. coli E2348/69 LEE pathogenicity islands. We predicted small transcripts in the 100-200 bp range as well as >10 kbp transcripts for all strains, with the longest transcript for two of the seven strains being the nuo operon transcript, and for another two strains it was a phage/prophage transcript. This quick, easy, and reproducible method will facilitate the presentation of transcripts, and UTR predictions alongside coding sequences and protein predictions in bacterial genome annotation as important resources for the research community.IMPORTANCEOur understanding of bacterial and archaeal genes and genomes is largely focused on proteins since there have only been limited efforts to describe bacterial/archaeal RNA diversity. This contrasts with studies on the human genome, where transcripts were sequenced prior to the release of the human genome over two decades ago. We developed software for the quick, easy, and reproducible prediction of bacterial and archaeal transcripts from Oxford Nanopore Technologies direct RNA sequencing data. These predictions are urgently needed for more accurate studies examining bacterial/archaeal gene regulation, including regulation of virulence factors, and for the development of novel RNA-based therapeutics and diagnostics to combat bacterial pathogens, like those with extreme antimicrobial resistance.

Read full abstract

In August and September 2022, two disease outbreaks were observed in Kuruma shrimp (Penaeus japonicus) farms in Okinawa, Japan. Diseased animals exhibited clinical signs of whitened musculature extending from the abdominal to the distal region. Histopathology revealed degeneration of fibers due to liquified muscle necrosis and haemocytic infiltration. Analysis of nonhost contigs from the assembled unmapped RNA and DNA shotgun sequence reads showed significant taxonomic alignment to bacterial species. This led to the investigation of a bacterium as a possible causative agent by characterizing its population in the diseased shrimp muscle. The majority of the 16S rRNA sequence recombinant clones had their highest homology with Photobacterium sp. (> 99%). The three bacterial isolates from the whitened muscle tissue were identified as Photobacterium damselae subsp. damselae (WMD-P1, WMD-P2 and WMD-P3) by biochemical and molecular assays and were further characterized. The Pdd genomes consisted of two circular chromosomes with varying numbers of plasmid. Its size ranges from 4.47 Mb to 4.60 Mb with an average GC content of 40.8%, with predicted number of coding sequences (CDs) ranging from 3816 to 4031. hlyA and pldA genes encoding for leukocidin pore-forming toxin and phospholipase were identified. Putative virulence genes are involved in adherence, antiphagocytosis, chemotaxis and motility, iron uptake, quorum sensing, secretion system, and immune evasion. The presence of prophages, genomic islands and antimicrobial resistant genes in the Pdd genomes suggests episodes of horizontal gene transfer. Average nucleotide identity (ANI) and pangenome analyses revealed a high genetic relationship of the Pdd strains (>98%) to isolates from other sources. PCR assays validated the presence of two bacterial virulence genes encoding the pore-forming toxin and phospholipase respectively in all isolates. Also, the strains had chitinase, phospholipase, and hemolytic activities. Intramuscular injection at 1 × 108 CFU/ml bacterial concentration produced pathological signs similar to those in naturally infected shrimp after 24 hpi. Lower concentration of 1 × 103 CFU/ml resulted in morbidity after 10 dpi. These results show that P. damselae subsp. damselae (Pdd) is associated with the occurrence of white muscle disease in Kuruma shrimp.

Read full abstract

RNA Sequencing Reads Research Articles

Related Topics

Articles published on RNA Sequencing Reads

Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads.

Deciphering transcript architectural complexity in bacteria and archaea.

Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner.

Transcriptome analysis reveals mechanisms of metabolic detoxification and immune responses following farnesyl acetate treatment in Metisa plana

Integrative analysis of nanopore direct RNA sequencing data reveals a role of PUS7-dependent pseudouridylation in regulation of m 6 A and m 5 C modifications.

Ornaments for efficient allele-specific expression estimation with bias correction

Applications of Nanopore sequencing in precision cancer medicine.

Whole-genome sequencing of 13 Arctic plants and draft genomes of Oxyria digyna and Cochlearia groenlandica

MiRglmm: a generalized linear mixed model of isomiR-level counts improves estimation of miRNA-level differential expression and uncovers variable differential expression between isomiRs.

The annotation of GBA1 has been concealed by its protein-coding pseudogene GBAP1.

First report of white muscle disease caused by Photobacterium damselae subsp. damselae in Kuruma shrimp (Penaeus japonicus)

RNA m6A detection using raw current signals and basecalling errors from Nanopore direct RNA sequencing reads.

Analyzing RNA-Seq Data from Chlamydia with Super Broad Transcriptomic Activation: Challenges, Solutions, and Implications for Other Systems.

Nm-Nano: a machine learning framework for transcriptome-wide single-molecule mapping of 2´-O-methylation (Nm) sites in nanopore direct RNA sequencing datasets

Comprehensive profiling of cancer neoantigens from aberrant RNA splicing

Disentangling genetic effects on transcriptional and post-transcriptional gene regulation through integrating exon and intron expression QTLs

Accelerating spliced alignment of long RNA sequencing reads using parallel maximal exact match retrieval

Deciphering Bacterial and Archaeal Transcriptional Dark Matter and Its Architectural Complexity.

Epigenetic Factor MicroRNAs Likely Mediate Vaccine Protection Efficacy against Lymphomas in Response to Tumor Virus Infection in Chickens through Target Gene Involved Signaling Pathways.

The impact of genetically controlled splicing on exon inclusion and protein structure.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

RNA Sequencing Reads Research Articles

Related Topics

Articles published on RNA Sequencing Reads

Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads.

Deciphering transcript architectural complexity in bacteria and archaea.

Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner.

Transcriptome analysis reveals mechanisms of metabolic detoxification and immune responses following farnesyl acetate treatment in Metisa plana

Integrative analysis of nanopore direct RNA sequencing data reveals a role of PUS7-dependent pseudouridylation in regulation of m 6 A and m 5 C modifications.

Ornaments for efficient allele-specific expression estimation with bias correction

Applications of Nanopore sequencing in precision cancer medicine.

Whole-genome sequencing of 13 Arctic plants and draft genomes of Oxyria digyna and Cochlearia groenlandica

MiRglmm: a generalized linear mixed model of isomiR-level counts improves estimation of miRNA-level differential expression and uncovers variable differential expression between isomiRs.

The annotation of GBA1 has been concealed by its protein-coding pseudogene GBAP1.

First report of white muscle disease caused by Photobacterium damselae subsp. damselae in Kuruma shrimp (Penaeus japonicus)

RNA m6A detection using raw current signals and basecalling errors from Nanopore direct RNA sequencing reads.

Analyzing RNA-Seq Data from Chlamydia with Super Broad Transcriptomic Activation: Challenges, Solutions, and Implications for Other Systems.

Nm-Nano: a machine learning framework for transcriptome-wide single-molecule mapping of 2´-O-methylation (Nm) sites in nanopore direct RNA sequencing datasets

Comprehensive profiling of cancer neoantigens from aberrant RNA splicing

Disentangling genetic effects on transcriptional and post-transcriptional gene regulation through integrating exon and intron expression QTLs

Accelerating spliced alignment of long RNA sequencing reads using parallel maximal exact match retrieval

Deciphering Bacterial and Archaeal Transcriptional Dark Matter and Its Architectural Complexity.

Epigenetic Factor MicroRNAs Likely Mediate Vaccine Protection Efficacy against Lymphomas in Response to Tumor Virus Infection in Chickens through Target Gene Involved Signaling Pathways.

The impact of genetically controlled splicing on exon inclusion and protein structure.