The zebrafish (Danio rerio) has emerged as a model organism for investigating lncRNAs-driven fundamental biological processes, such as circadian rhythms, physiology, metabolism, and various diseases. While state-of-the-art sequencing technologies have identified an increasing number of lncRNAs in zebrafish, their annotations are far from complete. In this study, we collect 28,925 lncRNAs from both the published studies and our own RNA-seq analyses and establish a novel webserver-based database called SUDAZFLNC (https://sudarna.website/). The database, containing 28,925 lncRNAs, 25,432 mRNAs, and 368 miRNAs, provides several crucial features and annotations for the zebrafish RNAs, such as sequence identifiers (IDs), sequence length, hexamer score, coding probabilities, GO and KEGG annotations, and micropeptides. SUDAZFLNC also includes time-course expression profiles of 3288 lncRNAs, 25,432 mRNAs, and 342 miRNAs generated from our RNA-seq experiments, and 149, 4407, and 43 rhythmically expressed lncRNAs, mRNAs, and miRNAs, respectively. Based on the peak expression patterns, we classified these RNAs into morning RNAs, evening RNAs, and night RNAs. Users of the database can access the RNA sequences and their expression profiles by searching the corresponding IDs from the Graphical User Interface (GUI) of the database. The database supports several features to investigate RNA sequences and expression profiles, including BLAST, search of sequence and data, ID conversion, and RNA-RNA interaction prediction. This is the largest curated database of zebrafish RNAs and their expression profiles to date.
Read full abstract