The coverage of the protein database directly determines the results of shotgun proteomics. In this study, PacBio single-molecule real-time sequencing technology was performed on postmortem silver carp muscle transcripts. A total of 42.43 Gb clean data, 35,834 nonredundant transcripts, and 15,413 unigenes were obtained. In total, 99.32% of the unigenes were successfully annotated and assigned specific functions. PacBio long-read isoform sequencing (Iso-Seq) analysis can provide more accurate protein information with a higher proportion of complete coding sequences and longer lengths. Subsequently, 2671 proteins were identified in deep 4D proteomics informed by a full-length transcriptomics technique, which has been shown to improve the identification of low-abundance muscle proteins and potential protein isoforms. The feature of the sarcomeric protein profile and information on more than 30 major proteins in the white dorsal muscle of silver carp were reported here for the first time. Overall, this study provides valuable transcriptome data resources and the comprehensive muscle protein information detected to date for further study into the processing characteristic of early postmortem fish muscle, as well as a spectral library for data-independent acquisition and data processing. This batch of muscle-specific dependent acquisition data is available via PRIDE with identifier PXD043702.
Read full abstract