Abstract

Bioactive peptides exhibit key roles in a wide variety of complex processes, such as regulation of body weight, learning, aging, and innate immune response. Next to the classical bioactive peptides, emerging from larger precursor proteins by specific proteolytic processing, a new class of peptides originating from small open reading frames (sORFs) have been recognized as important biological regulators. But their intrinsic properties, specific expression pattern and location on presumed non-coding regions have hindered the full characterization of the repertoire of bioactive peptides, despite their predominant role in various pathways. Although the development of peptidomics has offered the opportunity to study these peptides in vivo, it remains challenging to identify the full peptidome as the lack of cleavage enzyme specification and large search space complicates conventional database search approaches. In this study, we introduce a proteogenomics methodology using a new type of mass spectrometry instrument and the implementation of machine learning tools toward improved identification of potential bioactive peptides in the mouse brain. The application of trapped ion mobility spectrometry (tims) coupled to a time-of-flight mass analyzer (TOF) offers improved sensitivity, an enhanced peptide coverage, reduction in chemical noise and the reduced occurrence of chimeric spectra. Subsequent machine learning tools MS2PIP, predicting fragment ion intensities and DeepLC, predicting retention times, improve the database searching based on a large and comprehensive custom database containing both sORFs and alternative ORFs. Finally, the identification of peptides is further enhanced by applying the post-processing semi-supervised learning tool Percolator. Applying this workflow, the first peptidomics workflow combined with spectral intensity and retention time predictions, we identified a total of 167 predicted sORF-encoded peptides, of which 48 originating from presumed non-coding locations, next to 401 peptides from known neuropeptide precursors, linked to 66 annotated bioactive neuropeptides from within 22 different families. Additional PEAKS analysis expanded the pool of SEPs on presumed non-coding locations to 84, while an additional 204 peptides completed the list of peptides from neuropeptide precursors. Altogether, this study provides insights into a new robust pipeline that fuses technological advancements from different fields ensuring an improved coverage of the neuropeptidome in the mouse brain.

Highlights

  • The term “peptidomics” was first used two decades ago to describe a quantitative and qualitative analysis of the endogenous peptide pool in biological samples (Clynen et al, 2001; SchulzKnappe et al, 2001; Verhaert et al, 2001; Baggerman et al, 2002)

  • To the fragment ion peak intensities predicted by MS2PIP, retention times predicted by DeepLC were included, increasing the information for every Peptide to spectrum matches (PSMs) from 26 to 103 features

  • To the total of 607 peptides originating from known neuropeptide precursors, many of which performing fundamental biological roles (Le et al, 2013; Hayakawa et al, 2019), we were interested in identifying small open reading frames (sORF)-encoded peptides from so-called non-coding regions

Read more

Summary

Introduction

The term “peptidomics” was first used two decades ago to describe a quantitative and qualitative analysis of the endogenous peptide pool in biological samples (Clynen et al, 2001; SchulzKnappe et al, 2001; Verhaert et al, 2001; Baggerman et al, 2002). The long noncoding RNA (lncRNA) Aw112010 harbors a peptide vital to the mucosal immunity (Jackson et al, 2018) where the peptide produced from LINC00493 interacts with mitochondrial proteins (Wang et al, 2021) Another example is Nobody, a recently characterized human microprotein involved in the mRNA decapping machinery, translated from a transcript originally annotated as non-coding (D’Lima et al, 2017) and identified in mouse (Budamgunta et al, 2018)

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call