Abstract
Small Open Reading Frames (smORFs) of less than 100 codons remain mostly uncharacterised. About a thousand smORFs per genome encode peptides and microproteins about 70–80 aa long, often containing recognisable protein structures and markers of translation, and these are referred to as short Coding Sequences (sCDSs). The characterisation of individual sCDSs has provided examples of smORFs’ function and conservation, but we cannot infer the functionality of all other metazoan smORFs from these. sCDS function has been characterised at a genome-wide scale in yeast and bacteria, showing that hundreds can produce a phenotype, but attempts in metazoans have been less successful. Either most sCDSs are not functional, or classic experimental techniques do not work with smORFs due to their shortness. Here, we combine extensive proteomics with bioinformatics and genetics in order to detect and corroborate sCDS function in Drosophila. Our studies nearly double the number of sCDSs with detected peptides and microproteins and an experimentally corroborated function. Finally, we observe a correlation between proven sCDS protein function and bioinformatic markers such as conservation and GC content. Our results support that sCDSs peptides and microproteins act as membrane-related regulators of canonical proteins, regulators whose functions are best understood at the cellular level, and whose mutants produce little, if any, overt morphological phenotypes.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.