The Charge Clusters (CCs) are involved in key functions and are distributed according to the organism, the protein’s type, and the charge of amino acids. In the present study, we have explored the occurrence, position, and annotation as a first large-scale study of the CCs in land plants mitochondrial proteomes. A new python script was used for data curation. The Finding Clusters Charge in Protein Sequences Program was performed after adjusting the reading window size. A 44316 protein sequences belonging to 52 species of land plants were analysed. The occurrence of Negative Charge Clusters (NCCs) (1.2%) is two times more frequent than the Positive Charge Clusters (PCCs) (0.64%). Moreover, 39 and 30 NCCs were conserved in 88 and 41 proteins in intra and in inter proteomes respectively, while 14 and 21 PCCs were conserved in 53 and 85 protein sequences in intra and inter proteomes consecutively. Sequences carrying mixed CCs are rare (0.12%). Despite this low abundance, CCs play a crucial role in protein function. The CCs tend to be located mainly in the terminal regions of proteins which guarantees specific protein targeting and import into the mitochondria. In addition, the functional annotation of CCs according to Gene Ontology shows that CCs are involved in binding functions of either proteins or macromolecules which are deployed in different metabolic and cellular processes such as RNA editing and transcription. This study may provide valuable information while considering the CCs in understanding the environmental adaptation of plants. Communicated by Ramaswamy H. Sarma
Read full abstract