Abstract
We explore the structural similarities in three different languages, first in the protein language whose primary letters are the amino acids, second in the musical language whose primary letters are the notes, and third in the poetry language whose primary letters are the alphabet. For proteins, the non local (secondary) letters are the types of foldings in space (α-helices, β-sheets, etc.); for music, one is dealing with clear-cut repetition units called musical forms and for poems the structure consists of grammatical forms (names, verbs, etc.). We show in this paper that the mathematics of such secondary structures relies on finitely presented groups fp on r letters, where r counts the number of types of such secondary non local segments. The number of conjugacy classes of a given index (also the number of graph coverings over a base graph) of a group fp is found to be close to the number of conjugacy classes of the same index in the free group Fr−1 on r−1 generators. In a concrete way, we explore the group structure of a variant of the SARS-Cov-2 spike protein and the group structure of apolipoprotein-H, passing from the primary code with amino acids to the secondary structure organizing the foldings. Then, we look at the musical forms employed in the classical and contemporary periods. Finally, we investigate in much detail the group structure of a small poem in prose by Charles Baudelaire and that of the Bateau Ivre by Arthur Rimbaud.
Highlights
Published: 1 November 2021In this paper, we point out for the first time a remarkable analogy between the pattern structure of bonds between amino acids in a proteinand the non local structures observed in tonal music and in poems
After we received an invitation to contribute to the present special issue of Sci “Mathematics and poetry, with a view towards machine learning” we thought that our current group theoretical approach of protein language [3] could be converted into an understanding of the poetic language, as well as an understanding of some musical structures
There are many papers attempting to relate group theory to the genetic code, as reviewed in [2] but we found none of them featuring the secondary structure of proteins along the chain of amino acids, as we did in [3] and as we do below with the graph coverings
Summary
We point out for the first time a remarkable analogy between the pattern structure of bonds between amino acids in a protein (the protein secondary structure [1]). We explain the origin of these analogies with finitely generated groups and graph covering theory. A protein is a long polymeric linear chain encoded with 20 letters (the 20 amino acids). The surjective mapping of the 43 = 64 codons to the 20 amino acids is the DNA genetic code It can be given a mathematical theory with appropriate finite groups [2,3].
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.