Male gibbon loud morning calls conform to Zipf's law of brevity and Menzerath's law: insights into the origin of human language
Male gibbon loud morning calls conform to Zipf's law of brevity and Menzerath's law: insights into the origin of human language
- Research Article
27
- 10.1007/s10071-021-01495-3
- Mar 8, 2021
- Animal Cognition
Vocal and gestural sequences of several primates have been found to conform to two general principles of information compression: the compensation between the duration of a construct and that of its components (Menzerath-Altmann law) and an inverse relationship between signal duration and its occurrence (Zipf's law of abbreviation). Even though Zipf's law of brevity has been proposed as a universal in animal communication, evidence on non-human primate vocal behavior conformity to linguistic laws is still debated, and information on strepsirrhine primates is lacking. We analyzed the vocal behavior of the unique singing lemur species (Indri indri) to assess whether the song of the species shows evidence for compression. As roars have a chaotic structure that impedes the recognition of each individual utterance, and long notes are usually given by males, we focused on the core part of the song (i.e., the descending phrases, composed of two-six units). Our results indicate that indris' songs conform to Zipf's and Menzerath-Altmann linguistic laws. Indeed, shorter phrases are more likely to be included in the song, and units' duration decrease at the increase of the size of the phrases. We also found that, despite a sexual dimorphism in the duration of both units and phrases, these laws characterize sequences of both males and females. Overall, we provide the first evidence for a trade-off between signal duration and occurrence in the vocal behavior of a strepsirrhine species, suggesting that selective pressures for vocal compression are more ancestral than previously assumed within primates.
- Research Article
29
- 10.1098/rsos.200151
- Apr 1, 2020
- Royal Society Open Science
Evidence for compression, or minimization of code length, has been found across biological systems from genomes to human language and music. Two linguistic laws—Menzerath's Law (which states that longer sequences consist of shorter constituents) and Zipf's Law of abbreviation (a negative relationship between signal length and frequency of use)—are predictions of compression. It has been proposed that compression is a universal in animal communication, but there have been mixed results, particularly in reference to Zipf's Law of abbreviation. Like songbirds, male gibbons (Hylobates muelleri) engage in long solo bouts with unique combinations of notes which combine into phrases. We found strong support for Menzerath's Law as the longer a phrase, the shorter the notes. To identify phrase types, we used state-of-the-art affinity propagation clustering, and were able to predict phrase types using support vector machines with a mean accuracy of 74%. Based on unsupervised phrase type classification, we did not find support for Zipf's Law of abbreviation. Our results indicate that adherence to linguistic laws in male gibbon solos depends on the unit of analysis. We conclude that principles of compression are applicable outside of human language, but may act differently across levels of organization in biological systems.
- Research Article
27
- 10.1098/rsos.220849
- Nov 1, 2022
- Royal Society Open Science
Two language laws have been identified as consistent patterns shaping animal behaviour, both acting on the organizational level of communicative systems. Zipf's law of brevity describes a negative relationship between behavioural length and frequency. Menzerath's law defines a negative correlation between the number of behaviours in a sequence and average length of the behaviour composing it. Both laws have been linked with the information-theoretic principle of compression, which tends to minimize code length. We investigated their presence in a case study of male chimpanzee sexual solicitation gesture. We failed to find evidence supporting Zipf's law of brevity, but solicitation gestures followed Menzerath's law: longer sequences had shorter average gesture duration. Our results extend previous findings suggesting gesturing may be limited by individual energetic constraints. However, such patterns may only emerge in sufficiently large datasets. Chimpanzee gestural repertoires do not appear to manifest a consistent principle of compression previously described in many other close-range systems of communication. Importantly, the same signallers and signals were previously shown to adhere to these laws in subsets of the repertoire when used in play; highlighting that, in addition to selection on the signal repertoire, ape gestural expression appears shaped by factors in the immediate socio-ecological context.
- Research Article
6
- 10.1098/rsbl.2022.0162
- Jul 1, 2022
- Biology Letters
Compression has been presented as a general principle of animal communication. Zipf's Law of brevity is a manifestation of this postulate and can be generalized as the tendency of more frequent communicative elements to be shorter. Previous works supported this claim, showing evidence of Zipf's Law of brevity in animal acoustical communication and human language. However, a significant part of the communicative effort in biological systems is carried out in other transmission channels, such as those based on infochemicals. To fill this gap, we seek, for the first time, evidence of this principle in infochemical communication by analysing the statistical tendency of more frequent infochemicals to be chemically shorter and lighter. We analyse data from the largest and most comprehensive open-access infochemical database known as Pherobase, recovering Zipf's Law of brevity in interspecific communication (allelochemicals) but not in intraspecific communication (pheromones). Moreover, these results are robust even when addressing different magnitudes of study or mathematical approaches. Therefore, different dynamics from the compression principle would dominate intraspecific chemical communication, defying the universality of Zipf's Law of brevity. To conclude, we discuss the exception found for pheromones in the light of other potential communicative paradigms such as pressures on successful communication or the Handicap principle.
- Research Article
52
- 10.1098/rsbl.2019.0589
- Feb 1, 2020
- Biology Letters
Information compression is a general principle of human language: the most frequent words are shorter in length (Zipf's Law of Brevity) and the duration of constituents decreases as the size of the linguistic construct increases (Menzerath-Altmann Law). Vocal sequences of non-human primates have been shown to conform to both these laws, suggesting information compression might be a more general principle. Here, we investigated whether display songs of the African penguin, which mediate recognition, intersexual mate choice and territorial defence, conform with these laws. Display songs are long, loud sequences combining three types of syllables. We found that the shortest type of syllable was the most frequent (with the shortest syllable being repeated stereotypically, potentially favouring signal redundancy in crowded environments). We also found that the average duration of the song's constituents was negatively correlated with the size of the song (a consequence of increasing the relative number of the shortest syllable type, rather than reducing the duration across all syllable types, thus preserving the communication of size-related information in the duration of the longest syllable type). Our results provide the first evidence for conformity to Zipf's and Menzerath-Altmann Laws in the vocal sequences of a non-primate species, indicating that these laws can coexist with selection pressures specific to the species' ecology.
- Research Article
87
- 10.1098/rspb.2018.2900
- Feb 13, 2019
- Proceedings of the Royal Society B: Biological Sciences
Studies testing linguistic laws outside language have provided important insights into the organization of biological systems. For example, patterns consistent with Zipf's law of abbreviation (which predicts a negative relationship between word length and frequency of use) have been found in the vocal and non-vocal behaviour of a range of animals, and patterns consistent with Menzerath's law (according to which longer sequences are made up of shorter constituents) have been found in primate vocal sequences, and in genes, proteins and genomes. Both laws have been linked to compression-the information theoretic principle of minimizing code length. Here, we present the first test of these laws in animal gestural communication. We initially did not find the negative relationship between gesture duration and frequency of use predicted by Zipf's law of abbreviation, but this relationship was seen in specific subsets of the repertoire. Furthermore, a pattern opposite to that predicted was seen in one subset of gestures-whole body signals. We found a negative correlation between number and mean duration of gestures in sequences, in line with Menzerath's law. These results provide the first evidence that compression underpins animal gestural communication, and highlight an important commonality between primate gesturing and language.
- Research Article
17
- 10.1126/sciadv.ads6014
- Feb 7, 2025
- Science advances
Vocal communication systems in humans and other animals experience selection for efficiency-optimizing the benefits they convey relative to the costs of producing them. Two hallmarks of efficiency, Menzerath's law and Zipf's law of abbreviation, predict that longer sequences will consist of shorter elements and more frequent elements will be shorter, respectively. Here, we assessed the evidence for both laws in cetaceans by analyzing vocal sequences from 16 baleen and toothed whale species and comparing them to 51 human languages. Eleven whale species exhibit Menzerath's law, sometimes with greater effect sizes than human speech. Two of the five whale species with categorized element types exhibit Zipf's law of abbreviation. On average, whales also tend to shorten elements and intervals toward the end of sequences, although this varies by species. Overall, the results of this study suggest that the vocalizations of many cetacean species have undergone compression for increased efficiency in time.
- Research Article
4
- 10.1098/rspb.2023.1503
- Sep 27, 2023
- Proceedings. Biological sciences
Human language follows statistical regularities or linguistic laws. For instance, Zipf's law of brevity states that the more frequently a word is used, the shorter it tends to be. All human languages adhere to this word structure. However, it is unclear whether Zipf's law emerged de novo in humans or whether it also exists in the non-linguistic vocal systems of our primate ancestors. Using a vocal conditioning paradigm, we examined the capacity of marmoset monkeys to efficiently encode vocalizations. We observed that marmosets adopted vocal compression strategies at three levels: (i) increasing call rate, (ii) decreasing call duration and (iii) increasing the proportion of short calls. Our results demonstrate that marmosets, when able to freely choose what to vocalize, exhibit vocal statistical regularities consistent with Zipf's law of brevity that go beyond their context-specific natural vocal behaviour. This suggests that linguistic laws emerged in non-linguistic vocal systems in the primate lineage.
- Research Article
18
- 10.1098/rsos.201557
- Nov 1, 2020
- Royal Society Open Science
Across diverse systems including language, music and genomes, there is a tendency for longer sequences to contain shorter constituents; this phenomenon is known as Menzerath's Law. Whether Menzerath's Law is a universal in biological systems, is the result of compression (wherein shortest possible strings represent the maximum amount of information) or emerges from an inevitable relationship between sequence and constituent length remains a topic of debate. In non-human primates, the vocalizations of geladas, male gibbons and chimpanzees exhibit patterns consistent with Menzerath's Law. Here, we use existing datasets of three duetting primate species (tarsiers, titi monkeys and gibbons) to examine the wide-scale applicability of Menzerath's Law. Primate duets provide a useful comparative model to test for the broad-scale applicability of Menzerath's Law, as they evolved independently under presumably similar selection pressures and are emitted under the same context(s) across taxa. Only four out of the eight call types we examined were consistent with Menzerath's Law. Two of these call types exhibited a negative relationship between the position of the note in the call and note duration, indicating that adherence to Menzerath's Law in these call types may be related to breathing constraints. Exceptions to Menzerath's Law occur when notes are relatively homogeneous, or when species-specific call structure leads to a deterministic decrease in note duration. We show that adherence to Menzerath's Law is the exception rather than the rule in duetting primates. It is possible that selection pressures for long-range signals that can travel effectively over large distances was stronger than that of compression in primate duets. Future studies investigating adherence to Menzerath's Law across the vocal repertoires of these species will help us better elucidate the pressures that shape both short- and long-distance acoustic signals.
- Research Article
87
- 10.1098/rsbl.2010.0455
- Jun 23, 2010
- Biology Letters
Semple et al. (Semple et al. in press, Biol. Lett. (doi:10.1098/rsbl.2009.1062)) argued that the 'law of brevity' (an inverse relationship between word length and frequency of use) applies not only to human language but also to vocal signalling in non-human primates, because coding efficiency is paramount in both situations. We analysed the frequency of use of signals of different duration in the vocal repertoires of two Neotropical primate species studied in the wild-the common marmoset (Callithrix jacchus) and the golden-backed uakari (Cacajao melanocephalus). The key prediction of the law of brevity was not supported in either species: although the most frequently emitted calls were relatively brief, they were not the shortest signals in the repertoire. The costs and benefits associated with signals of different duration must be appreciated to understand properly their frequency of use. Although relatively brief vocal signals may be favoured by natural selection in order to minimize energetic costs, the very briefest signals may be ambiguous, contain reduced information or be difficult to detect or locate, and may therefore be selected against. Analogies between human language and vocal communication in animals can be misleading as a basis for understanding frequency of use, because coding efficiency is not the only factor of importance in animal communication, and the costs and benefits associated with different signal durations will vary in a species-specific manner.
- Research Article
105
- 10.1073/pnas.1522072113
- Apr 18, 2016
- Proceedings of the National Academy of Sciences
Identifying universal principles underpinning diverse natural systems is a key goal of the life sciences. A powerful approach in addressing this goal has been to test whether patterns consistent with linguistic laws are found in nonhuman animals. Menzerath's law is a linguistic law that states that, the larger the construct, the smaller the size of its constituents. Here, to our knowledge, we present the first evidence that Menzerath's law holds in the vocal communication of a nonhuman species. We show that, in vocal sequences of wild male geladas (Theropithecus gelada), construct size (sequence size in number of calls) is negatively correlated with constituent size (duration of calls). Call duration does not vary significantly with position in the sequence, but call sequence composition does change with sequence size and most call types are abbreviated in larger sequences. We also find that intercall intervals follow the same relationship with sequence size as do calls. Finally, we provide formal mathematical support for the idea that Menzerath's law reflects compression-the principle of minimizing the expected length of a code. Our findings suggest that a common principle underpins human and gelada vocal communication, highlighting the value of exploring the applicability of linguistic laws in vocal systems outside the realm of language.
- Research Article
1
- 10.1371/journal.pone.0324713
- May 20, 2025
- PloS one
The examination and application of Zipf's law is a significant topic in quantitative linguistics. This study presents an in-depth empirical investigation of this law in 651 Chinese provincial government work reports (2003-2023). Employing natural language processing techniques (including Jieba word segmentation with a custom dictionary) and a double-logarithmic regression model, we analyzed word frequency distributions. Our findings indicate that the Zipf coefficient in these reports is close to 1, confirming general adherence to Zipf's law. Over the 21-year period, the Zipf coefficient exhibits fluctuations, with a notable inflection point in 2011, after which it follows a consistent upward trend. This shift is likely influenced by the 18th National Congress of the Communist Party of China, which marked a transition toward more standardized and centralized policy communication. While regional differences among eastern, central, western, and northeastern provinces are minimal, centrally governed municipalities exhibit higher Zipf coefficients than other provincial-level regions. Although our findings largely confirm the applicability of Zipf's Law to this specific corpus, this study is limited by the exclusion of prefecture- and county-level reports. Future research can address this limitation by incorporating a broader range of administrative levels and by conducting cross-country and cultural comparisons of political documents. Further investigation of alternate quantitative linguistic laws (e.g., Heaps' Law, Menzerath's Law) within this corpus is also warranted.
- Research Article
37
- 10.1002/cplx.21429
- Nov 23, 2012
- Complexity
The importance of statistical patterns of language has been debated over decades. Although Zipf's law is perhaps the most popular case, recently, Menzerath's law has begun to be involved. Menzerath's law manifests in language, music and genomes as a tendency of the mean size of the parts to decrease as the number of parts increases in many situations. This statistical regularity emerges also in the context of genomes, for instance, as a tendency of species with more chromosomes to have a smaller mean chromosome size. It has been argued that the instantiation of this law in genomes is not indicative of any parallel between language and genomes because (a) the law is inevitable and (b) noncoding DNA dominates genomes. Here mathematical, statistical, and conceptual challenges of these criticisms are discussed. Two major conclusions are drawn: the law is not inevitable and languages also have a correlate of noncoding DNA. However, the wide range of manifestations of the law in and outside genomes suggests that the striking similarities between noncoding DNA and certain linguistics units could be anecdotal for understanding the recurrence of that statistical law. © 2012 Wiley Periodicals, Inc. Complexity, 2012
- Book Chapter
1
- 10.1002/9781118900772.etrds0380
- May 15, 2015
- Emerging Trends in the Social and Behavioral Sciences
Vocal communication is common in the animal kingdom. Researchers often examine vocal communication in nonhuman primates (primates) with the aim of identifying similarities and differences with human language and speech, in order to trace the evolutionary origins of our complex communication system. Primates can produce distinct calls in response to specific events, such as the discovery of a certain predator, and listeners seem to understand what these calls refer to. Although on the surface there are similarities between this type of communication and human referential words, the mental processes that underlie them may be very different. While in general the flexibility shown by primate receivers may demonstrate some commonalities with humans, there is much more controversy over whether there are similarities between the production of primate vocalizations and language. It is widely accepted that primates, unlike humans, lack the ability to generate new vocalizations. Although this means primates have a closed repertoire of calls that cannot be expanded, primates are capable of combining their existing calls to generate new messages. The degree of voluntary control and intentionality involved in the use of calls is also a matter of debate, with recent evidence on both a neural and behavioral level challenging traditional assumptions that primate vocalizations are used in an automatic, reflexive manner. More research is needed to examine the mental processes underlying communicative behavior in both the producer and the receiver. In the future adopting a more holistic, multimodal approach to studying primate communication is likely to challenge and ultimately improve our understanding of primate communication and the evolution of human language.
- Research Article
144
- 10.1111/cogs.12061
- Jul 29, 2013
- Cognitive Science
A key aim in biology and psychology is to identify fundamental principles underpinning the behavior of animals, including humans. Analyses of human language and the behavior of a range of non-human animal species have provided evidence for a common pattern underlying diverse behavioral phenomena: Words follow Zipf's law of brevity (the tendency of more frequently used words to be shorter), and conformity to this general pattern has been seen in the behavior of a number of other animals. It has been argued that the presence of this law is a sign of efficient coding in the information theoretic sense. However, no strong direct connection has been demonstrated between the law and compression, the information theoretic principle of minimizing the expected length of a code. Here, we show that minimizing the expected code length implies that the length of a word cannot increase as its frequency increases. Furthermore, we show that the mean code length or duration is significantly small in human language, and also in the behavior of other species in all cases where agreement with the law of brevity has been found. We argue that compression is a general principle of animal behavior that reflects selection for efficiency of coding.