Abstract
This study explores the behaviour of quantitative motifs in random numerical data, focussing on the property of length. One hundred and twenty sequences of pseudorandom integers were generated and segmented into quantitative motifs using the standard definition. The length–frequency relationship was modelled in each case using the Zipf-Alekseev function. Different combinations of sequence length, integer range, and pseudorandom number generator were trialled. The model-fitting was successful in all cases (𝑅2 1 > 0.9 in 118/120 cases and > 0.8 in 2/120), and the model parameters fell in the same range as those obtained in a previous textual study of motifs. Integer range was the main factor affecting the parameter values, with sequence length affecting the degree of spread. Further comparisons between textual and random data are needed, as well as a theoretical explanation of why motifs in random data demonstrate lawful behaviour. In future textual motif studies, particular attention needs to be paid to possible dependencies on value range and text length.
Published Version
Join us for a 30 min session where you can share your feedback and ask us any queries you have