Recognizing Reduplicated Forms: Finite-State Buffered Machines

Yang Wang

doi:10.18653/v1/2021.sigmorphon-1.20

Abstract

Total reduplication is common in natural language phonology and morphology. However, formally as copying on reduplicants of unbounded size, unrestricted total reduplication requires computational power beyond context-free, while other phonological and morphological patterns are regular, or even sub-regular. Thus, existing language classes characterizing reduplicated strings inevitably include typologically unattested context-free patterns, such as reversals. This paper extends regular languages to incorporate reduplication by introducing a new computational device: finite state buffered machine (FSBMs). We give its mathematical definitions and discuss some closure properties of the corresponding set of languages. As a result, the class of regular languages and languages derived from them through a copying mechanism is characterized. Suggested by previous literature, this class of languages should approach the characterization of natural language word sets.

Highlights

Formal language theory (FLT) provides computational mechanisms characterizing different classes of abstract languages based on their inherent structures
Several findings suggest that those four levels do not align with natural languages precisely, some leading to major refinements on the Chomsky Hierarchy (CH)
We analyze another mismatch between existing well-known language classes and empirical findings: reduplication, which involves copying operations on certain base forms (Inkelas and Zoll, 2005)

Summary

Definitions

FSBMs are two-taped automata with finite-state core control. One tape stores the input, as in normal FSAs; the other serves as an unbounded memory buffer, storing reduplicants temporarily for future identity checking. The buffer interacts with the input in restricted ways: 1) the buffer is queue-like; 2) the buffer needs to work on the same alphabet as the input, unlike the stack in a pushdown automata (PDA), for example; 3) once one symbol is removed from the buffer, everything else must be wiped off before the buffer is available for other symbol addition. These restrictions together ensure the machine does not generate string reversals or other non-reduplicative non-regular patterns. Transitions between two H states check input-memory identity and consume symbols in both the input and the buffer. The language recognized by an FSBM M is denoted by L(M ). w ∈ L(M ) iff there’s a run of M on w

Examples

Complete-path FSBMs

Intersection with FSAs q1 a q2 b q3 b q4 a q5

Some closure properties of FSBMs

Homomorphism and inverse alphabetic homomorphism

Other closure properties

Discussion and conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Recognizing Reduplicated Forms: Finite-State Buffered Machines

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2021
Citations: 2	License type: cc-by

Similar Papers

Recognizing Reduplicated Forms: Finite-State Buffered Machines

-

22 Jul 2021
22 Jul 2021

Complexity of Suffix-Free Regular Languages
Janusz Brzozowski ... Marek Szykuła
-
Janusz Brzozowski, et. al.Janusz Brzozowski ... Marek Szykuła
01 Jan 2015
01 Jan 2015

Continuity and Rational Functions
...
-
, et. al. ...
01 Jan 2017
01 Jan 2017

A Logical Characterization of Systolic Languages
Angelo Monti ... Adriano Peron
-
Angelo Monti, et. al.Angelo Monti ... Adriano Peron
01 Jan 1998
01 Jan 1998

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Recognizing Reduplicated Forms: Finite-State Buffered Machines

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers