Abstract

AbstractThe paper presents the results of automatic classification of the dialogues of Russian-speaking children with typical and atypical development using machine learning methods. The study proposes an approach to developing the automatic system for classification the state of children (typical development, autism spectrum disorders, and Down syndrome) based on the linguistic characteristics of speech. 62 boys aged 8–11 years including 20 children with typical development (TD), 28 with autism spectrum disorders (ASD), and 14 with Down syndrome (DS) were interviewed. The dataset contains 69 files with dialogues between adult and child. Only children’s responses were used for further analysis. Morphological, graphematic, and emotional characteristics of speech were extracted from the text of the dialogues. A total of 62 linguistic features were extracted from each dialogue: the number of replies, sentences, tokens, pauses, and unfinished words; the relative frequencies of parts of speech and some grammatical categories (animacy, number, aspect, involvement, mood, person, tense), and the statistics of positive and negative words use. The Mann-Whitney U test was used to assess differences in the linguistic features of the speech. The differences between boys with TD, ASD, and DS in 40 linguistic features of their speech were revealed. These features were used to develop classification models using machine learning methods: Gradient Boosting, Random Forest, Ada Boost. The revealed features showed a good differentiating ability. The classification accuracy for the dialogues of boys with TD, ASD and DS was 88%.KeywordsLinguistic featuresAutism spectrum disordersDown syndromeAutomatic classification

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call