Classifying dialogue in high-dimensional space

José P González-Brenes,Jack Mostow

doi:10.1145/1966407.1966413

Abstract

The richness of multimodal dialogue makes the space of possible features required to describe it very large relative to the amount of training data. However, conventional classifier learners require large amounts of data to avoid overfitting, or do not generalize well to unseen examples. To learn dialogue classifiers using a rich feature set and fewer data points than features, we apply a recent technique, ℓ 1 -regularized logistic regression. We demonstrate this approach empirically on real data from Project LISTEN's Reading Tutor, which displays a story on a computer screen and listens to a child read aloud. We train a classifier to predict task completion (i.e., whether the student will finish reading the story) with 71% accuracy on a balanced, unseen test set. To characterize differences in the behavior of children when they choose the story they read, we likewise train and test a classifier that with 73.6% accuracy infers who chose the story based on the ensuing dialogue. Both classifiers significantly outperform baselines and reveal relevant features of the dialogue.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Classifying dialogue in high-dimensional space

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Speech and Language Processing

Lead the way for us

Journal: ACM Transactions on Speech and Language Processing	Publication Date: May 1, 2011
Citations: 37

Similar Papers

Skill-specific spoken dialogs in a reading tutor that listens
Gregory Aist
-
Gregory AistGregory Aist
01 Jan 1998
01 Jan 1998

Giving Help and Praise in a Reading Tutor with Imperfect Listening--Because Automated Speech Recognition Means Never Being Able to Say You're Certain
Jack Mostow ... Gregory Aist
CALICO Journal | VOL. 16
Jack Mostow, et. al.Jack Mostow ... Gregory Aist
14 Jan 2013
CALICO Journal | VOL. 16

Kernel-Based Manifold-Oriented Stochastic Neighbor Projection Method
Jianwei Zheng ... Xinli Xu
-
Jianwei Zheng, et. al.Jianwei Zheng ... Xinli Xu
27 May 2013
27 May 2013

Risks of feature leakage and sample size dependencies in deep feature extraction for breast mass classification.
Ravi K Samala ... Lubomir Hadjiiski
Medical physics | VOL. 48
Ravi K Samala, et. al.Ravi K Samala ... Lubomir Hadjiiski
12 Apr 2021
Medical physics | VOL. 48

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classifying dialogue in high-dimensional space

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Speech and Language Processing