Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study

Ravi Iyer,Denny Meyer,Maja Nedeljkovic

doi:10.2196/42249

Ravi Iyer, Denny Meyer + Show 1 more

Open Access

https://doi.org/10.2196/42249

Copy DOI

Journal: JMIR Formative Research	Publication Date: Dec 19, 2022
Citations: 7	License type: cc-by

Affiliation: Swinburne University of Technology

Abstract

Elevated psychological distress has demonstrated impacts on individuals' health. Reliable and efficient ways to detect distress are key to early intervention. Artificial intelligence has the potential to detect states of emotional distress in an accurate, efficient, and timely manner. The aim of this study was to automatically classify short segments of speech obtained from callers to national suicide prevention helpline services according to high versus low psychological distress and using a range of vocal characteristics in combination with machine learning approaches. A total of 120 telephone call recordings were initially converted to 16-bit pulse code modulation format. Short variable-length segments of each call were rated on psychological distress using the distress thermometer by the responding counselor and a second team of psychologists (n=6) blinded to the initial ratings. Following this, 24 vocal characteristics were initially extracted from 40-ms speech frames nested within segments within calls. After highly correlated variables were eliminated, 19 remained. Of 19 vocal characteristics, 7 were identified and validated as predictors of psychological distress using a penalized generalized additive mixed effects regression model, accounting for nonlinearity, autocorrelation, and moderation by sex. Speech frames were then grouped using k-means clustering based on the selected vocal characteristics. Finally, component-wise gradient boosting incorporating these clusters was used to classify each speech frame according to high versus low psychological distress. Classification accuracy was confirmed via leave-one-caller-out cross-validation, ensuring that speech segments from individual callers were not used in both the training and test data. The sample comprised 87 female and 33 male callers. From an initial pool of 19 characteristics, 7 vocal characteristics were identified. After grouping speech frames into 2 separate clusters (correlation with sex of caller, Cramer's V =0.02), the component-wise gradient boosting algorithm successfully classified psychological distress to a high level of accuracy, with an area under the receiver operating characteristic curve of 97.39% (95% CI 96.20-98.45) and an area under the precision-recall curve of 97.52 (95% CI 95.71-99.12). Thus, 39,282 of 41,883 (93.39%) speech frames nested within 728 of 754 segments (96.6%) were classified as exhibiting low psychological distress, and 71455 of 75503 (94.64%) speech frames nested within 382 of 423 (90.3%) segments were classified as exhibiting high psychological distress. As the probability of high psychological distress increases, male callers spoke louder, with greater vowel articulation but with greater roughness (subharmonic depth). In contrast, female callers exhibited decreased vocal clarity (entropy), greater proportion of signal noise, higher frequencies, increased breathiness (spectral slope), and increased roughness of speech with increasing psychological distress. Individual caller random effects contributed 68% to risk reduction in the classification algorithm, followed by cluster configuration (23.4%), spectral slope (4.4%), and the 50th percentile frequency (4.2%). The high level of accuracy achieved suggests possibilities for real-time detection of psychological distress in helpline settings and has potential uses in pre-emptive triage and evaluations of counseling outcomes. ANZCTR ACTRN12622000486729; https://www.anzctr.org.au/ACTRN12622000486729.aspx.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study

Abstract

Talk to us

Similar Papers

More From: JMIR Formative Research

Lead the way for us

Similar Papers

Primary Absolute Cardiovascular Disease Risk and Prevention in Relation to Psychological Distress in the Australian Population: A Nationally Representative Cross-Sectional Study.
Jennifer Welsh ... Grace Joshy
Frontiers in public health | VOL. 7
Jennifer Welsh, et. al.Jennifer Welsh ... Grace Joshy
31 May 2019
Frontiers in public health | VOL. 7

Psychological Distress and Risk of Myocardial Infarction and Stroke in the 45 and Up Study
Caroline A Jackson ... Cathie L.M Sudlow
Circulation: Cardiovascular Quality and Outcomes | VOL. 11
Caroline A Jackson, et. al.Caroline A Jackson ... Cathie L.M Sudlow
01 Sep 2018
Circulation: Cardiovascular Quality and Outcomes | VOL. 11

OP99 Psychological distress and incident stroke risk in the 45 and up study
Ca Jackson ... Gd Mishra
Journal of Epidemiology and Community Health | VOL. 71
Ca Jackson, et. al.Ca Jackson ... Gd Mishra
01 Sep 2017
OP99 Psychological distress and incident stroke risk in the 45 and up study
Ca Jackson ... Gd Mishra

Psychological distress and C-reactive protein: do health behaviours and pathophysiological factors modify the association?
Pekka Johannes Puustinen ... Mauno Vanhala
European Archives of Psychiatry and Clinical Neuroscience | VOL. 261
Pekka Johannes Puustinen, et. al.Pekka Johannes Puustinen ... Mauno Vanhala
14 Aug 2010
European Archives of Psychiatry and Clinical Neuroscience | VOL. 261

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study

Abstract

Talk to us

Similar Papers

More From: JMIR Formative Research