The EarSAVAS Dataset

Xiyuxing Zhang, Yuxuan Han, Chen Liang, Yuntao Wang, Jiankai Tang, Ishan Chatterjee, Yuanchun Shi, Xin Yi, Shwetak Patel

doi:10.1145/3659616

Abstract

Subject-aware vocal activity sensing on wearables, which specifically recognizes and monitors the wearer's distinct vocal activities, is essential to advancing personal health monitoring and enabling context-aware applications. While recent advances in earables present new opportunities, the absence of relevant datasets and effective methods remains a significant challenge. In this paper, we introduce EarSAVAS, the first publicly available dataset constructed specifically for subject-aware human vocal activity sensing on earables. EarSAVAS encompasses eight distinct vocal activities from both the earphone wearer and bystanders, and includes synchronous two-channel audio and motion data collected from 42 participants, totaling 44.5 hours. Further, we propose EarVAS, a lightweight multi-modal deep learning architecture that enables efficient subject-aware vocal activity recognition on earables. To validate the reliability of EarSAVAS and the efficiency of EarVAS, we implemented two advanced benchmark models. Evaluation results on EarSAVAS show EarVAS's effectiveness, with an accuracy of 90.84% and a Macro-AUC of 89.03%. Comprehensive ablation experiments on the benchmark models demonstrate the effectiveness of feedback microphone audio and highlight the potential value of sensor fusion in subject-aware vocal activity sensing on earables. We hope that EarSAVAS and the benchmark models will inspire other researchers to further explore efficient subject-aware human vocal activity sensing on earables.

Journal: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
Publication Date: May 13, 2024
License type: cc-by

Similar Papers

Predicting decompression surgery by applying multimodal deep learning to patients’ structured and unstructured health data
Chethan Jujjavarapu ... Jeffrey G Jarvik
BMC Medical Informatics and Decision Making | VOL. 23
06 Jan 2023

Automated acoustic monitoring of endangered common spadefoot toad populations reveals patterns of vocal activity
Guillaume Dutilleux ... Charlotte Curé
Freshwater Biology | VOL. 65
16 Apr 2018

Multi-modal deep learning for automated assembly of periapical radiographs
L Pfänder ... F Schwendicke
Journal of Dentistry | VOL. 135
21 Jun 2023

Illuminating the Nocturnal Habits of Owls with Emerging Tagging Technologies
Connor M Wood ... Sheila Whitmore
Wildlife Society Bulletin | VOL. 45
12 Feb 2021
