Abstract

<p>Audio messaging and voice-based interactions are growing in popularity. Lexical features of a manually-curated dataset of real-world audio tweets, as well as text and video/image tweets from the same user accounts, are analyzed to explore how user-generated audio differs from text. The toxicity, sentiment, topic and length of audio tweet transcripts are compared with their accompanying text, date-matched text tweets from the same users and date-matched video/image tweets and their accompanying text. Audio tweets were significantly less toxic than both text tweets and text that accompanied the audio tweet, as well as significantly lower sentiment than their accompanying text. The topics and word counts of audio, text and video/image tweets also differed. These findings are then used to derive design implications for audio and conversational agent interaction. This research contributes preliminary insights about audio social media messages that may help researchers and designers of audio- and agent-based interaction better understand and design for different media formats.</p>

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.