Audio Forensics Research Articles

ABSTRACT This paper presents a new case study examining the use of police transcripts to assist the court in understanding poor-quality forensic audio admitted as evidence in criminal trials. The 1995 trial it studies was the first major Australian case to include extensive expert opinions about police transcripts provided by the prosecution. Despite the fact that experts on both sides noted serious problems with the police transcripts, the judge allowed them to assist the jury, with the expert opinions offered as (ineffectual) subsequent commentary. The legal procedures the judge used in doing this were upheld on appeal, and have been followed ever since as a model for judges admitting opinions of both police and experts. The paper demonstrates how these procedures (unintentionally) privileged the opinions of police “ad hoc” experts over those of genuine experts, enabling the erroneous transcripts to influence not only the 1995 verdict, but a 1997 appeal and a 2014 inquiry. Analysis reveals the reason for these anomalies as the fact that the procedures incorporate misconceptions about spoken language and its representation in a transcript, which, though they have been thoroughly refuted by linguistic science over many decades, remain deeply embedded in the “common knowledge” accepted by wider society – including powerful institutions such as the law. The paper ends by calling on Australian linguists to find effective ways to address the misconceptions that affect the legal handling of forensic audio, by building further on the success of other branches of forensic linguistics in seeking direct engagement with the judiciary outside the trial process. The first step in achieving this is for linguists to gain a thorough understanding of how the legal procedures for handling poor-quality forensic audio operate, both in principle and in practice. The aim of the present paper is to contribute to that understanding.

The transcription of covert recordings used as evidence in court is a major issue for forensic linguistics. Covert recordings are typically made under conditions in which the device needs to be hidden, so the resulting speech is generally indistinct, with overlapping voices and background noise, and in many cases the acoustic record cannot be analyzed via conventional phonetic techniques (i.e. phonetic segments are unclear, or no acoustic cues are present at all). Transcripts of indistinct audio, often produced by police working on the case, are frequently questionable, yet despite their unreliability they can be provided as evidence in court. Injustices can occur, and have occurred. Given the improving performance of automatic speech recognition (ASR) technologies, and growing reliance on such technologies in everyday life, a common question, asked especially by lawyers and other legal professionals, is whether ASR can solve the problem of determining what was said in indistinct forensic audio. This question is the main focus of the current paper. The paper also looks at forced alignment, a way of automatically aligning an existing transcript to audio. This area needs to be explored in the context of forensic linguistics because a transcript can technically be “aligned” with any audio, making the transcript seem “correct” even if it is not. The aim of this research is to demonstrate how automatic transcription systems fare with forensic-like audio, using more than one system. Forensic-like audio is most appropriate for research because there is greater certainty about what the speech material consists of (unlike in forensic situations, where the content cannot be verified). Examples of how various ASR systems cope with good-quality and indistinct audio are shown. When a good-quality recording is used, ASR systems cope well, and the resulting transcript is usable and, for the most part, accurate. When a poor-quality, forensic-like recording is used, on the other hand, the resulting transcript is effectively unusable, with numerous errors and very few words recognized (in some cases, none). The paper also demonstrates some of the problems that arise when forced alignment is used with indistinct forensic-like audio: the transcript is simply “forced” onto the audio signal, giving completely wrong alignment. This research shows that, as things currently stand, computational methods are not suitable for solving the problem of transcribing indistinct forensic audio, for a range of reasons. Such systems cannot transcribe what was said in indistinct covert recordings, nor can they determine who uttered the words and phrases in such recordings, nor prove that a transcript is “right” (or wrong). These systems can indeed be used advantageously in research and for various other purposes; the reasons they do not work for forensic transcription stem from the nature of the recording conditions, as well as the nature of the forensic context.

Articles published on Audio Forensics

Exploring the Effectiveness of the Phase Features on Double Compressed AMR Speech Detection

1D-CNN-based audio tampering detection using ENF signals

Detecting Forged Audio Files Using "Mixed Paste" Command: A Deep Learning Approach Based on Korean Phonemic Features.

Automatic speech recognition and the transcription of indistinct forensic audio: how do the new generation of systems fare?

A novel Approach for Audio-based Video Analysis via MFCC Features

Anti Forensik Voice Note Menggunakan Whatsapp Mod

The Eastman transcripts: A case study calling Australian linguists to action against legal misconceptions about language in forensic evidence

Robust Audio Copy-Move Forgery Detection Using Constant Q Spectral Sketches and GA-SVM

Audio forensics behind the Iron Curtain: from raw sounds to expert testimony

Source Microphone Identification Using Swin Transformer

Robust audio copy-move forgery detection on short forged slices using sliding window

Source Acquisition Device Identification from Recorded Audio Based on Spatiotemporal Representation Learning with Multi-Attention Mechanisms

Transformer for authenticating the source microphone in digital audio forensics

ANALYSIS OF THE IMPACT OF DISTORTION ON SOUND RECORDINGS AS ANTI FORENSIC ACTIVITIES

A Universal Audio Steganalysis Scheme Based on Multiscale Spectrograms and DeepResNet

Analisis Rekaman Suara Magic Call pada Provider Seluler dengan Rekaman Suara Asli.

Detection of audio copy-move-forgery with novel feature matching on Mel spectrogram

Other than Ethical: STS-Oriented Approaches to Communist Audio Forensics

A Framework for Deciding How to Create and Evaluate Transcripts for Forensic and Other Purposes

Does Automatic Speech Recognition (ASR) Have a Role in the Transcription of Indistinct Covert Recordings for Forensic Purposes?
