The University of Southampton
University of Southampton Institutional Repository

Automated generation of “good enough” transcripts as a first step to transcription of audio-recorded data

Automated generation of “good enough” transcripts as a first step to transcription of audio-recorded data
Automated generation of “good enough” transcripts as a first step to transcription of audio-recorded data
In the last decade automated captioning services have appeared in mainstream technology use. Until now the focus of these services have been on the technical aspects, supporting pupils with special educational needs and supporting teaching and learning of second language students. Only limited explorations have been attempted regarding its use for research purposes: transcription of audio recordings. This paper presents a proof-of-concept exploration utilising three examples of automated transcription of audio recordings from different contexts; an interview, a public hearing and a classroom setting, and compares them against ‘manual’ transcription techniques in each case. It begins with an overview of literature on automated captioning and the use of voice recognition tools for the purposes of transcription. An account is provided of the specific processes and tools used for the generation of the automated captions followed by some basic processing of the captions to produce automated transcripts. Originality checking software was used to determine a percentage match between the automated transcript and a manual version as a basic measure of the potential usability of each of the automated transcripts. Some analysis of the more common and persistent mismatches observed between automated and manual transcripts is provided, revealing that the majority of mismatches would be easily identified and rectified in a review and edit of the automated transcript. Finally, some of the challenges and limitations of the approach are considered. These limitations notwithstanding, we conclude that this form of automated transcription provides ‘good enough’ transcription for first versions of transcripts. The time and cost advantages of this could be considerable, even for the production of summary or gisted transcripts.
1-14
Bokhove, Christian
7fc17e5b-9a94-48f3-a387-2ccf60d2d5d8
Downey, Christopher
bb95b259-2e31-401b-8edf-78e8d76bfb8c
Bokhove, Christian
7fc17e5b-9a94-48f3-a387-2ccf60d2d5d8
Downey, Christopher
bb95b259-2e31-401b-8edf-78e8d76bfb8c

Bokhove, Christian and Downey, Christopher (2018) Automated generation of “good enough” transcripts as a first step to transcription of audio-recorded data. Methodological Innovations, 11 (2), 1-14. (doi:10.1177/2059799118790743).

Record type: Article

Abstract

In the last decade automated captioning services have appeared in mainstream technology use. Until now the focus of these services have been on the technical aspects, supporting pupils with special educational needs and supporting teaching and learning of second language students. Only limited explorations have been attempted regarding its use for research purposes: transcription of audio recordings. This paper presents a proof-of-concept exploration utilising three examples of automated transcription of audio recordings from different contexts; an interview, a public hearing and a classroom setting, and compares them against ‘manual’ transcription techniques in each case. It begins with an overview of literature on automated captioning and the use of voice recognition tools for the purposes of transcription. An account is provided of the specific processes and tools used for the generation of the automated captions followed by some basic processing of the captions to produce automated transcripts. Originality checking software was used to determine a percentage match between the automated transcript and a manual version as a basic measure of the potential usability of each of the automated transcripts. Some analysis of the more common and persistent mismatches observed between automated and manual transcripts is provided, revealing that the majority of mismatches would be easily identified and rectified in a review and edit of the automated transcript. Finally, some of the challenges and limitations of the approach are considered. These limitations notwithstanding, we conclude that this form of automated transcription provides ‘good enough’ transcription for first versions of transcripts. The time and cost advantages of this could be considerable, even for the production of summary or gisted transcripts.

Text
art autocaptions methinno revision 2 pure - Accepted Manuscript
Download (4MB)
Text
2059799118790743 - Version of Record
Download (1MB)

More information

Accepted/In Press date: 1 July 2018
e-pub ahead of print date: 9 August 2018

Identifiers

Local EPrints ID: 422043
URI: http://eprints.soton.ac.uk/id/eprint/422043
PURE UUID: 431d8e36-f5f2-4118-865a-8d69760426eb
ORCID for Christian Bokhove: ORCID iD orcid.org/0000-0002-4860-8723
ORCID for Christopher Downey: ORCID iD orcid.org/0000-0002-6094-0534

Catalogue record

Date deposited: 13 Jul 2018 16:30
Last modified: 16 Mar 2024 06:49

Export record

Altmetrics

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×