The University of Southampton
University of Southampton Institutional Repository

Using speech recognition for real-time captioning of multiple speakers

Using speech recognition for real-time captioning of multiple speakers
Using speech recognition for real-time captioning of multiple speakers
Meetings and seminars involving many people speaking can be some of the hardest situations for deaf people to be able to follow what is being said and also for people with physical, visual or cognitive disabilities to take notes or remember key points. People may also be absent during important interactions or they may arrive late or leave early. Real time captioning using phonetic keyboards can provide an accurate live as well as archived transcription of what has been said but is often not available because of the cost and shortage of highly skilled and trained stenographers. This paper describes the development of applications that use speech recognition to provide automatic real time text transcriptions in situations when there can be many people speaking.
1070-986X
56-57
Wald, Mike
90577cfd-35ae-4e4a-9422-5acffecd89d5
Bain, Keith
85da94eb-1c50-4647-a543-4f0771efe937
Wald, Mike
90577cfd-35ae-4e4a-9422-5acffecd89d5
Bain, Keith
85da94eb-1c50-4647-a543-4f0771efe937

Wald, Mike and Bain, Keith (2008) Using speech recognition for real-time captioning of multiple speakers. IEEE MultiMedia, 15 (4), 56-57. (doi:10.1109/MMUL.2008.99).

Record type: Article

Abstract

Meetings and seminars involving many people speaking can be some of the hardest situations for deaf people to be able to follow what is being said and also for people with physical, visual or cognitive disabilities to take notes or remember key points. People may also be absent during important interactions or they may arrive late or leave early. Real time captioning using phonetic keyboards can provide an accurate live as well as archived transcription of what has been said but is often not available because of the cost and shortage of highly skilled and trained stenographers. This paper describes the development of applications that use speech recognition to provide automatic real time text transcriptions in situations when there can be many people speaking.

Text
IEEEmultimedia_short_article2.doc - Accepted Manuscript
Download (327kB)

More information

Published date: October 2008
Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 266633
URI: http://eprints.soton.ac.uk/id/eprint/266633
ISSN: 1070-986X
PURE UUID: aa88d7ff-7940-4957-a662-d97e8f8335cd

Catalogue record

Date deposited: 08 Sep 2008 14:53
Last modified: 14 Mar 2024 08:32

Export record

Altmetrics

Contributors

Author: Mike Wald
Author: Keith Bain

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×