Using speech recognition for real-time captioning of multiple speakers
Using speech recognition for real-time captioning of multiple speakers
Meetings and seminars involving many people speaking can be some of the hardest situations for deaf people to be able to follow what is being said and also for people with physical, visual or cognitive disabilities to take notes or remember key points. People may also be absent during important interactions or they may arrive late or leave early. Real time captioning using phonetic keyboards can provide an accurate live as well as archived transcription of what has been said but is often not available because of the cost and shortage of highly skilled and trained stenographers. This paper describes the development of applications that use speech recognition to provide automatic real time text transcriptions in situations when there can be many people speaking.
56-57
Wald, Mike
90577cfd-35ae-4e4a-9422-5acffecd89d5
Bain, Keith
85da94eb-1c50-4647-a543-4f0771efe937
October 2008
Wald, Mike
90577cfd-35ae-4e4a-9422-5acffecd89d5
Bain, Keith
85da94eb-1c50-4647-a543-4f0771efe937
Wald, Mike and Bain, Keith
(2008)
Using speech recognition for real-time captioning of multiple speakers.
IEEE MultiMedia, 15 (4), .
(doi:10.1109/MMUL.2008.99).
Abstract
Meetings and seminars involving many people speaking can be some of the hardest situations for deaf people to be able to follow what is being said and also for people with physical, visual or cognitive disabilities to take notes or remember key points. People may also be absent during important interactions or they may arrive late or leave early. Real time captioning using phonetic keyboards can provide an accurate live as well as archived transcription of what has been said but is often not available because of the cost and shortage of highly skilled and trained stenographers. This paper describes the development of applications that use speech recognition to provide automatic real time text transcriptions in situations when there can be many people speaking.
Text
IEEEmultimedia_short_article2.doc
- Accepted Manuscript
More information
Published date: October 2008
Organisations:
Web & Internet Science
Identifiers
Local EPrints ID: 266633
URI: http://eprints.soton.ac.uk/id/eprint/266633
ISSN: 1070-986X
PURE UUID: aa88d7ff-7940-4957-a662-d97e8f8335cd
Catalogue record
Date deposited: 08 Sep 2008 14:53
Last modified: 14 Mar 2024 08:32
Export record
Altmetrics
Contributors
Author:
Mike Wald
Author:
Keith Bain
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics