Using speech recognition for real-time captioning of multiple speakers

Wald, Mike and Bain, Keith (2008) Using speech recognition for real-time captioning of multiple speakers. IEEE MultiMedia, 15 (4), 56-57. (doi:10.1109/MMUL.2008.99).

Record type: Article

Abstract

Meetings and seminars involving many people speaking can be some of the hardest situations for deaf people to be able to follow what is being said and also for people with physical, visual or cognitive disabilities to take notes or remember key points. People may also be absent during important interactions or they may arrive late or leave early. Real time captioning using phonetic keyboards can provide an accurate live as well as archived transcription of what has been said but is often not available because of the cost and shortage of highly skilled and trained stenographers. This paper describes the development of applications that use speech recognition to provide automatic real time text transcriptions in situations when there can be many people speaking.

Text

IEEEmultimedia_short_article2.doc - Accepted Manuscript

Download (327kB)