The University of Southampton
University of Southampton Institutional Repository

Interactive augmented reality system for learning phonetics using artificial intelligence


Tolba, Rahma M., Elarif, Taha, Taha, Zaki and Hammady, Ramy (2024) Interactive augmented reality system for learning phonetics using artificial intelligence. IEEE Access, 12, 78219-78231. (doi:10.1109/ACCESS.2024.3406494).

Record type: Article

Abstract

The increasing adoption of language learning apps that utilize Augmented Reality (AR) and Artificial Intelligence (AI) for speech recognition has sparked interest in their potential benefits for phonetics education. However, currently available AR apps focus only on teaching letter names and vocabulary and do not offer a more immersive learning experience. To address this limitation, this paper introduces an interactive system that integrates AI speech recognition with AR to provide an engaging learning experience. To showcase the capabilities of the proposed system, we have created a prototype for the Arabic Phonetic Atlas textbook. This prototype enhances the /s/ sound page of the Atlas by overlaying a 3D animated model of the speech organs on the existing 2D image. The dynamic animation of the 3D model reflects the sound description provided in the Atlas. The system also offers real-time feedback on the user's pronunciation through a customized AI phoneme recognition system. A comprehensive user study involving 83 adult participants aged 18 to 40 was conducted to evaluate the usability and learning impact of the proposed system. The assessment combined direct and indirect observation with several surveys to gather both quantitative and qualitative data. The findings indicate not only a greater level of understanding compared with conventional methods but also an improved ability to master specific phonemes quickly and effortlessly. They also show strong potential for the proposed system to be incorporated into conventional classroom settings as an instructional aid.

Text
Interactive_Augmented_Reality_System_for_Learning_Phonetics_Using_Artificial_Intelligence - Version of Record
Available under License Creative Commons Attribution.

More information

Accepted/In Press date: 24 May 2024
Published date: 28 May 2024

Identifiers

Local EPrints ID: 500888
URI: http://eprints.soton.ac.uk/id/eprint/500888
ISSN: 2169-3536
PURE UUID: 3cb0bda0-6169-4bf7-9b5a-076c71aa3843
ORCID for Ramy Hammady: orcid.org/0000-0003-4764-6039

Catalogue record

Date deposited: 15 May 2025 16:30
Last modified: 22 Aug 2025 02:49

Contributors

Author: Rahma M. Tolba
Author: Taha Elarif
Author: Zaki Taha
Author: Ramy Hammady
