Corpus-based discourse analysis: Automatic Speech Recognition (ASR) technologies and spoken corpus collection

  • PhD student Uzbekistan State World Languages University, Teacher, TIFT University

DOI

https://doi.org/10.47689/2181-3701-vol3-iss4-pp1-8

Keywords

Automatic Speech Recognition (ASR) , speech transcription , corpus , spoken discourse analysis , second language acquisition

Abstract

This study investigates the integration of Automatic Speech Recognition (ASR) technologies in the collection and analysis of spoken learner corpora, with a focus on L2 contexts. Employing a mixed-methods design, the research evaluates the effectiveness of ASR systems, specifically, Whisper and BERT-based models in producing accurate transcriptions and facilitating language acquisition. Quantitative results demonstrate high transcription accuracy, while qualitative data reveal that ASR-supported feedback significantly enhances learner engagement, pronunciation, and speaking proficiency. Technological limitations and ethical concerns related to data privacy and feedback mechanisms are also taken into account. Overall, the findings highlight the transformative potential of ASR technologies in language education by enabling scalable, real-time assessment and personalized feedback, while underscoring the need for continued refinement and equitable implementation across diverse learner populations.

References

Nakamura, S., Spring, R., & Sakurai, S. (2024). The impact of ASR-based interactive video activities on speaking skills: Japanese EFL learners’ perceptions. The Electronic Journal for English as a Second Language, 27(4). https://doi.org/10.55593/ej.27108a5

Gladia. (2023, December 19). A review of the best ASR engines and the models powering them in 2024. Gladia Blog. https://www.gladia.io/blog/a-review-of-the-best-asr-engines-and-the-models-powering-them-in-2024

Michot, J., Hürlimann, M., Deriu, J., Sauer, L., Mlynchyk, K., & Cieliebak, M. (2024). Error-preserving automatic speech recognition of young English learners’ language. Beta archives, https://arxiv.org/html/2406.03235v1

Qian, Z., Xiao, K., & Yu, C. (2023). A survey of technologies for automatic dysarthric speech recognition. Journal of Audio, Speech, and Music Processing, 2023(48). https://doi.org/10.1186/s13636-023-00318-2

Bhatnagar, N. (2024 Deep dive into ASR systems. Medium. https://medium.com/@captnitinbhatnagar/deep-dive-into-asr-systems-c571a576ff26

Sun, W. (2023). The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: A mixed methods investigation. Frontiers in Psychology, 14, Article 1210187. https://doi.org/10.3389/fpsyg.2023.1210187

Southwell, R., Pugh, S., Perkoff, E. M., Clevenger, C., Bush, J., Lieber, R., Ward, W., Foltz, P., & D’Mello, S. (2022). Challenges and feasibility of automatic speech recognition for modeling student collaborative discourse in classrooms. In Proceedings of the 15th International Conference on Educational Data Mining (EDM 2022) (pp. 342–353). https://educationaldatamining.org/edm2022/proceedings/2022.EDM-long-papers.26/index.html

Downloads

63 20

Published

Corpus-based discourse analysis: Automatic Speech Recognition (ASR) technologies and spoken corpus collection

How to Cite

Asrorova, N. 2025. Corpus-based discourse analysis: Automatic Speech Recognition (ASR) technologies and spoken corpus collection. Foreign Linguistics and Lingvodidactics. 3, 4 (Jul. 2025), 1–8. DOI:https://doi.org/10.47689/2181-3701-vol3-iss4-pp1-8.

Issue

Section

13.00.00 - PEDAGOGICAL SCIENCES