DOI
https://doi.org/10.47689/2181-3701-vol3-iss4-pp1-8Keywords
Automatic Speech Recognition (ASR) , speech transcription , corpus , spoken discourse analysis , second language acquisitionAbstract
This study investigates the integration of Automatic Speech Recognition (ASR) technologies in the collection and analysis of spoken learner corpora, with a focus on L2 contexts. Employing a mixed-methods design, the research evaluates the effectiveness of ASR systems, specifically, Whisper and BERT-based models in producing accurate transcriptions and facilitating language acquisition. Quantitative results demonstrate high transcription accuracy, while qualitative data reveal that ASR-supported feedback significantly enhances learner engagement, pronunciation, and speaking proficiency. Technological limitations and ethical concerns related to data privacy and feedback mechanisms are also taken into account. Overall, the findings highlight the transformative potential of ASR technologies in language education by enabling scalable, real-time assessment and personalized feedback, while underscoring the need for continued refinement and equitable implementation across diverse learner populations.
References
Nakamura, S., Spring, R., & Sakurai, S. (2024). The impact of ASR-based interactive video activities on speaking skills: Japanese EFL learners’ perceptions. The Electronic Journal for English as a Second Language, 27(4). https://doi.org/10.55593/ej.27108a5
Gladia. (2023, December 19). A review of the best ASR engines and the models powering them in 2024. Gladia Blog. https://www.gladia.io/blog/a-review-of-the-best-asr-engines-and-the-models-powering-them-in-2024
Michot, J., Hürlimann, M., Deriu, J., Sauer, L., Mlynchyk, K., & Cieliebak, M. (2024). Error-preserving automatic speech recognition of young English learners’ language. Beta archives, https://arxiv.org/html/2406.03235v1
Qian, Z., Xiao, K., & Yu, C. (2023). A survey of technologies for automatic dysarthric speech recognition. Journal of Audio, Speech, and Music Processing, 2023(48). https://doi.org/10.1186/s13636-023-00318-2
Bhatnagar, N. (2024 Deep dive into ASR systems. Medium. https://medium.com/@captnitinbhatnagar/deep-dive-into-asr-systems-c571a576ff26
Sun, W. (2023). The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: A mixed methods investigation. Frontiers in Psychology, 14, Article 1210187. https://doi.org/10.3389/fpsyg.2023.1210187
Southwell, R., Pugh, S., Perkoff, E. M., Clevenger, C., Bush, J., Lieber, R., Ward, W., Foltz, P., & D’Mello, S. (2022). Challenges and feasibility of automatic speech recognition for modeling student collaborative discourse in classrooms. In Proceedings of the 15th International Conference on Educational Data Mining (EDM 2022) (pp. 342–353). https://educationaldatamining.org/edm2022/proceedings/2022.EDM-long-papers.26/index.html
Downloads
63 20Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Наргиза Асророва

This work is licensed under a Creative Commons Attribution 4.0 International License.











