Towards Building a Language-Independent Speech Scoring Assessment

Article Status
Published
Authors/contributors
Title
Towards Building a Language-Independent Speech Scoring Assessment
Abstract
Automatic speech scoring is crucial in language learning, providing targeted feedback to language learners by assessing pronunciation, fluency, and other speech qualities. However, the scarcity of human-labeled data for languages beyond English poses a significant challenge in developing such systems. In this work, we propose a Language-Independent scoring approach to evaluate speech without relying on labeled data in the target language. We introduce a multilingual speech scoring system that leverages representations from the wav2vec 2.0 XLSR model and a force-alignment technique based on CTC-Segmentation to construct speech features. These features are used to train a machine learning model to predict pronunciation and fluency scores. We demonstrate the potential of our method by predicting expert ratings on a speech dataset spanning five languages - English, French, Spanish, German and Portuguese, and comparing its performance against Language-Specific models trained individually on each language, as well as a jointly-trained model on all languages. Results indicate that our approach shows promise as an initial step towards a universal language independent speech scoring.
Publication
Proceedings of the AAAI Conference on Artificial Intelligence
Volume
38
Issue
21
Pages
23200-23206
Date
2024-3-24
Journal Abbr
AAAI
ISSN
2374-3468
Accessed
14/06/2024, 04:28
Library Catalogue
DOI.org (Crossref)
Extra
Citation Key: gupta2024 <标题>: 迈向构建一种语言无关的语音评分评估 <AI Smry>: A multilingual speech scoring system that leverages representations from the wav2vec 2.0 XLSR model and a force-alignment technique based on CTC-Segmentation to construct speech features and shows promise as an initial step towards a universal language independent speech scoring.
Citation
Gupta, S., Unnam, A., Yadav, K., & Aggarwal, V. (2024). Towards Building a Language-Independent Speech Scoring Assessment. Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23200–23206. https://doi.org/10.1609/aaai.v38i21.30366
Powered by Zotero and Kerko.