Analysis of data augmentation methods for low-resource Maltese ASR

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/110339

Title:	Analysis of data augmentation methods for low-resource Maltese ASR
Authors:	DeMarco, Andrea Mena, Carlos Gatt, Albert Borg, Claudia Williams, Aiden van der Plas, Lonneke
Keywords:	Speech perception Speech synthesis Linguistics Automatic speech recognition Natural language processing (Computer science) Computational linguistics Speech processing systems
Issue Date:	2021
Citation:	DeMarco, A., Mena, C., Gatt, A., Borg, C., Williams, A., & van der Plas, L. (2021). Analysis of Data Augmentation Methods for Low-Resource Maltese ASR. arXiv e-prints, DOI: 10.48550/arXiv.2111.07793.
Abstract:	Recent years have seen an increased interest in the computational speech processing of Maltese, but resources remain sparse. In this paper, we consider data augmentation techniques for improving speech recognition for low-resource languages, focusing on Maltese as a test case. We consider three different types of data augmentation: unsupervised training, multilingual training and the use of synthesized speech as training data. The goal is to determine which of these techniques, or combination of them, is the most effective to improve speech recognition for languages where the starting point is a small corpus of approximately 7 hours of transcribed speech. Our results show that combining the data augmentation techniques studied here lead us to an absolute WER improvement of 15% without the use of a language model.
URI:	https://www.um.edu.mt/library/oar/handle/123456789/110339
Appears in Collections:	Scholarly Works - InsSSA

Files in This Item:

File	Description	Size	Format
Analysis_of_data_augmentation_methods_for_low_resource_Maltese_ASR.pdf		243.37 kB	Adobe PDF	View/Open

Show full item record Statistics