Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/110339
Full metadata record
DC FieldValueLanguage
dc.date.accessioned2023-06-02T06:31:10Z-
dc.date.available2023-06-02T06:31:10Z-
dc.date.issued2021-
dc.identifier.citationDeMarco, A., Mena, C., Gatt, A., Borg, C., Williams, A., & van der Plas, L. (2021). Analysis of Data Augmentation Methods for Low-Resource Maltese ASR. arXiv e-prints, DOI: 10.48550/arXiv.2111.07793.en_GB
dc.identifier.urihttps://www.um.edu.mt/library/oar/handle/123456789/110339-
dc.description.abstractRecent years have seen an increased interest in the computational speech processing of Maltese, but resources remain sparse. In this paper, we consider data augmentation techniques for improving speech recognition for low-resource languages, focusing on Maltese as a test case. We consider three different types of data augmentation: unsupervised training, multilingual training and the use of synthesized speech as training data. The goal is to determine which of these techniques, or combination of them, is the most effective to improve speech recognition for languages where the starting point is a small corpus of approximately 7 hours of transcribed speech. Our results show that combining the data augmentation techniques studied here lead us to an absolute WER improvement of 15% without the use of a language model.en_GB
dc.language.isoenen_GB
dc.rightsinfo:eu-repo/semantics/openAccessen_GB
dc.subjectSpeech perceptionen_GB
dc.subjectSpeech synthesisen_GB
dc.subjectLinguisticsen_GB
dc.subjectAutomatic speech recognitionen_GB
dc.subjectNatural language processing (Computer science)en_GB
dc.subjectComputational linguisticsen_GB
dc.subjectSpeech processing systemsen_GB
dc.titleAnalysis of data augmentation methods for low-resource Maltese ASRen_GB
dc.typearticleen_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holderen_GB
dc.description.reviewednon peer-revieweden_GB
dc.identifier.doi10.48550/arXiv.2111.07793-
dc.publication.titlearXiv e-printsen_GB
dc.contributor.creatorDeMarco, Andrea-
dc.contributor.creatorMena, Carlos-
dc.contributor.creatorGatt, Albert-
dc.contributor.creatorBorg, Claudia-
dc.contributor.creatorWilliams, Aiden-
dc.contributor.creatorvan der Plas, Lonneke-
Appears in Collections:Scholarly Works - InsSSA

Files in This Item:
File Description SizeFormat 
Analysis_of_data_augmentation_methods_for_low_resource_Maltese_ASR.pdf243.37 kBAdobe PDFView/Open


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.