Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/119355
Title: | Resources and tools for pre-processing speech data in a lesser-known variety of English |
Other Titles: | Proceedings of the 20th International Congress of Phonetic Sciences |
Authors: | Vella, Alexandra Grech, Sarah Padovani, Ian Micallef, Maria-Christina |
Keywords: | Information resources Phonetics Grammar, Comparative and general -- Phonology Malta -- Languages English language -- Variation |
Issue Date: | 2023 |
Publisher: | Guarant International |
Citation: | Vella, A., Grech, S., Padovani, I., & Micallef, M. C. (2023). Resources and tools for pre-processing speech data in a lesser-known variety of English. In R. Skarnitzl & J. Volín (Eds.), Proceedings of the 20th International Congress of Phonetic Sciences (pp. 3369–3373). Prague: Guarant International |
Abstract: | Research on lesser-known language varieties can be hindered from the outset by the need for both data and tools for automating the required pre-processing work. For speech, whilst more ecologically valid data in the form of video and audio are sometimes available, these need to be accompanied by a machine-readable text, ideally segmented and labelled, both allowing for searchability. A significant initial commitment is needed even before the relevant phonetic and phonological research can begin. This paper demonstrates and evaluates the efficacy of already existing tools (YouTube captioning and WebMAUS forced alignment) in automating the pre-processing work required using Maltese English (MaltE), whilst also showcasing a sample analysis of the pronunciation of post-vocalic ‘r’ in the variety. As a low resource variety of English, MaltE presents a test case for showing how existing resources and tools can be utilised to work with language varieties which are digitally less well-supported. |
URI: | https://www.um.edu.mt/library/oar/handle/123456789/119355 |
ISBN: | 9788090811423 |
Appears in Collections: | Scholarly Works - InsLin |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Resources_and_tools_for_pre_processing_speech_data_in_a_lesser_known_variety_of_English.pdf | 940.6 kB | Adobe PDF | View/Open |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.