Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/93832
Title: | Integrating annotated spoken Maltese data into corpora of written Maltese |
Authors: | Vella, Alexandra; Chetcuti, Flavia; Grech, Sarah; Spagnol, Michael |
Keywords: | Maltese language -- Grammar; Maltese language -- Morphology; Maltese language -- Phonology; Maltese language -- Terminology |
Issue Date: | 2010 |
Publisher: | LREC |
Citation: | Vella, A., Chetcuti, F., Grech, S., & Spagnol, M. (2010). Integrating annotated spoken Maltese data into corpora of written Maltese. Seventh conference on International Language Resources and Evaluation (LREC 2010), Workshop on Language Resources & Human Language Technologies for Semitic Languages, Valletta. 83-90. |
Abstract: | Spoken data features to a lesser extent in corpora available for languages than do written data. This paper addresses this issue by presenting work carried out to date on the development of a corpus of spoken Maltese. It outlines the standards for the PRAAT annotation of Maltese data at the orthographic level, and reports on preliminary work on the annotation of Maltese prosody and development of ToBI-style standards for Maltese. Procedures being developed for exporting PRAAT TextGrid information for the purposes of incorporation into a predominantly written corpus of Maltese are then discussed. The paper also demonstrates how characteristics of speech notoriously difficult to deal with have been tackled and how the exported output from the PRAAT annotations can be enhanced through the representation also of phenomena, sometimes referred to as “normal disfluencies”, which include “filled pauses” and other vocalisations of a quasi-lexical nature having various functions of a discourse-management type such as “backchannelling”. |
Appears in Collections: | Scholarly Works - FacArtMal |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Integrating_annotated_spoken_Maltese_data_into_corpora_of_written_Maltese.pdf (Restricted Access) | | 351.29 kB | Adobe PDF |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.