Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/93832
Title: Integrating annotated spoken Maltese data into corpora of written Maltese
Authors: Vella, Alexandra
Chetcuti, Flavia
Grech, Sarah
Spagnol, Michael
Keywords: Maltese language -- Grammar
Maltese language -- Morphology
Maltese language -- Phonology
Maltese language -- Terminology
Issue Date: 2010
Publisher: LREC
Citation: Vella, A., Chetcuti, F., Grech, S., & Spagnol, M. (2010). Integrating annotated spoken Maltese data into corpora of written Maltese. Seventh conference on International Language Resources and Evaluation (LREC 2010), Workshop on Language Resources & Human Language Technologies for Semitic Languages, Valletta. 83-90.
Abstract: Spoken data features to a lesser extent in corpora available for languages than do written data. This paper addresses this issue by presenting work carried out to date on the development of a corpus of spoken Maltese. It outlines the standards for the PRAAT annotation of Maltese data at the orthographic level, and reports on preliminary work on the annotation of Maltese prosody and development of ToBI-style standards for Maltese. Procedures being developed for exporting PRAAT TextGrid information for the purposes of incorporation into a predominantly written corpus of Maltese are then discussed. The paper also demonstrates how characteristics of speech notoriously difficult to deal with have been tackled and how the exported output from the PRAAT annotations can be enhanced through the representation also of phenomena, sometimes referred to as “normal disfluencies”, which include “filled pauses” and other vocalisations of a quasi-lexical nature having various functions of a discourse-management type such as “backchannelling”.
URI: https://www.um.edu.mt/library/oar/handle/123456789/93832
Appears in Collections:Scholarly Works - FacArtMal

Files in This Item:
File Description SizeFormat 
Integrating_annotated_spoken_Maltese_data_into_corpora_of_written_Maltese.pdf
  Restricted Access
351.29 kBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.