Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/93832
Full metadata record
DC Field | Value | Language
dc.contributor.author | Vella, Alexandra | -
dc.contributor.author | Chetcuti, Flavia | -
dc.contributor.author | Grech, Sarah | -
dc.contributor.author | Spagnol, Michael | -
dc.date.accessioned | 2022-04-14T13:28:16Z | -
dc.date.available | 2022-04-14T13:28:16Z | -
dc.date.issued | 2010 | -
dc.identifier.citation | Vella, A., Chetcuti, F., Grech, S., & Spagnol, M. (2010). Integrating annotated spoken Maltese data into corpora of written Maltese. Seventh conference on International Language Resources and Evaluation (LREC 2010), Workshop on Language Resources & Human Language Technologies for Semitic Languages, Valletta. 83-90. | en_GB
dc.identifier.uri | https://www.um.edu.mt/library/oar/handle/123456789/93832 | -
dc.description.abstract | Spoken data features to a lesser extent in corpora available for languages than do written data. This paper addresses this issue by presenting work carried out to date on the development of a corpus of spoken Maltese. It outlines the standards for the PRAAT annotation of Maltese data at the orthographic level, and reports on preliminary work on the annotation of Maltese prosody and development of ToBI-style standards for Maltese. Procedures being developed for exporting PRAAT TextGrid information for the purposes of incorporation into a predominantly written corpus of Maltese are then discussed. The paper also demonstrates how characteristics of speech notoriously difficult to deal with have been tackled and how the exported output from the PRAAT annotations can be enhanced through the representation also of phenomena, sometimes referred to as “normal disfluencies”, which include “filled pauses” and other vocalisations of a quasi-lexical nature having various functions of a discourse-management type such as “backchannelling”. | en_GB
dc.language.iso | en | en_GB
dc.publisher | LREC | en_GB
dc.rights | info:eu-repo/semantics/restrictedAccess | en_GB
dc.subject | Maltese language -- Grammar | en_GB
dc.subject | Maltese language -- Morphology | en_GB
dc.subject | Maltese language -- Phonology | en_GB
dc.subject | Maltese language -- Terminology | en_GB
dc.title | Integrating annotated spoken Maltese data into corpora of written Maltese | en_GB
dc.type | conferenceObject | en_GB
dc.rights.holder | The copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder | en_GB
dc.bibliographicCitation.conferencename | Seventh conference on International Language Resources and Evaluation (LREC 2010), Workshop on Language Resources & Human Language Technologies for Semitic Languages | en_GB
dc.bibliographicCitation.conferenceplace | Valletta, Malta, 17/05/2010 | en_GB
dc.description.reviewed | peer-reviewed | en_GB
Appears in Collections: Scholarly Works - FacArtMal

Files in This Item:
File | Description | Size | Format
Integrating_annotated_spoken_Maltese_data_into_corpora_of_written_Maltese.pdf (Restricted Access) | - | 351.29 kB | Adobe PDF

