Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/16474
Full metadata record
DC FieldValueLanguage
dc.contributor.authorGauci, Oliver-
dc.contributor.authorDebono, Carl James-
dc.contributor.authorMicallef, Paul-
dc.date.accessioned2017-02-08T14:15:48Z-
dc.date.available2017-02-08T14:15:48Z-
dc.date.issued2008-
dc.identifier.citationGauci, O., Debono, C. J., & Micallef, P. (2008). A maximum log-likelihood approach to voice activity detection. 3rd International Symposium on Communications, Control and Signal Processing, St. Julians. 383-387.en_GB
dc.identifier.urihttps://www.um.edu.mt/library/oar//handle/123456789/16474-
dc.description.abstractModern voice activity detection (VAD) algorithms must achieve reliable operation at low signal-to-noise ratios (SNR). Although a lot of research has been performed to solve this issue, the operation of existing VAD algorithms is still far away from ideal. In this paper, we present a novel VAD algorithm, in which we apply the Teager energy cepstral coefficients, to obtain a noise robust feature extraction method, together with Gaussian mixture models that serve for the classification of speech and silence periods. In the suggested solution, the threshold method used in many noise robust VAD algorithms is eliminated, thus favoring its use in real applications. The performance of this novel algorithm was tested under known and unknown noise statistics, and compared to a statistical model-based approach found in literature. The results obtained show that the proposed solution achieves better accuracy and significantly reduces clipping of speech periods; thus achieving superior signal quality.en_GB
dc.language.isoenen_GB
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_GB
dc.rightsinfo:eu-repo/semantics/restrictedAccessen_GB
dc.subjectSpeech processing systemsen_GB
dc.subjectRandom noise theoryen_GB
dc.subjectAmbient soundsen_GB
dc.titleA maximum log-likelihood approach to voice activity detectionen_GB
dc.typeconferenceObjecten_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.en_GB
dc.bibliographicCitation.conferencename3rd International Symposium on Communications, Control and Signal Processingen_GB
dc.bibliographicCitation.conferenceplaceSt. Julians, Malta, 12-14/03/2008en_GB
dc.description.reviewedpeer-revieweden_GB
dc.identifier.doi10.1109/ISCCSP.2008.4537255-
Appears in Collections:Scholarly Works - FacICTCCE

Files in This Item:
File Description SizeFormat 
Conference Paper - A maximum log-likelihood approach to voice activity detection.pdf
  Restricted Access
A maximum log-likelihood approach to voice activity detection309.87 kBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.