Acoustic approaches to gender and accent identification

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/92446

Full metadata record

DC Field	Value	Language
dc.date.accessioned	2022-03-28T06:12:40Z	-
dc.date.available	2022-03-28T06:12:40Z	-
dc.date.issued	2015	-
dc.identifier.citation	DeMarco, A. (2015). Acoustic approaches to gender and accent identification (Doctoral dissertation).	en_GB
dc.identifier.uri	https://www.um.edu.mt/library/oar/handle/123456789/92446	-
dc.description.abstract	There has been considerable research on the problems of speaker and language recognition from samples of speech. A less researched problem is that of accent recognition. Although this is a similar problem to language identification, different accents of a language exhibit more fine-grained differences between classes than languages. This presents a tougher problem for traditional classification techniques. In this thesis, we propose and evaluate a number of techniques for gender and accent classification. These techniques are novel modifications and extensions to state of the art algorithms, and they result in enhanced performance on gender and accent recognition. The first part of the thesis focuses on the problem of gender identification, and presents a technique that gives improved performance in situations where training and test conditions are mismatched. The bulk of this thesis is concerned with the application of the i-Vector technique to accent identification, which is the most successful approach to acoustic classification to have emerged in recent years. We show that it is possible to achieve high accuracy accent identification without reliance on transcriptions and without utilising phoneme recognition algorithms. The thesis describes various stages in the development of i-Vector based accent classification that improve the standard approaches usually applied for speaker or language identification, which are insufficient. We demonstrate that very good accent identification performance is possible with acoustic methods by considering different i-Vector projections, frontend parameters, i-Vector configuration parameters, and an optimised fusion of the resulting i-Vector classifiers we can obtain from the same data. We claim to have achieved the best accent identification performance on the test corpus for acoustic methods, with up to 90% identification rate. This performance is even better than previously reported acoustic-phonotactic based systems on the same corpus, and is very close to performance obtained via transcription based accent identification. Finally, we demonstrate that the utilization of our techniques for speech recognition purposes leads to considerably lower word error rates.	en_GB
dc.language.iso	en	en_GB
dc.publisher	University of East Anglia, England	en_GB
dc.rights	info:eu-repo/semantics/restrictedAccess	en_GB
dc.subject	Speech perception	en_GB
dc.subject	Language and languages	en_GB
dc.subject	Speech processing systems	en_GB
dc.subject	Natural language processing (Computer science)	en_GB
dc.subject	Automatic speech recognition	en_GB
dc.title	Acoustic approaches to gender and accent identification	en_GB
dc.type	doctoralThesis	en_GB
dc.rights.holder	The copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.	en_GB
dc.description.reviewed	N/A	en_GB
dc.contributor.creator	DeMarco, Andrea	-
Appears in Collections:	Scholarly Works - InsSSA

Files in This Item:

File	Description	Size	Format
Acoustic_Approaches_to_Gender_and_Accent_Identification(2015).pdf Restricted Access		7.31 MB	Adobe PDF	View/Open Request a copy

Show simple item record Statistics