A comparison of speech recognition techniques and models

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/91985

Full metadata record

DC Field	Value	Language
dc.date.accessioned	2022-03-22T13:58:42Z	-
dc.date.available	2022-03-22T13:58:42Z	-
dc.date.issued	2021	-
dc.identifier.citation	Pizzuto, A. (2021). A comparison of speech recognition techniques and models (Bachelor's dissertation).	en_GB
dc.identifier.uri	https://www.um.edu.mt/library/oar/handle/123456789/91985	-
dc.description	B.Sc. IT (Hons)(Melit.)	en_GB
dc.description.abstract	There has been an increase in speech recognition software used in many devices. Primarily it had been introduced to the public mostly on smartphones. As technology is improving speech recognition software is found on different devices and household appliances such as televisions and even refrigerators. As people are using speech to text software more frequently this form of computer interaction is becoming the norm and so inaccuracies, such as words misunderstood for others, may cause issues in the future. The project aims to make comparisons to understand which model is best adapted when tested against the obtained dataset. This study will present a better view of how these devices and appliances work since speech recognition software seems to be becoming an attractive form of computer interaction. The project delineates different topics such as sound and pre-processing sound recordings. It then focuses on feature extraction, which retrieves key characteristics of sound waves, to be used in the speech recognition stage. An example of this is known as the ‘Mel-frequency cepstral coefficients’ (MFCC) that can be presented graphically by using a spectrogram. Ultimately, the data from the feature extraction process is fed into the different models. Three models, the Hidden Markov Model, a Convolutional Neural Network and the Dynamic Time Warping Algorithm, were selected and compared using the same datasets, and the results were evaluated using accuracy and running time.	en_GB
dc.language.iso	en	en_GB
dc.rights	info:eu-repo/semantics/restrictedAccess	en_GB
dc.subject	Automatic speech recognition	en_GB
dc.subject	Hidden Markov models	en_GB
dc.subject	Neural networks (Computer science)	en_GB
dc.title	A comparison of speech recognition techniques and models	en_GB
dc.type	bachelorThesis	en_GB
dc.rights.holder	The copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.	en_GB
dc.publisher.institution	University of Malta	en_GB
dc.publisher.department	Faculty of Information and Communication Technology. Department of Computer Information Systems	en_GB
dc.description.reviewed	N/A	en_GB
dc.contributor.creator	Pizzuto, Andrew (2021)	-
Appears in Collections:	Dissertations - FacICT - 2021 Dissertations - FacICTCIS - 2021

Files in This Item:

File	Description	Size	Format
21BITSD019.pdf Restricted Access		3.43 MB	Adobe PDF	View/Open Request a copy

Show simple item record Statistics