Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/91985
Title: A comparison of speech recognition techniques and models
Authors: Pizzuto, Andrew (2021)
Keywords: Automatic speech recognition
Hidden Markov models
Neural networks (Computer science)
Issue Date: 2021
Citation: Pizzuto, A. (2021). A comparison of speech recognition techniques and models (Bachelor's dissertation).
Abstract: There has been an increase in speech recognition software used in many devices. Primarily it had been introduced to the public mostly on smartphones. As technology is improving speech recognition software is found on different devices and household appliances such as televisions and even refrigerators. As people are using speech to text software more frequently this form of computer interaction is becoming the norm and so inaccuracies, such as words misunderstood for others, may cause issues in the future. The project aims to make comparisons to understand which model is best adapted when tested against the obtained dataset. This study will present a better view of how these devices and appliances work since speech recognition software seems to be becoming an attractive form of computer interaction. The project delineates different topics such as sound and pre-processing sound recordings. It then focuses on feature extraction, which retrieves key characteristics of sound waves, to be used in the speech recognition stage. An example of this is known as the ‘Mel-frequency cepstral coefficients’ (MFCC) that can be presented graphically by using a spectrogram. Ultimately, the data from the feature extraction process is fed into the different models. Three models, the Hidden Markov Model, a Convolutional Neural Network and the Dynamic Time Warping Algorithm, were selected and compared using the same datasets, and the results were evaluated using accuracy and running time.
Description: B.Sc. IT (Hons)(Melit.)
URI: https://www.um.edu.mt/library/oar/handle/123456789/91985
Appears in Collections:Dissertations - FacICT - 2021
Dissertations - FacICTCIS - 2021

Files in This Item:
File Description SizeFormat 
21BITSD019.pdf
  Restricted Access
3.43 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.