Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/91985
Full metadata record
DC FieldValueLanguage
dc.date.accessioned2022-03-22T13:58:42Z-
dc.date.available2022-03-22T13:58:42Z-
dc.date.issued2021-
dc.identifier.citationPizzuto, A. (2021). A comparison of speech recognition techniques and models (Bachelor's dissertation).en_GB
dc.identifier.urihttps://www.um.edu.mt/library/oar/handle/123456789/91985-
dc.descriptionB.Sc. IT (Hons)(Melit.)en_GB
dc.description.abstractThere has been an increase in speech recognition software used in many devices. Primarily it had been introduced to the public mostly on smartphones. As technology is improving speech recognition software is found on different devices and household appliances such as televisions and even refrigerators. As people are using speech to text software more frequently this form of computer interaction is becoming the norm and so inaccuracies, such as words misunderstood for others, may cause issues in the future. The project aims to make comparisons to understand which model is best adapted when tested against the obtained dataset. This study will present a better view of how these devices and appliances work since speech recognition software seems to be becoming an attractive form of computer interaction. The project delineates different topics such as sound and pre-processing sound recordings. It then focuses on feature extraction, which retrieves key characteristics of sound waves, to be used in the speech recognition stage. An example of this is known as the ‘Mel-frequency cepstral coefficients’ (MFCC) that can be presented graphically by using a spectrogram. Ultimately, the data from the feature extraction process is fed into the different models. Three models, the Hidden Markov Model, a Convolutional Neural Network and the Dynamic Time Warping Algorithm, were selected and compared using the same datasets, and the results were evaluated using accuracy and running time.en_GB
dc.language.isoenen_GB
dc.rightsinfo:eu-repo/semantics/restrictedAccessen_GB
dc.subjectAutomatic speech recognitionen_GB
dc.subjectHidden Markov modelsen_GB
dc.subjectNeural networks (Computer science)en_GB
dc.titleA comparison of speech recognition techniques and modelsen_GB
dc.typebachelorThesisen_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.en_GB
dc.publisher.institutionUniversity of Maltaen_GB
dc.publisher.departmentFaculty of Information and Communication Technology. Department of Computer Information Systemsen_GB
dc.description.reviewedN/Aen_GB
dc.contributor.creatorPizzuto, Andrew (2021)-
Appears in Collections:Dissertations - FacICT - 2021
Dissertations - FacICTCIS - 2021

Files in This Item:
File Description SizeFormat 
21BITSD019.pdf
  Restricted Access
3.43 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.