Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/107857
Full metadata record
DC FieldValueLanguage
dc.date.accessioned2023-03-28T12:53:34Z-
dc.date.available2023-03-28T12:53:34Z-
dc.date.issued2022-
dc.identifier.citationAttard, K. (2022). Crime analysis (Bachelor's dissertation).en_GB
dc.identifier.urihttps://www.um.edu.mt/library/oar/handle/123456789/107857-
dc.descriptionB.Sc. IT (Hons)(Melit.)en_GB
dc.description.abstractThis dissertation focuses on using data mining techniques within the crime analysis domain, which is becoming very popular since many datasets are being publicly available to researchers. We focus on three main objectives; predicting the type of crime, predicting the future crime rate and the investigation of how much data is required to train our models. Two distinct American public datasets are chosen for this study, one focusing on the city of San Francisco (SF) and the other on New York City (NYC). Our first objective concerns crime type prediction, where we implemented several models, ranging from classical Machine Learning (ML) to Deep Learning (DL) methods, that are able to classify the crime category based on the inputted data, mainly being, the time of crime occurrence, its location and how the crime was resolved. The results obtained are then directly compared to [31], where our main goal was to replicate and improve on their findings. The second objective covers crime rate prediction. We implemented both ML and DL regression-based models, meaning that we predict a continuous value (the expected future crimes), as opposed to the classificationbased approach above (which predicts a category). Additionally, we integrated external data (population, unemployment rate and median income data) to help us create more accurate models. Similar to the first objective, we replicate and improve on the results obtained by [9]. In our last objective, we investigated how much data the models require to learn from and yet still manage to produce decent results. Hence, for each prior objective, we diminished the original size of the training set whilst keeping the test set unchanged to check how much data is needed for all the models developed to retain their optimal performance. Our classification and regression models managed to produce better results than [31] and [9] respectively. We inferred that the DL classification model created was not more effective than the other ML models built. We also observed that the unemployment rate feature is the most effective external data when predicting the future crime rate. Finally, diminishing the training sets of Objectives 1 and 2 respectively by approximately half allowed the models to retain a decent performance in their predictions.en_GB
dc.language.isoenen_GB
dc.rightsinfo:eu-repo/semantics/restrictedAccessen_GB
dc.subjectCrime analysisen_GB
dc.subjectMachine learningen_GB
dc.subjectData miningen_GB
dc.titleCrime analysisen_GB
dc.typebachelorThesisen_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.en_GB
dc.publisher.institutionUniversity of Maltaen_GB
dc.publisher.departmentFaculty of Information and Communication Technology. Department of Artificial Intelligenceen_GB
dc.description.reviewedN/Aen_GB
dc.contributor.creatorAttard, Karl (2022)-
Appears in Collections:Dissertations - FacICT - 2022
Dissertations - FacICTAI - 2022

Files in This Item:
File Description SizeFormat 
2208ICTICT390900013564_1.PDF
  Restricted Access
1 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.