Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/92134
Title: Discovery of anomalies and teleconnection patterns in meteorological climatological data
Authors: Cassar, Lukan (2021)
Keywords: Climatic changes
Data sets
Data mining
Algorithms
Machine learning
Meteorology
Issue Date: 2021
Citation: Cassar, L. (2021). Discovery of anomalies and teleconnection patterns in meteorological climatological data (Bachelor’s dissertation).
Abstract: Climate change is an ever-increasing problem in the modern world, and the analysis of data sets to discover patterns and anomalous behaviour, with regards to the climate, is more crucial than ever. However, analysing such data sets can prove overwhelming, as these data sets tend to be too large to manually inspect. As a result, a need for techniques to efficiently scour and manipulate such extensive data, often referred to as data mining techniques has arisen. The work that is carried out within this research consists of using different data mining algorithms to extract anomalies and teleconnections from a data set of monthly global air temperatures, covering a period of 72 years (1948-2019). Anomaly detection is a significant step in data mining that aims to identify data points that deviate from the remainder of the data, often called anomalies. The purpose of anomaly detection in climate data is to identify any spatial (across space), temporal (across time) or spatial-temporal (across both space and time) anomalies within the data set. These anomalies are important to understand and forecast the nature of Earth’s ecosystem model. The anomalies are detected using three different algorithms; K-Nearest Neighbour (KNN), K-Means Clustering, and Density-Based Spatial Clustering of Applications with Noise (DBSCAN). Results obtained from anomaly detection show that the KNN algorithm is the best performing one out of the three. Teleconnections are recurring and persistent patterns in climate anomalies, which connect two distant regions. Teleconnections are important because they reflect large-scale changes in the atmosphere and influence temperature, rainfall, and storms over extensive areas. As a result, teleconnections are often the culprit responsible for anomalous weather patterns occurring concurrently over widespread distances. The teleconnections are detected using three different association mining techniques; Apriori, FP-Growth, and Generalized Sequential Pattern (GSP), over the spatial-temporal anomalies identified before. Results obtained from teleconnection detection show that the Apriori and FP-Growth algorithm output very similar results, while the GSP algorithm failed to output any results due to hardware limitations. The extracted anomalies and teleconnections, obtained from the previously mentioned algorithms, are presented using interactive graphs and heat maps.
Description: B.Sc. IT (Hons)(Melit.)
URI: https://www.um.edu.mt/library/oar/handle/123456789/92134
Appears in Collections:Dissertations - FacICT - 2021
Dissertations - FacICTAI - 2021

Files in This Item:
File Description SizeFormat 
21BITAI018.pdf
  Restricted Access
6.19 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.