Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/83612
Title: Bayesian nonparametric latent feature modelling of river water quality
Authors: Aquilina, Albert (2021)
Keywords: Water quality management
Latent structure analysis
Nonparametric statistics
Bayesian statistical decision theory
Stochastic processes
Dirichlet problem
Issue Date: 2021
Citation: Aquilina, A. (2021). Bayesian nonparametric latent feature modelling of river water quality (Master's dissertation).
Abstract: Latent feature modelling is a class of multivariate techniques used to capture hidden structures underlying observed data sets. There are numerous instances in statistical literature where these techniques are applied to data sets, most notably from psychology. However, there is little to no published literature on the application of latent feature models to water quality data sets from scientific fields that attempt to identify and analyse scientific processes affecting our environment. In this study, two different latent feature modelling techniques known as Structural Equation Modelling (SEM) and Bayesian Nonparametric (BNP) latent feature modelling are fitted to a collection of data related to water quality. The former technique takes a finite-dimensional classical Frequentist approach, where the number of latent features extracted by the model is fixed and must be specified beforehand. In addition, hierarchical data is catered for by introducing a multi-level extension of SEM known as MSEM. However, these models encounter various problems when applied to larger, more complex data sets. To work with a more flexible and wider ranging family of models, we transition to infinite dimensions through the use of BNP models, which are the main protagonists of this study. In contrast to SEM, these models allow the number of latent features to be open-ended. The applicability of this technique to river water quality is determined by fitting a linear-Gaussian binary latent feature model using a Dirichlet Process (DP) prior. Studying the resulting posterior distribution allows us to identify and explain the possible sources affecting the water quality of rivers over time. Information on the provenance and timing of the observations comprising the data set shall be purposely left out during the estimation to enable us to test whether BNP models are able to detect, through the use of information contained within the data, inherent factors which are not measured directly but inferred statistically.
Description: M.Sc.(Melit.)
URI: https://www.um.edu.mt/library/oar/handle/123456789/83612
Appears in Collections:Dissertations - FacSci - 2021
Dissertations - FacSciSOR - 2021

Files in This Item:
File Description SizeFormat 
21MSCSTAT001.pdf
  Restricted Access
1.51 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.