Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/14724
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.date.accessioned | 2016-12-16T09:19:42Z | - |
dc.date.available | 2016-12-16T09:19:42Z | - |
dc.date.issued | 2016 | - |
dc.identifier.uri | https://www.um.edu.mt/library/oar//handle/123456789/14724 | - |
dc.description | B.SC.IT(HONS) | en_GB |
dc.description.abstract | Data mining and Data Warehouse (DWH) effectiveness depends on integrating a number of data sources. Extract Transform and Load (ETL) is a fundamental process for data integration, improving data quality, timelines and efficacy of data. Its implementation is known to be quite code intensive and contrived. Various data integration tools and process languages exist that aim at making ETL more manageable. Nonetheless, to implement an ETL process using these tools is still complex, intensive and comparatively fine grained. This dissertation involves an investigation of data and processes commonly required in an ETL process in order to automate these at a higher level. To achieve this, Business Process Model and Notation (BPMN) is used to design an ETL workflow in conjunction with a specification file. This file describes additional details about the ETL processes depicted in a BPMN model. Moreover, this work extends the automated processes to abstract the complexity and intensity of the ETL. In order to extend the ETL processes, data definition files are supplied to an automated ETL workflow, where these files are used to define the source and destination DWH structure. The work presented proves to be effective as queries were run to evaluate the ETL processes implemented, where the results obtained illustrate that the processes function as expected. Consequently, the source data is extracted, transformed and loaded into the DWH as specified. | en_GB |
dc.language.iso | en | en_GB |
dc.rights | info:eu-repo/semantics/restrictedAccess | en_GB |
dc.subject | Data warehousing | en_GB |
dc.subject | Database management | en_GB |
dc.title | Extending and automating ETL processes in DWH | en_GB |
dc.type | bachelorThesis | en_GB |
dc.rights.holder | The copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder. | en_GB |
dc.publisher.institution | University of Malta | en_GB |
dc.publisher.department | Faculty of Information & Communication Technology. Department of Computer Information Systems | en_GB |
dc.description.reviewed | N/A | en_GB |
dc.contributor.creator | Attard, Chiara | - |
Appears in Collections: | Dissertations - FacICT - 2016 Dissertations - FacICTCIS - 2016 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
16BITSD002.pdf Restricted Access | 2.92 MB | Adobe PDF | View/Open Request a copy |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.