UNIVERSITÄTSBIBLIOTHEK

Temporal Data Management and Incremental Data Recomputation with Wide-column Stores and MapReduce

  • In recent years, ”Big Data” has become an important topic in academia and industry. To handle the challenges and problems caused by Big Data, new types of data storage systems called ”NoSQL stores” (means ”Not-only- SQL”) have emerged. ”Wide-column stores” are one kind of NoSQL stores. Compared to relational database systems, wide-column stores introduce a new data model, new IRUD (Insert, Retrieve, Update and Delete) semantics with support for schema-flexibility, single-row transactions and data expiration constraints. Moreover, each column stores multiple data versions with associated time- stamps. Well-known examples are Google’s ”Big-table” and its open sourced counterpart ”HBase”. Recently, such systems are increasingly used in business intelligence and data warehouse environments to provide decision support, controlling and revision capabilities. Besides managing the current values, data warehouses also require management and processing of historical, time-related data. Data warehouses frequently employ techniques for processing changes in various data sources and incrementally applying such changes to the warehouse to keep it up-to- date. Although both incremental data warehousing maintenance and temporal data management have been the subject of intensive research in the relational database and finally commercial database products have picked up the ability for temporal data processing and management, such capabilities have not been explored systematically for today’s wide-column stores. This thesis helps to address the shortcomings mentioned above. It care- fully analyzes the properties of wide-column stores and the applicability of mechanisms for temporal data management and incremental data ware- house maintenance known from relational databases, extends well-known approaches and develops new capabilities for providing equivalent support in wide-column stores.
Metadaten
Author:Yong Hu
URN (permanent link):urn:nbn:de:hbz:386-kluedo-49654
Advisor:Stefan Deßloch, Norbert Ritter
Document Type:Doctoral Thesis
Language of publication:English
Publication Date:2017/10/26
Year of Publication:2017
Publishing Institute:Technische Universität Kaiserslautern
Granting Institute:Technische Universität Kaiserslautern
Acceptance Date of the Thesis:2017/07/31
Date of the Publication (Server):2017/10/26
GND-Keyword:Incremental recomputation; MapReduce; Temporal data processing; Wide-column stores
Number of page:IV, 190
Faculties / Organisational entities:Fachbereich Informatik
CCS-Classification (computer science):E. Data
H. Information Systems
J. Computer Applications
DDC-Cassification:0 Allgemeines, Informatik, Informationswissenschaft / 004 Informatik
Licence (German):Creative Commons 4.0 - Namensnennung, nicht kommerziell, keine Bearbeitung (CC BY-NC-ND 4.0)