Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

A node partitioning strategy for optimising the performance of XML queries

Marks, Gerard (2011) A node partitioning strategy for optimising the performance of XML queries. PhD thesis, Dublin City University.

Abstract
For ease of communication between heterogeneous systems, the eXtensible Markup Language (XML) has been widely adopted as a data storage format. However, XML query processing presents issues both in terms of query performance and updatability. Thus, many are choosing to shred XML data into relational databases in order to benet from its mature technology. The problem with this approach is that (often complex and time consuming) data transformation processes are required to transform XML data to relational tables and vice versa. Additionally, many of the benets of XML data can be lost during these processes. In this dissertation, we present a process that partitions nodes within an XML document into disjoint subsets. Briefly, as there are fewer partitions than there are nodes, a more efficient join operation can be performed between partitions, thus reducing the number of inefficient node comparisons. The number and size of partitions varies depending on the structure and layout in the XML document, and the number of partitions impacts query performance. Therefore, we also provide a partition classication process, which signicantly reduces the number of partitions because each partition class represents many equivalent partitions within the XML document. In this dissertation, we will demonstrate that our approach outperforms similar approaches for a large subset of XML queries by eliminating complex join operations (where possible) during the query process.
Metadata
Item Type:Thesis (PhD)
Date of Award:November 2011
Refereed:No
Supervisor(s):Roantree, Mark
Uncontrolled Keywords:XML; databases; query processing
Subjects:Computer Science > Information storage and retrieval systems
Computer Science > Information retrieval
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License
Funders:Enterprise Ireland
ID Code:16521
Deposited On:02 Dec 2011 12:00 by Mark Roantree . Last Modified 19 Jul 2018 14:54
Documents

Full text available as:

[thumbnail of Gerard Marks PhD Thesis]
Preview
PDF (Gerard Marks PhD Thesis) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
1MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record