Semantic query optimization for processing XML streams with minimized memory footprint

Ming Li, Murali Mani, Elke A. Rundensteiner

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

XQuery evaluation over XML streams requires the temporary buffering of XML elements. This paper presents a semantic query optimization solution to minimize memory footprint during XQuery evaluation by exploiting schema knowledge. We focus on one particular class of constraints, namely, the Pattern Non-Occurrence (PNO) constraints for XML streams conforming to pre-defined DTDs. PNO constraints facilitate the early release of buffered data (early buffer release) or possibly avoid to ever store the data (buffer avoidance), thus achieving a minimized memory footprint. We develop an automaton-based technique to detect PNO constraints at runtime. For a given query, optimization opportunities of early buffer release and buffer avoidance which can be triggered by runtime PNO detection are explored and the optimization decision is then encoded into the Raindrop algebraic plan. We implement our optimization technique within the Raindrop XQuery engine. Our experimental studies illustrate that the proposed techniques bring significant performance improvement in both memory and CPU usage with little overhead.

Original languageEnglish (US)
Title of host publication3rd International Workshop on Database Technologies for Handling XML Information on the Web, DataX'08 - Held at EDBT 2008
Subtitle of host publication11th International Conference on Extending Database Technology
Pages27-36
Number of pages10
DOIs
StatePublished - 2008
Externally publishedYes
Event3rd International Workshop on Database Technologies for Handling XML Information on the Web DataX'08, Held in Conjunction with the International Conference on Extending Database Technology, EDBT'08 - Nantes, France
Duration: Mar 25 2008Mar 25 2008

Publication series

Name3rd International Workshop on Database Technologies for Handling XML Information on the Web, DataX'08 - Held at EDBT 2008: 11th International Conference on Extending Database Technology

Conference

Conference3rd International Workshop on Database Technologies for Handling XML Information on the Web DataX'08, Held in Conjunction with the International Conference on Extending Database Technology, EDBT'08
Country/TerritoryFrance
CityNantes
Period3/25/083/25/08

ASJC Scopus subject areas

  • Information Systems
  • Software

Fingerprint

Dive into the research topics of 'Semantic query optimization for processing XML streams with minimized memory footprint'. Together they form a unique fingerprint.

Cite this