• Corpus ID: 6727956

Relational Databases for Querying XML Documents: Limitations and Opportunities

@inproceedings{Shanmugasundaram1999RelationalDF,
  title={Relational Databases for Querying XML Documents: Limitations and Opportunities},
  author={Jayavel Shanmugasundaram and Kristin Tufte and Chun Zhang and Gang He and David J. DeWitt and Jeffrey F. Naughton},
  booktitle={VLDB},
  year={1999}
}
XML is fast emerging as the dominant standard for representing data in the World Wide Web. [...] Key Method To this end, we have developed algorithms and implemented a prototype system that converts XML documents to relational tuples, translates semi-structured queries over XML documents to SQL queries over tables, and converts the results to XML. We have qualitatively evaluated this approach using several real DTDs drawn from diverse domains. It turns out that the relational approach can handle most (but not…Expand
Querying XML documents by dynamic shredding
TLDR
An algorithm is presented to translate a significant subset of XQuery into an extended relational algebra that includes operators defined for the structured text datatype that can be used as the basis of a sound translation from XQuery to SQL and the starting point for query optimization which is required for XML to be supported by relational database technology.
Bridging relational technology and xml
TLDR
This dissertation presents a technique for storing and querying XML documents using a relational database system, and proposes and evaluates alternative strategies for efficiently publishing existing relational data as XML documents, to publish XML documents for electronic-commerce applications.
Querying XML Document using Relational Database System : A Survey
XML is rapidly emerging being the eminent standard for representing data in the World Wide Web. High-level query engines that allow end users to effectively tap the data stored in XML documents will
Using a relational database for scalable XML search
TLDR
This work describes a relational approach that is fixed schema eliminating the need for schema redesign at the expense of potentially longer runtimes, and shows that the relational approach typically outperforms the tree-based approach while scaling consistently over all collections studied.
Storing and querying ordered XML using a relational database system
TLDR
This paper shows that XML's ordered data model can indeed be efficiently supported by a relational database system, and proposes three order encoding methods that can be used to represent XML order in the relational data model, and also proposes algorithms for translating ordered XPath expressions into SQL using these encoding methods.
Efficiently Querying Large XML Data Repositories: A Survey
TLDR
This survey considers two classes of major XML query processing techniques: the relational approach and the native approach, which could result in higher query processing performance and also significantly reduce system reengineering costs.
Updating XML Views of Relational Data
TLDR
Experimental studies show that incremental XML updates supported by the system is a better choice than complete reload of XML documents under a variety of system settings.
A Generic and Flexible Framework for Mapping XML Documents into Relations
TLDR
This paper describes a mapping scheme wherein mappings are defined through annotations in an XML Schema and thus, not tied to any particular database engine, making the framework flexible; capable of expressing a wide range of existing mappings strategies; and easily extensible to incorporate new mapping strategies.
Model-Mapping Approaches for Storing and Querying XML Documents in Relational Database: A Survey
TLDR
Although XPEV approach overcome on others approaches like Edge, XRel and Xparent in storage space, query response time and the influence of scalability on query performance when data sizes become bigger, it still has not resolve storing and querying XML documents in efficient way, the paper paves the way for solutions to improve these approaches.
Exploiting native XML indexing techniques for XML retrieval in relational database systems
TLDR
This paper presents the Relational CADG, an adaptation of several native indexing approaches to the relational model, and shows how it supports the evaluation of a clean formal language of conjunctive XML queries, thus addressing several open problems known from the literature.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 28 REFERENCES
A Query Language for XML
TLDR
This work presents a query language for XML, called XML-QL, which is argued to be suitable for performing the above tasks, and can extract data from existing XML documents and construct new XML documents.
A query language and optimization techniques for unstructured data
TLDR
Here a simple language UnQL is proposed for querying data organized as a rooted, edge-labeled graph and it is shown that known optimization techniques for operators on flat relations apply to the "horizontal" dimension of UnQL.
From structured documents to novel query facilities
TLDR
A natural mapping from SGML documents into OODB's and a formal extension of two OodB query languages in order to deal with SGML document retrieval and a key element is the introduction of paths as first class citizens.
The Lorel query language for semistructured data
TLDR
The main novelties of the Lorel language are the extensive use of coercion to relieve the user from the strict typing of OQL, which is inappropriate for semistructured data; and powerful path expressions, which permit a flexible form of declarative navigational access and are particularly suitable when the details of the structure are not known to the user.
Storing semistructured data with STORED
TLDR
It is shown how a document-type-descriptor (DTD), when present, can be exploited to further improve performance and is interested in applying STORED to XML data, which is an instance of semistructured data.
Multiple-query optimization
TLDR
The results show that using multiple- query processing algorithms may reduce execution cost considerably, and the presentation and analysis of algorithms that can be used for multiple-query optimization are presented.
eXtensible Markup Language (XML) 1.0 (Second Edition)
The Extensible Markup Language (XML) is a subset of SGML that is completely described in this document. Its goal is to enable generic SGML to be served, received, and processed on the Web in the way
Optimizing regular path expressions using graph schemas
TLDR
This work describes two optimization techniques for queries with regular path expressions that rely on graph schemas for specifying partial knowledge about the data's structure and gives an efficient algorithm for rewriting any regular path expression query into a pruned one.
Extensible Markup Language (XML)
TLDR
XML is an extremely simple dialect of SGML which is completely described in this document, to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML.
Lore: a database management system for semistructured data
TLDR
This paper provides an overview of these aspects of the Lore system, as well as other novel features such as dynamic structural summaries and seamless access to data from external sources.
...
1
2
3
...