My Account Log in

3 options

XML query reformulation over mixed and redundant storage / Alin Deutsch.

LIBRA QA003 2002 .D485
Loading location information...

Available from offsite location This item is stored in our repository but can be checked out.

Log in to request item
LIBRA Diss. POPM2002.276
Loading location information...

Available from offsite location This item is stored in our repository but can be checked out.

Log in to request item
LIBRA Microfilm P38:2002
Loading location information...

Mixed Availability Some items are available, others may be requested.

Log in to request item
Format:
Book
Manuscript
Microformat
Thesis/Dissertation
Author/Creator:
Deutsch, Alin.
Contributor:
Tannen, Val, 1953- advisor.
University of Pennsylvania.
Language:
English
Subjects (All):
Penn dissertations--Computer and information science.
Computer and information science--Penn dissertations.
Local Subjects:
Penn dissertations--Computer and information science.
Computer and information science--Penn dissertations.
Physical Description:
xii, 187 pages : illustrations ; 29 cm
Production:
2002.
Summary:
XML is widely accepted as the standard for data exchange between businesses on the Internet. However, most corporations publish only selected portions of their proprietary business data as XML documents, and even then only virtually, that is by exposing a schema against which queries can be formulated. In order to be answered, such XML queries must be reformulated as queries on the actual proprietary data. Existing XML publishing systems conform to the Global-As-View data integration scenario, in which the correspondence between published (global) and proprietary (local) data is given by expressing the former as a view of the latter. However, an ideal publishing system should enhance query execution by allowing for redundancy in storage which enables multiple reformulations, some potentially cheaper to execute than others. Redundancy requires the complementary, Local-As-View approach to data integration, in which the proprietary data is expressed as a view of the published data. We are led to consider XML publishing systems according to a combined Global-and-Local-As-View approach. Building such a system means facing the following challenges. Existing reformulation algorithms developed for the Global-As-View scenario are said to perform composition-with-views, and they are seemingly unrelated to reformulation algorithms for the Local-As-View scenario, which do rewriting-with-views. Moreover, it turns out that picking the optimal reformulation among the possible candidates requires query minimization. We present MARS, a system implementing a novel reformulation algorithm which achieves the combined effect of rewriting-with-views, composition-with-views and minimization. The algorithm works even when the proprietary storage is a mix of XML documents and relational databases. We prove a completeness theorem which guarantees that under certain conditions, our algorithm will find a minimal reformulation if one exists. Moreover, we study the complexity of the problem and identify conditions when this algorithm achieves best complexity bounds. We report on experiments that show the practicality of the approach.
Notes:
Supervisor: Val Tannen.
Thesis (Ph.D. in Computer and Information Science) -- University of Pennsylvania, 2002.
Includes bibliographical references.
Local Notes:
University Microfilms order no.: 3072989.
OCLC:
244972839

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

We want your feedback!

Thanks for using the Penn Libraries new search tool. We encourage you to submit feedback as we continue to improve the site.

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Library Catalog Using Articles+ Library Account