Data on the Web: From Relations to Semistructured Data and XML
is an examination of XML as a universal data transfer language and the theory behind the merging of the document-centric Web with a data-driven infrastructure. The book is intended as a textbook analysis of the issues, as well as background material for tool developers and others interested in the serious architectural details.
Aimed at readers already familiar with database concepts, the book includes little introductory material. It quickly lays out the concepts of self-describing semi-structured data and how XML fits into this approach to data representation. The discussion deals with XML as a data transfer mechanism and not a presentation language. While there is a quick explanation of DTDs, Xlink, and XPointer, readers should be fairly familiar with XML before approaching this advanced title.
The meat of the book revolves around query languages for XML. The authors present XML-QL and XSL in depth as examples. Then they move into much more advanced concepts such as schema formalisms, path constraints, and storage architectures. The book wraps up with a look at Lore and Strudel--two real-world systems that work with semi-structured data. Because of its intensive study of database and query theory, this textbook isn't for the ordinary Web developer. If data architectures are your expertise, however, Data on the Web may open new design doors. --Stephen W. Plain
Topics covered: Object database models, basic XML syntax, UnQL, XML-QL, XSL, StruQL, schema formalisms, extracting schemas from queries, semistructured data servers, Lore, Strudel, and XML-based database products.
From Library Journal
Most data on the web are not well structured, making the search and retrieval process difficult since the spiders, robots, and other search engines don't really understand the context of the data they are indexing and storing. This very advanced book examines the new retrieval and processing techniques as semistructured data and XML (as a data transfer language) that aim to merge a document-based web with a data-driven infrastructure. Hardcore programmers will want this. Recommended for university and large public libraries.
Copyright 2000 Reed Business Information, Inc.