XQuery Support in the New Version of SQL Server: A Query Tool That Brings XML Data and Relational Databases Together


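Implementing XQuery in relational database systems is an important technical development, particularly for enterprise applications that prefer to store XML data in a relational database as a rich data type, that is, as a sequence of bytes. Storing XML this way avoids the complexity of decomposing large amounts of data into multiple tables, as well as the cost of reassembling the XML data afterwards. The upcoming release of Microsoft SQL Server is especially notable: it supports XQuery as a query language and builds on the existing relational infrastructure to do so.

XQuery, an emerging W3C recommendation, is designed for querying XML data efficiently. Its core is a set of language constructs (FLWOR) that let a query shape its own result, together with a rich set of functions and operators. With FLWOR, developers can build queries to fit their needs, from path-based navigation to processing of complex XML structures. Compatibility with the XML Schema type system is another important feature of XQuery, because it lets XML data be validated and kept consistent. In a relational database, XQuery expressions are embedded in ordinary SQL statements, so XML documents can be queried, filtered and aggregated from SQL, which simplifies the developer's workflow.

XQuery support in SQL Server means that enterprises can use it to strengthen data analysis and reporting, especially where XML documents are involved. Because XQuery is integrated with the database, it operates directly on the stored XML data without a separate data conversion step beforehand, which improves performance and efficiency. Implementing XQuery in a relational database also brings challenges, however: query optimization, performance tuning, and best practices for the interaction between SQL and XQuery statements. Developers need to understand the characteristics of both languages and make sure they work together effectively.

Implementing XQuery in a relational database system is thus a topic that combines XML data management, query language design and database technology. It gives modern enterprise applications a more effective way to process data and pushes database technology towards greater flexibility and adaptability. As support in platforms such as SQL Server matures, XQuery is likely to be applied more widely.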

implicitly or explicitly during assignments of either string or binary SQL values to XML columns, variables and parameters.

XML values are stored in an internal format as large binary objects (“XML blobs”) in order to support XML data model characteristics, such as document order and recursive structures, more faithfully.

The following statement creates a table DOCS with an integer primary key column PK and an XML column XDOC:

    CREATE TABLE DOCS (
        PK INT PRIMARY KEY,
        XDOC XML)
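As a rough illustration of the assignment conversions mentioned earlier, the sketch below fills DOCS from a string literal (implicit conversion), through an explicit CAST, and through an XML variable. The BOOK/SECTION/TITLE content is invented sample data, chosen only to match the element names used by the query example in Section 2.3; it is not taken from the paper.

```sql
-- Sample rows for the DOCS table; the book content is made-up illustration data.

-- Implicit conversion: the string literal is parsed as XML on assignment.
INSERT INTO DOCS (PK, XDOC)
VALUES (1, N'<BOOK><SECTION><TITLE>Introduction</TITLE></SECTION>
              <SECTION><TITLE>Indexing</TITLE></SECTION></BOOK>');

-- Explicit conversion: CAST the string to the XML data type first.
INSERT INTO DOCS (PK, XDOC)
VALUES (2, CAST(N'<BOOK><SECTION><TITLE>Overview</TITLE></SECTION></BOOK>' AS XML));

-- The same conversion applies when assigning to an XML variable.
DECLARE @x XML;
SET @x = N'<BOOK><SECTION><TITLE>Appendix</TITLE></SECTION></BOOK>';
INSERT INTO DOCS (PK, XDOC) VALUES (3, @x);
```

In each case the string must be well-formed XML; otherwise the conversion raises a parse error and nothing is stored.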

2.2 XML Schema Support

SQL Server 2005 provides XML schema collections as a mechanism for managing W3C XML schema documents [21] as metadata. An XML data type can be associated with an XML schema collection so that XML schema constraints are enforced on XML instances. Such XML data types are called “typed XML”. An XML data type that is not bound to an XML schema collection is referred to as “untyped XML”.

Both typed and untyped XML are supported within a single framework, the XML data model is preserved, and query processing enforces XQuery semantics. The underlying relational infrastructure is used extensively for this purpose.
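A minimal sketch of typed XML, under stated assumptions: the collection name BookSchemas, the table name TYPED_DOCS and the schema itself are hypothetical and only mirror the BOOK/SECTION/TITLE structure used elsewhere in this section.

```sql
-- Register a (hypothetical) schema for BOOK documents as a schema collection.
CREATE XML SCHEMA COLLECTION BookSchemas AS
N'<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
    <xs:element name="BOOK">
      <xs:complexType>
        <xs:sequence>
          <xs:element name="SECTION" maxOccurs="unbounded">
            <xs:complexType>
              <xs:sequence>
                <xs:element name="TITLE" type="xs:string"/>
              </xs:sequence>
            </xs:complexType>
          </xs:element>
        </xs:sequence>
      </xs:complexType>
    </xs:element>
  </xs:schema>';
GO  -- batch separator understood by sqlcmd and Management Studio

-- Typed XML: binding the column to the collection enforces the schema.
CREATE TABLE TYPED_DOCS (
    PK INT PRIMARY KEY,
    XDOC XML(BookSchemas));

-- Accepted: the instance is valid against the schema.
INSERT INTO TYPED_DOCS (PK, XDOC)
VALUES (1, N'<BOOK><SECTION><TITLE>Introduction</TITLE></SECTION></BOOK>');

-- Expected to be rejected with a validation error, because CHAPTER is not declared:
-- INSERT INTO TYPED_DOCS (PK, XDOC) VALUES (2, N'<BOOK><CHAPTER/></BOOK>');
```

The untyped DOCS table would accept both instances, since only well-formedness is checked there.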

2.3 Querying XML Data

XML instances can be retrieved using the SQL SELECT statement. Four built-in methods on the XML data type, namely query(), value(), exist() and nodes(), are available for fine-grained querying. A fifth built-in method, modify(), allows fine-grained modification of XML instances but is not discussed further in this paper.

The query methods on the XML data type accept the XQuery language [15][16][22], which is an emerging W3C recommendation (currently in Last Call) and includes the navigational language XPath 2.0 [20]. Together with a large set of functions, XQuery provides rich support for manipulating XML data. The supported features of the XQuery language are shown below:

• XQuery clauses “for”, “where”, “return” and “order by”.

• XPath axes child, descendant, parent, attribute, self and descendant-or-self.

• Functions: numeric, string, Boolean, nodes, context, sequences, aggregate, constructor, data accessor, and SQL Server extension functions to access SQL variable and column data within XQuery.

• Numeric operators (+, -, *, div, mod).

• Value comparison operators (eq, ne, lt, gt, le, ge).

• General comparison operators (=, !=, <, >, <=, >=).

The following is an example of a query in which section titles are retrieved from books and wrapped in new <topic> elements:

    SELECT PK, XDOC.query('
        for $s in /BOOK/SECTION
        return <topic>
                 {data($s/TITLE)}
               </topic>')
    FROM DOCS

The query execution is tuple-oriented: the SELECT list is evaluated on each row of the DOCS table, the query() method is processed on the XDOC column in each row, and the result is a two-column rowset where the column types are integer (for PK) and untyped XML (for the XML result). The query methods are evaluated on single XML instances, so XQuery evaluation over multiple XML documents is currently not supported by the syntax but is allowed by the architecture. Scalar value-based joins over XML instances are possible.
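The other three query methods can be sketched against the same DOCS table. This is an illustrative example rather than code from the paper: the title value "Indexing", the column aliases and the NVARCHAR(200) target type are assumptions.

```sql
-- value(): extract one scalar per row. The path must be a static singleton,
-- hence the [1] predicate; the second argument is the SQL target type.
SELECT PK,
       XDOC.value('(/BOOK/SECTION/TITLE)[1]', 'NVARCHAR(200)') AS FirstTitle
FROM DOCS;

-- exist(): returns 1 when the XQuery expression yields a non-empty result,
-- which makes it usable as a filter in the WHERE clause.
SELECT PK
FROM DOCS
WHERE XDOC.exist('/BOOK/SECTION[TITLE = "Indexing"]') = 1;

-- nodes(): produce one row per SECTION element, so that value() can then be
-- applied to each section individually.
SELECT D.PK,
       S.sec.value('(TITLE)[1]', 'NVARCHAR(200)') AS Title
FROM DOCS AS D
CROSS APPLY D.XDOC.nodes('/BOOK/SECTION') AS S(sec);
```

Like query(), each of these methods is evaluated once per row on a single XML instance, which matches the tuple-oriented execution described above.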

2.4 Indexing XML Data

Query execution processes each XML instance at runtime; this becomes expensive whenever the XML blob is large in size, the query is evaluated on a large number of rows in a table, or a single SQL query executes multiple XQuery expressions requiring the XML blob to be parsed multiple times. Consequently, a mechanism for indexing XML columns is supported in SQL Server 2005 to speed up queries.

A primary XML index [12] on an XML column creates a B+ tree index on the data model content of the XML nodes, and adds a column Path_ID for the reversed, encoded path from each XML node to the root of the XML tree.

The structural properties of the XML instance, such as relative order of nodes and document hierarchy, are captured in the OrdPath column for each node [11]. The primary XML index is clustered on the OrdPath value of each XML instance in the XML column. The other noteworthy columns are the name, type and the value of a node.
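For reference, a primary XML index on the DOCS table from Section 2.1 could be created roughly as follows. The index name PXML_DOCS_XDOC is a hypothetical choice; SQL Server requires the base table to have a clustered primary key, which DOCS gets by default from its PK column.

```sql
-- Create the primary XML index on the XML column XDOC of table DOCS.
CREATE PRIMARY XML INDEX PXML_DOCS_XDOC
ON DOCS (XDOC);

-- List the XML indexes defined on DOCS to confirm that the index exists.
SELECT name, type_desc
FROM sys.xml_indexes
WHERE object_id = OBJECT_ID('DOCS');
```

Existing queries need no changes: the same query(), value(), exist() and nodes() calls are issued as before, and the optimizer decides whether to use the index.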

XML indexes provide efficient evaluation of queries on XML data, and reassembly of the XML result from the tree. These use the relational infrastructure while preserving document order and document structure.

OrdPath encodes the parent-child relationship of XML nodes by extending the parent’s OrdPath with a labelling component for the child. This allows efficient determination of parent-child and ancestor-descendant relationships. Furthermore, the subtree of any XML node N can be retrieved from the primary XML index using a
