LotusX: A Graphical Interface with Auto-Completion for Easy XML Search


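In their research paper "LotusX: A Position-Aware XML Graphical Search System with Auto-Completion", Chunbin Lin, Jiaheng Lu, Tok Wang Ling, and Bogdan Cautis introduce LotusX, a graphical search system for XML. The system addresses the problem that traditional query languages such as XQuery demand too much expertise from users. Its core features are "position-aware" and "auto-completion" support, which let users issue queries without a deep understanding of the query language, the data schema, or the concrete content of the XML document.

XML (eXtensible Markup Language) is a widely used format for storing and exchanging data, and it plays an important role in structured data management. However, professional query languages such as XQuery have a steep learning curve, which makes them daunting for beginners. Understanding and navigating the hierarchical structure and content of XML documents is equally challenging for ordinary users. A user-friendly interface that lowers the complexity of query construction is therefore essential to the wider adoption of XML technology.

LotusX adopts a twig-based query approach and provides a graphical user interface. Users build and modify queries through an intuitive tree model, while the system supplies suitable query candidates in real time, much like the auto-completion common in text input. This twig-style querying simplifies how users interact with XML data. Beyond basic graphical queries, LotusX supports complex twig queries, including order-sensitive queries, so users can search for specific sequences of elements, which is useful when the ordering information in XML matters. The system also introduces a new ranking strategy that sorts query results by relevance, and a query rewriting feature that automatically rewrites users' query expressions. For hands-on evaluation, the research team provides an online demo of LotusX at http://datasearch.ruc.edu.cn:8080/LotusX.

In summary, LotusX is a graphical system that aims to simplify querying XML data. Its position-aware and auto-completion techniques lower the barrier for non-expert users, while it also accounts for complex query needs and query performance. It is a particularly valuable solution for users who are unfamiliar with complex query languages.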

LotusX: A Position-Aware XML Graphical Search System with Auto-Completion

Chunbin Lin, Jiaheng Lu, Tok Wang Ling, Bogdan Cautis

Key Laboratory of Data Engineering and Knowledge Engineering, MOE

Renmin University of China

{chunbinlin, jiahenglu}@ruc.edu.cn

School of Computing, National University of Singapore

lingtw@comp.nus.edu.sg

Télécom ParisTech

cautis@telecom-paristech.fr

Abstract: Existing query languages for XML (e.g., XQuery) require professional programming skills to formulate queries. Learning such complex query languages is a tedious and time-consuming process that can be very challenging, especially for novice users. In addition, when issuing an XML query, users are required to be familiar with the content (including the structural and textual information) of the hierarchical XML document, which is difficult for common users. Designing user-friendly interfaces that reduce the burden of query formulation is therefore fundamental to the growth of the XML community.

We present a twig-based XML graphical search system, called LotusX, that provides a graphical interface to simplify query processing without requiring users to learn query languages or data schemas, or to know the content of the XML document. The basic idea is that LotusX offers "position-aware" and "auto-completion" features that help users create tree-modeled queries (twig pattern queries) by providing reasonable candidates on the fly. In addition, complex twig queries (including order-sensitive queries) are supported in LotusX. Furthermore, a new ranking strategy and a query rewriting solution are implemented to rank the results and to rewrite queries automatically, respectively. We provide an online demo of the LotusX system: http://datasearch.ruc.edu.cn:8080/LotusX
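To make the idea of position-aware candidates concrete, the following is a minimal Python sketch, not the LotusX implementation. It assumes that "position" can be approximated by the root-to-node tag path of the query node being extended, and that candidates are simply the child tags observed at that path in the document. The sample document and the names `build_position_index` and `suggest` are hypothetical and introduced only for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample document, used only for this illustration.
SAMPLE = """<bib>
  <book><title>A</title><publisher>P1</publisher><year>2005</year><price>40</price></book>
  <book><title>B</title><author>X</author><year>1998</year></book>
  <article><title>C</title><journal>J</journal></article>
</bib>"""


def build_position_index(root):
    """Map each root-to-node tag path to the set of child tags seen beneath it."""
    index = defaultdict(set)

    def walk(node, path):
        for child in node:
            index[path].add(child.tag)
            walk(child, path + (child.tag,))

    walk(root, (root.tag,))
    return index


def suggest(index, path, prefix=""):
    """Return the child tags valid at `path` that start with the typed prefix."""
    return sorted(t for t in index.get(tuple(path), ()) if t.startswith(prefix))


root = ET.fromstring(SAMPLE)
index = build_position_index(root)

# Typing "p" under bib/book only offers tags that really occur at that position.
print(suggest(index, ["bib", "book"], "p"))   # ['price', 'publisher']
print(suggest(index, ["bib", "article"]))     # ['journal', 'title']
```

Because the suggestions depend on the path, the same prefix yields different candidates under `book` and under `article`, which is the sense in which such candidates are position-aware in this sketch.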

I. Introduction

XML plays an important role in information exchange nowadays. As a result, a wide spectrum of users, including those with minimal or no computer programming skills, need to query hierarchical XML. Designing effective and efficient systems that simplify query processing over XML documents has therefore attracted considerable research interest. Well-known XML query languages (e.g., XQuery) are provided to process XML queries. However, these languages are far too complicated for unskilled users, who might only be aware of the basics of the XML data model or might even lack knowledge of the content (i.e., structural and textual information) of the XML documents.

(a) XQuery expression:

    <bib>
    { for $b in doc("bib.xml")/bib/book
      where $b//publisher = "Thomas S. Huang"
        and ($b/year > 1999 or $b/year < 2010)
        and ($b/price > 30 and $b/price < 50)
      return <book> { $b/title } </book> }
    </bib>

(b) Twig pattern query: root node book with branches title, publisher ("Thomas S. Huang"), year (">1999 or <2010"), and price ("<50 and >30").

Fig. 1. The XQuery and twig pattern expression of the query.

For example, assume that a user wants to issue the following query: "List the titles of books written by Thomas S. Huang and published before 1999 or later than 2010, with a price between 30 and 50 dollars". This query can be formulated as the XQuery expression in Figure 1(a). Unfortunately, formulating such a query often demands considerable cognitive effort from end users and requires "programming" skills at least comparable to those needed for SQL, which can be both time-consuming and error-prone. To deal with this problem, XML graphical languages (e.g., XQBE [3], GLASS [6]) have been developed to allow users who do not know professional query languages to express queries. They let users create queries through simple graphical languages and then map the queries directly to XQuery in the background. However, (i) users are required to learn the syntax of the graphical languages, and (ii) users need to know the structural information (i.e., the parent-child (P-C) and ancestor-descendant (A-D) relationships) and the textual information (i.e., node names and values) of the XML documents, since the content of each node in the query must be entered by the user rather than supplied by the system. For example, when issuing the query in Figure 1, the user needs to know that the name of the publisher is "Thomas" rather than "Thomason" (i.e., textual information) and that price is a child of book (i.e., structural information).
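The twig pattern of Figure 1(b) can be pictured as a small tree whose edges are either P-C or A-D and whose nodes may carry value conditions. The Python sketch below is a naive recursive matcher written for illustration only; it is not the evaluation algorithm used by LotusX or by any XQuery engine. The class `TwigNode`, the function `matches`, and the sample document are hypothetical, and the predicates copy the conditions of Figure 1(a) as printed.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class TwigNode:
    tag: str
    axis: str = "child"  # "child" = P-C edge, "descendant" = A-D edge
    predicate: Optional[Callable[[str], bool]] = None  # condition on the node's text
    children: List["TwigNode"] = field(default_factory=list)


def matches(elem, q):
    """True if `elem` matches query node `q` and every branch below it."""
    if elem.tag != q.tag:
        return False
    if q.predicate is not None and not q.predicate((elem.text or "").strip()):
        return False
    for qc in q.children:
        if qc.axis == "child":
            candidates = list(elem)                               # direct children only
        else:
            candidates = [d for d in elem.iter() if d is not elem]  # all descendants
        if not any(matches(c, qc) for c in candidates):
            return False
    return True


# Hypothetical sample data; only the first book satisfies every branch.
DOC = """<bib>
  <book><title>Image Analysis</title><publisher>Thomas S. Huang</publisher><year>2005</year><price>45</price></book>
  <book><title>Cheap Book</title><publisher>Thomas S. Huang</publisher><year>2005</year><price>10</price></book>
  <book><title>Other</title><publisher>Thomason</publisher><year>2005</year><price>40</price></book>
</bib>"""

# The twig of Figure 1(b): book with title, publisher, year, and price branches.
twig = TwigNode("book", children=[
    TwigNode("title"),
    TwigNode("publisher", axis="descendant", predicate=lambda v: v == "Thomas S. Huang"),
    TwigNode("year", predicate=lambda v: float(v) > 1999 or float(v) < 2010),
    TwigNode("price", predicate=lambda v: 30 < float(v) < 50),
])

root = ET.fromstring(DOC)
titles = [b.findtext("title") for b in root.findall("book") if matches(b, twig)]
print(titles)  # ['Image Analysis']
```

The third book is rejected only because its publisher is "Thomason", which illustrates why a user who has to type node values by hand needs to know the exact textual content of the document.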

To simplify query processing, (i) XML keyword search systems (e.g., XReal [2]) have been proposed, which return the subtrees containing all the keywords. However, keywords can only express simple textual information and cannot describe structural information or complex content conditions. For example, these systems cannot answer the query in Figure 1, since keywords cannot describe the structures (e.g., year is a child of book) or the content conditions (e.g., "year>1999 or year<2010"). (ii) Visual search systems (e.g., Xing [4]) have been implemented. They present the structural and textual information of the document in visual interfaces, which allows the users to
