METHODOLOGIES AND APPLICATION
A query-oriented XML text summarization for mobile devices
Dexi Liu
•
Shihan Wu
•
Yuehua Lan
•
Guoqiang Di
•
Jiezhao Peng
•
Naixue Xiong
•
Athanasios V. Vasilakos
Ó Springer-Verlag Berlin Heidelberg 2012
Abstract Extensible Markup Language (XML) is a
simple, flexible text format derived from SGML, which is
originally designed to support large-scale electronic pub-
lishing. Nowadays XML plays a fundamental role in the
exchange of a wide variety of data on the Web. As XML
allows designers to create their own customized tags,
enables the definition, transmission, validation, and inter-
pretation of data between applications, devices and orga-
nizations, lots of works in soft computing employ XML to
take control and responsibility for the information, such as
fuzzy markup language, and accordingly there are lots of
XML-based data or documents. However, most of mobile
and interactive ubiquitous multimedia devices have
restricted hardware such as CPU, memory, and display
screen. So, it is essential to compress an XML document/
element collection to a brief summary before it is delivered
to the user according to his/her information need. Query-
oriented XML text summarization aims to provide users a
brief and readable substitution of the original retrieved
documents/elements according to the user’s query, which
can relieve users’ reading burden effectively. We propose a
query-oriented XML summarization system QXMLSum,
which extracts sentences and combines them as a summary
based on three kinds of features: user’s queries, the content
of XML documents/elements, and the structure of XML
documents/elements. Experiments on the IEEE-CS datasets
used in Initiative for the Evaluation of XML Retrieval
show that the query-oriented XML summary generated by
QXMLSum is competitive.
Keywords Mobile devices Query-oriented
XML text summarization Query expansion
Content and structure
1 Introduction
Extensible Markup Language (XML) (World Wide Web
Consortium 2004) is a simple, flexible text format derived
from SGML, which is originally designed to support large-
scale electronic publishing. Nowadays XML plays a fun-
damental role in the exchange of a wide variety of data on
the Web. As XML allows designers to create their own
customized tags, enables the definition, transmission, vali-
dation, and interpretation of data between applications,
devices and organizations, lots of works in soft computing
employ XML to take control and responsibility for the
information. To solve the problems such as adaptivity,
hybrid control strategies, system integration, and ubiquitous
networking access in ubiquitous computing, Acampora and
Loia (2005) proposed XML-derived technologies to define
Communicated by G. Acampora.
D. Liu (&) G. Di J. Peng N. Xiong
Jiangxi University of Finance and Economics,
Nanchang 330013, China
e-mail: dexi.liu@163.com
D. Liu
Jiangxi Key Laboratory of Data and Knowledge Engineering,
Nanchang 330013, China
S. Wu
Songjiang Branch of Shanghai Rural Commercial Bank,
Shanghai 201600, China
Y. Lan
Gannan Medical University, Ganzhou 341000, China
A. V. Vasilakos
Department of Computer Engineering,
University of Western Macedonia, 56100 Kozani, Greece
123
Soft Comput
DOI 10.1007/s00500-012-0980-8