基于集覆盖的语义映射提取方法

需积分: 3 137 浏览量更新于2024-10-12 收藏 332KB PDF 举报

本文档探讨了一种基于集覆盖（Set Covering）的本体映射提取方法，发表在2009年的国际Web信息系统与挖掘会议上。本体映射是基于本体的语义查询和融合的核心，它涉及到从源本体和目标本体之间的相似性中识别是否存在有效的本体映射。作者们关注的是如何有效地解决本体映射提取的问题，即确定两个本体之间是否存在映射关系。该研究将本体映射提取问题转化为一个集覆盖问题。集覆盖问题的目标是寻找最小的集合，这些集合能够覆盖所有训练数据中的元素。在论文中，提出了一个名为SCM-based Mapping Extraction（SME）的新颖算法。在训练阶段，该算法旨在找到能够最大程度覆盖训练数据的属性集。到了测试阶段，通过比较测试数据集中属性集的属性组合，来执行本体映射的提取。具体来说，SME算法的工作流程可能包括以下几个步骤： 1. 数据预处理：对源本体和目标本体进行特征抽取和相似度计算，以便构建或更新训练和测试数据集。 2. 集覆盖模型构建：建立一个模型，其中每个元素代表训练数据中的一个本体对的相似性特征，而集合则是可能的属性集。 3. 训练过程：应用集覆盖算法搜索出最优的属性集，确保它们能覆盖尽可能多的训练数据中的相似性特征。 4. 测试阶段：对于新的本体对，通过检查其特征是否被训练阶段找到的属性集所覆盖，判断是否存在潜在的映射。 5. 结果评估与优化：通过精确度、召回率等指标评估算法性能，并根据需要调整算法参数以提高映射提取的准确性。这种方法的优势在于它将复杂的本体映射问题转化为经典的数学优化问题，使得解决过程更为高效。然而，由于集覆盖问题通常为NP完全问题，实际应用中可能存在计算复杂度较高的挑战。因此，算法的性能和效率依赖于训练数据的规模以及所选算法的优化策略。这篇论文为本体映射的自动提取提供了一个新颖且有理论基础的方法，对于推动基于本体的语义互联网应用具有重要意义。

An ontology mapping extraction method based on set covering

Hongke Xia

1, 2

, Xuefeng Zheng

, Xiang Hu

and Yunmei Shi

Information Engineer Institute, University of Science and Technology Beijing

Computer School, Beijing Information Science and Technology University, Beijing

Computer Science and Technology Department, North China Electric Power University, Beijing

E-mail: hongke.xia@gmail.com

Abstract—Ontology Mapping is the foundation of semantic

query and semantic integration based on ontology. As the

crucial point of ontology mapping, the task of mapping extrac-

tion is to find whether there exists the ontology mapping

among the similarities between source ontology and target

ontology. In this paper the problem of mapping extraction is

regarded as the problem of set covering, and a novel ontology

mapping extraction algorithm based on set covering SME

(SCM-based Mapping Extraction) is proposed, by which the

property set which covers the training data set in maximum

degree is searched during training stage, and the mapping

extraction is carried out by means of the conjunction of the

properties of property set in testing data set during testing

stage. Experimental evaluations show that this method has

better comprehensive performances compared to other algo-

rithms.

Keywords—Mapping Extraction; set covering; data-dependent

ball; property spaces

I. INTRODUCTION

Among the various strategies that realized ontology

mapping, the ontology mapping based on similarity calcu-

lating is the mainstream method, which extracts the ontolo-

gy mapping after the similarities between source ontology

and target ontology are calculated. The proposed mapping

extraction method includes the selection strategy using

threshold, the selection strategy based on maximum predi-

cated value [1], the selection strategy using relaxation labe-

ling [2], and so on.

The selection strategy using threshold defines a thre-

shold, and determines whether there exist mappings among

concept couples by judging the relationship between the

similarities of concept couples and the threshold. If the

similarity of two concepts is larger than the threshold, then

there exists ontology mapping between these two concepts;

otherwise there exists no mapping. The main shortcoming

of threshold lies in how to choose the value of the threshold,

since different thresholds impact the final mapping extrac-

tion greatly, and how to choose the threshold soundly is a

problem which should be taken into account thoroughly.

The selection strategy based on maximum predicated

value sorts all the concept similarities by descending order,

and selects the concept couple with maximum similarity as

the concept couple which has ontology mapping between

them. This method can be substitute by the strategy using

threshold each other in essence, both Falcon-AQ [3] and

Rimom [4] extract the ontology mapping with this method.

Relaxation labeling is a widely used method by graphics

and image processing, nature language processing area, and

so on, and the main idea of it is that a node’s label is influ-

enced by its neighbors’, so we can deduce a node’s label by

its neighbors’ labels. In ontology mapping, the concepts of

target ontology are treated as labels, and the label which

most fits the concept of source ontology is searched by the

means of relaxation labeling. GLUE [2] extracts ontology

mapping using relaxation labeling.

In this paper a novel ontology mapping extraction me-

thod is proposed, which introduces the theory of set cover-

ing into mapping extraction, and converts the problem of

mapping extraction into the problem of set covering. This

method searches the property set, each of which covers the

training data in maximum degree, in training set, then de-

cides whether the mapping exists or not in testing set via the

property set found. The rest of the paper organizes as fol-

lows: in section 2 the related conceptions are introduced, in

section 3 the problem of ontology mapping is described and

the mapping extraction algorithm SME is presented, and

section 4 gives the results of experiment and analysis. Final-

ly section 5 concludes the paper with a discussion.

II.

THE PROBLEM OF SET COVERING

The problem of set covering has the following mathemat-

ics descriptions: suppose {X, F} consists of a finite set X

and a family of subsets F, and F covers X, in other words

every element of X belongs to at lest one subset of F, name-

∈

= ∪

. For a family of subset B which belongs to F

(

F⊆

), if

∈

= ∪

, it is to say B covers X. The problem of

set covering is to find the minimum subset B* of F which

2009 International Conference on Web Information Systems and Mining

DOI 10.1109/WISM.2009.46

191

Authorized licensed use limited to: Sunchon National University. Downloaded on June 07,2010 at 13:55:57 UTC from IEEE Xplore. Restrictions apply.

下载后可阅读完整内容，剩余3页未读，立即下载

kxzhgl

粉丝: 0
资源: 1

基于集覆盖的语义映射提取方法

论文研究-Ontology Module Extraction based on Approximate Resolvability.pdf

On Ontology Mapping Based on Transfer Learning

An Event Ontology Description Framework Based on SKOS

Information Exchange Interface between Smart Building and Utility Based on Ontology Mapping

A Method of Event Ontology Mapping

Text similarity calculation method based on ontology model

Research on domain ontology in different granulations based on concept lattice

Extraction of Event Elements Based on Event Ontology Reasoning

A Cognitive Support Framework for Ontology Mapping

An Ontology-based Approach to Adaptive Data Processing

最新资源