Figure 2: The similarity matrix $C$ is used to compute the similarity of pairs of relations in the knowledge graph. Its $i$th frontal slice is the adjacency matrix of the $i$th relation, i.e., a two-dimensional matrix with a row and column for each entity, whose values are 1 if the relation holds for a pair and 0 otherwise.
In our framework, we represent a multi-relational knowledge graph of $N_r$ binary relations among $N_e$ entities by the order-3 tensor $\mathcal{X}$ of dimension $N_e \times N_e \times N_r$. This binary tensor is often very large and sparse. Our goal is to construct dense, informative $p$-dimensional embeddings, where $p$ is much smaller than either the number of entities or the number of relations. We represent the collection of $p$-dimensional entity embeddings by $A$, the collection of relation embeddings by $R$, and the similarity matrix by $C$. The entity embeddings collection $A$ contains matrices $A_\alpha$ of size $N_e \times p$, while the relation embeddings collection $R$ contains matrices $R_k$ of size $p \times p$. Recall that the frontal slice $X_k$ of tensor $\mathcal{X}$ is the adjacency matrix of the $k$th binary relation, as shown in Figure 2. We use $A \otimes B$ to denote the Kronecker product of two matrices $A$ and $B$, $\operatorname{vec}(B)$ to denote the vectorization of a matrix $B$, and a lowercase italic letter like $a$ to denote a scalar.²
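To make this setup concrete, the sketch below shows one way such a binary tensor could be assembled from (head entity, relation, tail entity) index triples; the toy triple list and the dimensions are illustrative assumptions, not data from the paper.

```python
# Minimal illustrative sketch (not the paper's code): build the order-3 binary
# tensor X from a toy list of (head entity, relation, tail entity) index triples.
import numpy as np

triples = [(0, 0, 1), (1, 0, 2), (2, 1, 0)]   # hypothetical (head, relation, tail) indices
N_e, N_r = 3, 2                               # toy numbers of entities and relations

# X[:, :, k] is the adjacency matrix of the k-th binary relation.
X = np.zeros((N_e, N_e, N_r))
for head, rel, tail in triples:
    X[head, tail, rel] = 1.0
```

In practice such a tensor would typically be stored in a sparse format, since only a tiny fraction of its $N_e \times N_e \times N_r$ entries are nonzero.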
Mathematically, our objective is to reconstruct each of the $N_r$ relation slices $X_k$ of $\mathcal{X}$ as the product
\[
X_k \approx A_\alpha R_k A_\beta^\top . \tag{1}
\]
Recall that both $A_\alpha$ and $A_\beta$ are matrices: each row is the embedding of an entity. By changing the exact form of $A$ (that is, the number of different entity matrices, or the different ways to index $A$), we can then arrive at different models. These model variants encapsulate both mathematical and philosophical differences. In this paper, we specifically study two cases. First, we examine the case of having only a single entity embedding matrix, represented as $A$; that is, $A_\alpha = A_\beta = A$. This results in a quadratic reconstruction problem, as we approximate $X_k \approx A R_k A^\top$.
Second, we examine the case of having two separate entity embedding matrices, represented as $A_1$ and $A_2$. This results in a reconstruction problem that is linear in the entity embeddings, as we approximate $X_k \approx A_1 R_k A_2^\top$.
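As a rough illustration of the two variants, the following sketch forms both reconstructions of a single slice, with random matrices standing in for learned embeddings; the dimensions and variable names are assumptions made only for this example.

```python
# Illustrative sketch (not the learned model): reconstruct one relation slice under
# the quadratic (shared A) and linear (separate A1, A2) variants.
import numpy as np

N_e, p = 3, 4
A = np.random.randn(N_e, p)      # shared entity embeddings (quadratic case)
A1 = np.random.randn(N_e, p)     # first entity embedding matrix (linear case)
A2 = np.random.randn(N_e, p)     # second entity embedding matrix (linear case)
R_k = np.random.randn(p, p)      # embedding of relation k

X_k_quadratic = A @ R_k @ A.T    # X_k ≈ A R_k A^T
X_k_linear = A1 @ R_k @ A2.T     # X_k ≈ A_1 R_k A_2^T
```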
We learn $A_\alpha$, $A_\beta$, and $R$ by minimizing the augmented reconstruction loss
\[
\min_{A,R}\;
\underbrace{f(A, R)}_{\text{reconstruction loss}}
\;+\;
\overbrace{g(A, R)}^{\text{numerical regularization of the embeddings}}
\;+\;
\underbrace{f_s(A, R, C)}_{\text{knowledge-directed enrichment}} . \tag{2}
\]
The first term of (2) reflects each of the $N_r$ relational criteria given by (1). The second term employs standard numerical regularization of the embeddings, such as Frobenius-norm minimization, which enhances the algorithm's numerical stability and supports the interpretability of the resulting embeddings. The third term uses our similarity matrix $C$ to enrich the learning process with our extra knowledge.
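A minimal sketch of how the three terms of (2) could be combined is given below, assuming squared Frobenius norms for the reconstruction and regularization terms and a similarity-weighted penalty between relation embeddings for the enrichment term; the weights lam_reg and lam_sim and the exact form of each term are assumptions for illustration, not the paper's definitive choices.

```python
# Illustrative sketch of the augmented objective in (2) for the shared-embedding case.
import numpy as np

def augmented_loss(X, A, R, C, lam_reg=0.1, lam_sim=0.1):
    """X: N_e x N_e x N_r tensor, A: N_e x p, R: list of p x p matrices, C: N_r x N_r."""
    N_r = len(R)
    # Reconstruction loss f(A, R): slice-wise errors ||X_k - A R_k A^T||_F^2.
    f = sum(np.linalg.norm(X[:, :, k] - A @ R[k] @ A.T) ** 2 for k in range(N_r))
    # Numerical regularization g(A, R): Frobenius penalties on the embeddings.
    g = np.linalg.norm(A) ** 2 + sum(np.linalg.norm(R_k) ** 2 for R_k in R)
    # Knowledge-directed enrichment f_s(A, R, C): pull the embeddings of relations
    # that C marks as similar toward each other.
    f_s = sum(C[i, j] * np.linalg.norm(R[i] - R[j]) ** 2
              for i in range(N_r) for j in range(N_r))
    return f + lam_reg * g + lam_sim * f_s
```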
We first discuss how we construct the similarity matrix $C$ in Section 3.2 and then, starting in Section 3.3, describe
how the framework readily yields three novel embedding models, while also generalizing prior efforts. Throughout,
² We use the standard tensor notations and definitions in Kolda and Bader [10]. Recall that the Kronecker product $A \otimes B$ of an $(m_1, n_1)$ matrix $A$ and an $(m_2, n_2)$ matrix $B$ returns an $(m_1 m_2, n_1 n_2)$ block matrix, where each element of $A$ scales the entire matrix $B$.