A tensorized logic programming language for large-scale data
Ryosuke Kojima¹, Taisuke Sato²
¹Department of Biomedical Data Intelligence, Graduate School of Medicine,
Kyoto University, Kyoto, Japan.
²AI Research Center (AIRC),
National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan.
Abstract
We introduce a new logic programming language T-PRISM
based on tensor embeddings. Our embedding scheme is
a modification of the distribution semantics in PRISM,
one of the state-of-the-art probabilistic logic programming
languages, replacing distribution functions with multi-dimensional
arrays, i.e., tensors. T-PRISM consists of two parts: a logic
programming part and a numerical computation part. The former
provides flexible and interpretable modeling at the level of
first-order logic, and the latter provides scalable computation
utilizing parallelization and hardware acceleration with GPUs.
Combining these two parts enables a remarkably wide range of
high-level declarative modeling, from symbolic reasoning to
deep learning.
To embody this programming language, we also introduce a
new semantics, termed tensorized semantics, which combines
the traditional least model semantics in logic programming
with tensor embeddings. In T-PRISM, we first derive a set of
equations over tensors from a given program using logical
inference, i.e., Prolog execution in a symbolic space, and
then solve the derived equations in a continuous space using
TensorFlow.
Using our preliminary implementation of T-PRISM, we have
successfully handled a wide range of models, including
declarative modeling of real large-scale data. This paper
presents a DistMult model for knowledge graphs using the
FB15k and WN18 datasets.
1. Introduction
Logic programming provides concise expressions of knowledge
and has been proposed as a means of representing and modeling
various types of data for real-world AI systems. For example,
to deal with uncertain and noisy data, probabilistic logic
programming (PLP) has been extensively studied (Kimmig et al. 2011;
Wang, Mazaitis, and Cohen 2013; Sato and Kameya 2008).
PLP systems allow users to flexibly and clearly describe
stochastic dependencies and relations between entities using
logic programming. In a different context, to handle a wide
range of applications, the unification of neural networks and
approximate reasoning by embedding symbols into continuous
spaces has recently been
proposed (Manhaeve et al. 2018; Rocktäschel and Riedel 2017;
Evans and Grefenstette 2018).
In this paper, we tackle the task of combining symbolic
reasoning and multi-dimensional continuous-space embeddings
of logical constructs such as clauses, and explore a new
approach that compiles a program written in a declarative
language into a procedure of numerical calculation suitable
for large-scale data. Such languages are expected to be
interpretable as programming languages while being efficiently
executable at the level of numerical calculation, such as
vector computation. Aiming at this goal, we introduce tensorized
semantics, a novel algebraic semantics interfacing a symbolic
reasoning layer and a numerical computation layer, and propose
a new modeling language, “T-PRISM”. It is based on an
existing probabilistic logic programming language, PRISM
(Sato and Kameya 2008), and implements the tensorized
semantics for large-scale datasets.
Thus the first contribution of this paper is the introduction
of a new semantics, tensorized semantics. The current PRISM
has the distribution semantics (Sato 1995), which probabilistically
generalizes the least model semantics in logic programming to
coherently assign probabilities to logical constructs. Likewise,
tensorized semantics assigns tensors¹ to logical constructs
based on the least model semantics. Both PRISM and T-PRISM
programs are characterized by a set of equations for the
assigned quantities.
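For intuition, the following is a hedged sketch of what such equations look like; the tensorized form shown here is an illustration by analogy, not the formal definition. In PRISM, the probability of a goal $G$ with explanations $\mathrm{expl}(G)$ satisfies the sum-product equation

$$P(G) = \sum_{E \in \mathrm{expl}(G)} \; \prod_{\mathtt{msw}(i,v) \in E} \theta_{i,v},$$

where $\theta_{i,v}$ is the parameter of the probabilistic choice $\mathtt{msw}(i,v)$. Tensorized semantics keeps this sum-product structure but replaces the scalar parameters with tensors and the scalar product with tensor multiplication (contraction), so the quantity assigned to $G$ is itself a tensor.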
One may view T-PRISM as one approach to an equation-level
interface connecting logic programming and continuous-space
embeddings. Another approach is a predicate-level interface,
exemplified by DeepProblog (Manhaeve et al. 2018), which
provides special predicates connecting neural networks and
probabilistic logic programming via ProbLog. At the
implementation level, this approach separates neural networks
from probabilistic models but syntactically integrates them,
for example combining image recognition and logical reasoning
over estimated labels. Unlike our approach, it does not allow
constants and predicates to have corresponding vector (tensor)
representations in neural networks.
The second contribution of this paper is an implementation
methodology for T-PRISM's tensorized semantics.
¹ In this paper, the term “tensor” is used interchangeably with
“multi-dimensional array”.