OMCSNet：常识推理工具包详解

需积分: 9 134 浏览量更新于2024-07-29 收藏 354KB PDF 举报

"OMCSNet常识库是一个由MIT实验室开发的语义网常识库，包含超过25万个常识元素，支持C++、Java和Python等多种编程语言的开发。该库受到Cyc和WordNet的启发，融合了广泛的概念和关系，并以简单易用的语义网络结构呈现。OMCSNet不仅可用于查询扩展和语义相似性计算，还能进行时间、空间、情感等多种推理。其附带的OMCSNet工具包提供了传播激活、类比和概念路径查找等功能，便于文本推理任务。此外，OMCSNet经过定量和定性分析，证明了其在常识推理中的实用性和有效性。" OMCSNet常识库是一个强大的工具，旨在促进常识推理和自然语言处理任务。它的核心是庞大的语义网络，包含了大量的常识知识，这些知识由“语义片段”构成，涵盖日常生活中广泛的概念和关系。这些概念和关系的多样性使得OMCSNet在处理复杂的人类理解任务时具有很高的潜力。受Cyc项目的影响，OMCSNet包含了丰富的概念和关系，这些元素超越了传统的词汇表或词典，深入到人类认知的层次。另一方面，它借鉴了WordNet的结构，创建了一个简单易用的框架，用户可以轻松地探索和操作这些常识知识。 OMCSNet工具包是一个配套的推理系统，提供了多种功能以支持文本理解。例如，传播激活机制允许从一个概念出发，自动激活与之相关的一系列概念，这有助于扩展和深化理解。类比功能则允许在不同的概念之间建立联系，以进行创新性的思考。路径查找工具则能帮助找出概念之间的关系路径，这对于推理和问题解答非常有用。为了评估OMCSNet的效果，研究人员进行了定量和定性分析，包括比较其在查询扩展、语义相似度评估以及各种推理任务上的性能。这些分析结果证实了OMCSNet在提升文本理解和推理能力方面的价值。最后，OMCSNet的开发者还指出了一些潜在的应用方向，如人工智能对话系统、机器学习模型的增强以及情感分析等。作为一个开源资源，OMCSNet为学术界和工业界提供了宝贵的资源，促进了常识推理和自然语言处理技术的发展。

2.1 History of OMCSNet

Building large-scale databases of commonsense knowledge is not a trivial task. One

problem is scale. It has been estimated that the scope of common sense may involve

many tens of millions of pieces of knowledge (Mueller, 2001). Unfortunately, com-

mon sense cannot be easily mined from dictionaries, encyclopedias, the web, or

other corpora because it consists largely of knowledge obvious to a reader, and thus

omitted. Indeed, it likely takes much common sense to even interpret dictionaries

and encyclopedias. Until recently, it seemed that the only way to build a common-

sense knowledge base was through the expensive process of hiring an army of

knowledge engineers to hand-code each and every fact.

However, in recent years we have been exploring a new approach. Inspired by

the success of distributed and collaborative projects on the Web, Singh et al. turned

to volunteers from the general public to massively distribute the problem of building

a commonsense knowledgebase. Three years ago, the Open Mind Commonsense

(OMCS) web site (Singh et al. 2002) was built, a collection of 30 different activities,

each of which elicits a different type of commonsense knowledge—simple asser-

tions, descriptions of typical situations, stories describing ordinary activities and

actions, and so forth. Since then the website has gathered over 675,000 items of

commonsense knowledge from over 13,000 contributors from around the world,

many with no special training in computer science. The OMCS corpus now consists

of a tremendous range of different types of commonsense knowledge, expressed in

natural language.

The earliest applications of the OMCS corpus made use of its knowledge not di-

rectly, but by first extracting into semantic networks only the types of knowledge

they needed. For example, the ARIA photo retrieval system (Liu & Lieberman,

2002) extracted taxonomic, spatial, functional, causal, and emotional knowledge

from OMCS to improve information retrieval. This suggested a new approach to

building a commonsense knowledgebase. Rather than directly engineering the

knowledge structures used by the reasoning system, as is done in Cyc, OMCS en-

courages people to provide information clearly in natural language, and then from

this semi-structured English sentence corpus, we are able to extract more usable

knowledge representations and generate useable knowledge bases. In OMCSNet,

we reformulated the knowledge in OMCS into a system of binary relations which

constitute a semantic network. This allows us to apply graph-based methods when

reasoning about text.

2.2 Generating OMCSNet from the OMCS corpus

The current OMCSNet is produced by an automatic process, which applies a set of

‘commonsense extraction rules’ to the semi-structured English sentences of the

OMCS corpus. The key to being able to do this is that the OMCS website already

elicits knowledge in a semi-structured way by prompting users with fill-in-the-blank

templates (e.g. “The effect of [falling off a bike] is [you get hurt]”). A pattern

matching parser uses roughly 40 mapping rules to easily parse semi-structured sen-

剩余15页未读，继续阅读

junjunyeti

粉丝: 2
资源: 25

OMCSNet：常识推理工具包详解

open3d-0.11.2-cp36-cp36m-win_amd64.whl

SpringBoot是一个经典实用的员工管理系统。集成了Mybatis在MySQL数据库上的

nipy-0.4.1-cp35-cp35m-win_amd64.whl

算法部署-使用OpenVINO在FPGA上部署人脸检测算法-附详细流程教程+项目源码-优质项目实战.zip

Pillow_SIMD-6.0.0.post0+avx2-cp35-cp35m-win_amd64.whl

单片机控制LED点阵显示课程设计.doc

这是一个将Shiro权限控制与SpringBoot集成在一起的项目，使用Redis进行缓

Pillow_SIMD-7.0.0.post3-cp36-cp36m-win_amd64.whl

一个只完成前端部分而没有整个后端数据库的虚假留言板。基于HTML+CSS+JS实现，实现了登录跳转和页面消息功能.zip

数据面板_图表_表格.zip

最新资源