This article examines methods for building end-to-end, open-domain dialogue systems from large dialogue corpora, focusing on generative hierarchical neural network models. The central idea of the generative approach is that system responses are produced by a neural network word by word, each word conditioned on what has been generated so far, which opens the door to more natural and flexible human-machine interaction. The authors concentrate on applying the recently proposed Hierarchical Recurrent Encoder-Decoder (HRED) model to the dialogue task; on natural language understanding and generation, the model is competitive with state-of-the-art neural language models and back-off n-gram models.

HRED's strength lies in its multi-level structure: an encoder first models the input dialogue history, capturing the topic and context of the conversation, and the decoder then uses this information to generate a response. This design lets the model track the dialogue's history and produce coherent, relevant replies. The paper also discusses the limitations of such methods, including the risk of overfitting, limited diversity in the generated content, and the difficulty of handling long-range dependencies.

Through comparative experiments the authors validate the effectiveness of the HRED model, and they propose strategies for improving performance, such as adjusting the model structure, introducing attention mechanisms, or integrating more sophisticated language models. They hope this work offers a valuable direction for developing smarter, more adaptable dialogue systems and advances dialogue technology, particularly in artificial intelligence.

The paper is significant for understanding and designing end-to-end, open-domain dialogue systems: it demonstrates the potential of generative hierarchical neural network models for dialogue generation and points later researchers toward directions for further exploration. By understanding and addressing the problems generative models face in dialogue systems, we can expect more human-like interaction to be realized in practical applications.
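The encoder/context/decoder structure described above can be made concrete with a small sketch. The following is a minimal illustration, not the authors' implementation: it uses plain tanh RNN cells in NumPy where the paper uses trained GRU units, and all names, dimensions, and token ids are invented for the example. It shows the three levels of HRED: an utterance-level encoder, a dialogue-level context RNN, and a decoder conditioned on the context state.

```python
import numpy as np

rng = np.random.default_rng(0)

V, E, H = 12, 8, 16  # vocab size, embedding dim, hidden dim (toy values)

# Parameters (randomly initialised here; a real model would learn these).
emb = rng.normal(scale=0.1, size=(V, E))
W_enc = rng.normal(scale=0.1, size=(H, E + H))      # utterance-level encoder RNN
W_ctx = rng.normal(scale=0.1, size=(H, H + H))      # context (dialogue-level) RNN
W_dec = rng.normal(scale=0.1, size=(H, E + H + H))  # decoder RNN, conditioned on context
W_out = rng.normal(scale=0.1, size=(V, H))          # hidden state -> vocabulary logits

def rnn_step(W, x, h):
    """One tanh RNN step: h' = tanh(W @ [x; h])."""
    return np.tanh(W @ np.concatenate([x, h]))

def encode_utterance(token_ids):
    """Encoder RNN over one utterance; its last state summarises the utterance."""
    h = np.zeros(H)
    for t in token_ids:
        h = rnn_step(W_enc, emb[t], h)
    return h

def hred_next_word_dist(dialogue, prefix):
    """Distribution over the next response word, given the dialogue history
    (a list of utterances) and the response prefix generated so far."""
    # 1) Utterance encoder: one vector per past utterance.
    utt_vecs = [encode_utterance(u) for u in dialogue]
    # 2) Context RNN: summarise the sequence of utterance vectors.
    c = np.zeros(H)
    for v in utt_vecs:
        c = rnn_step(W_ctx, v, c)
    # 3) Decoder RNN: generate word by word, conditioned on the context state.
    h = np.zeros(H)
    for t in prefix:
        h = np.tanh(W_dec @ np.concatenate([emb[t], c, h]))
    logits = W_out @ h
    p = np.exp(logits - logits.max())  # softmax over the vocabulary
    return p / p.sum()

dialogue = [[1, 4, 2], [5, 3]]  # two past utterances as toy token ids
p = hred_next_word_dist(dialogue, prefix=[7])
print(p.shape)  # (12,): one probability per vocabulary word
```

Generation proceeds by sampling (or taking the argmax of) this distribution, appending the chosen token to `prefix`, and repeating, so each word depends on the full dialogue context and on the words already emitted.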
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models

Iulian V. Serban*, Alessandro Sordoni*, Yoshua Bengio¹*, Aaron Courville* and Joelle Pineau†

*Department of Computer Science and Operations Research, Université de Montréal, Montreal, Canada
{iulian.vlad.serban,alessandro.sordoni,yoshua.bengio,aaron.courville} AT umontreal.ca
†School of Computer Science, McGill University, Montreal, Canada
jpineau AT cs.mcgill.ca
Abstract

We investigate the task of building open domain, conversational dialogue systems based on large dialogue corpora using generative models. Generative models produce system responses that are autonomously generated word-by-word, opening up the possibility for realistic, flexible interactions. In support of this goal, we extend the recently proposed hierarchical recurrent encoder-decoder neural network to the dialogue domain, and demonstrate that this model is competitive with state-of-the-art neural language models and back-off n-gram models. We investigate the limitations of this and similar approaches, and show how its performance can be improved by bootstrapping the learning from a larger question-answer pair corpus and from pretrained word embeddings.
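The back-off n-gram models the abstract uses as baselines are worth making concrete. The sketch below is my illustration, not the paper's experimental setup: it scores a next word with a bigram model that backs off to unigram relative frequencies when the bigram is unseen, in the style of "stupid backoff" with an assumed fixed back-off factor of 0.4. The corpus and tokens are invented.

```python
from collections import Counter

# Toy corpus of tokenised utterances (invented for illustration).
corpus = [
    ["<s>", "how", "are", "you", "</s>"],
    ["<s>", "how", "is", "it", "</s>"],
    ["<s>", "i", "am", "fine", "</s>"],
]

unigrams = Counter(w for sent in corpus for w in sent)
bigrams = Counter((a, b) for sent in corpus for a, b in zip(sent, sent[1:]))
total = sum(unigrams.values())

def score(prev, word, alpha=0.4):
    """Stupid-backoff score: relative bigram frequency if the bigram was seen,
    otherwise alpha times the unigram relative frequency."""
    if bigrams[(prev, word)] > 0:
        return bigrams[(prev, word)] / unigrams[prev]
    return alpha * unigrams[word] / total

print(score("how", "are"))   # seen bigram: count("how are") / count("how") = 1/2
print(score("how", "fine"))  # unseen bigram: backs off to 0.4 * count("fine") / total
```

Such count-based baselines are cheap and strong at short range, which is what makes the neural models' competitiveness with them a meaningful result.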
Introduction

Dialogue systems, also known as interactive conversational agents, virtual agents and sometimes chatterbots, are used in a wide set of applications ranging from technical support services to language learning tools and entertainment (Young et al. 2013; Shawar and Atwell 2007). Dialogue systems can be divided into goal-driven systems, such as technical support services, and non-goal-driven systems, such as language learning tools or computer game characters. Our current work focuses on the second case, due to the availability of large corpora of this type, though the model may eventually prove useful for goal-driven systems also.

Perhaps the most successful approach to goal-driven systems has been to view the dialogue problem as a partially observable Markov decision process (POMDP) (Young et al. 2013). Unfortunately, most deployed dialogue systems use hand-crafted features for the state and action space representations, and require either a large annotated task-specific corpus or a horde of human subjects willing to interact with the unfinished system. This not only makes it expensive and time-consuming to deploy a real dialogue system, but also limits its usage to a narrow domain. Recent work has tried to push goal-driven systems towards learning with few examples using constraints on the POMDP (Gasic et al. 2013) as well as learning the observed features themselves with neural network models (Henderson, Thomson, and Young 2014), yet such approaches still require either hand-crafted features or large corpora of annotated task-specific simulated conversations.

¹ Y.B. is a CIFAR Senior Fellow.
Copyright © 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
On the other end of the spectrum are the non-goal-driven systems (Ritter, Cherry, and Dolan 2011; Banchs and Li 2012; Ameixa et al. 2014). Most recently Sordoni et al. (2015b) and Shang et al. (2015) have drawn inspiration from the use of neural networks in natural language modeling and machine translation tasks (Cho et al. 2014). There are several motivations for developing non-goal-driven systems. First, they may be deployed directly for tasks which do not naturally exhibit a directly measurable goal (e.g. language learning) or simply for entertainment. Second, if they are trained on corpora related to the task of a goal-driven dialogue system (e.g. corpora which cover conversations on similar topics) then these models can be used to train a user simulator, which can then train the POMDP models discussed earlier (Young et al. 2013; Pietquin and Hastie 2013). This would alleviate the expensive and time-consuming task of constructing a large-scale task-specific dialogue corpus. In addition to this, the features extracted from the non-goal-driven systems may be used to expand the state space representation of POMDP models (Singh et al. 2002). This can help generalization to dialogues outside the annotated task-specific corpora.
Our contribution is in the direction of end-to-end trainable, non-goal-driven systems based on generative probabilistic models. We define the generative dialogue problem as modeling the utterances and interactive structure of the dialogue. As such, we view our model as a cognitive system, which has to carry out natural language understanding, reasoning, decision making and natural language generation in order to replicate or emulate the behavior of the agents in the training corpus. Our approach differs from previous work on learning dialogue systems through interaction with humans (Young et al. 2013; Gasic et al. 2013; Cantrell et al. 2012; Mohan and Laird 2014), because it learns off-line through examples of human-human dialogues and aims to emulate the dialogues in the training corpus instead of maximize a task-specific objective function. Contrary to explanation-based learning (Mohan and Laird 2014) and rule-based inference systems (Langley et al. 2014), our model does not require a predefined state or action space representation. These representations are instead learned

arXiv:1507.04808v3 [cs.CL] 6 Apr 2016
(Only the first page of the paper is reproduced here; the remaining seven pages are omitted.)