Beihang-MSRA's Neural Matching Ranking System at the SemEval-2017 Community Question Answering Task


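At SemEval-2017 (the International Workshop on Semantic Evaluation), the Beihang University and Microsoft Research Asia team (Beihang-MSRA) took part in Task 3, Community Question Answering. Their paper describes a ranking system designed to capture semantic relations between text pairs that share little word overlap, addressing two common problems on community question answering (CQA) platforms: answer selection and question retrieval. In addition to traditional NLP features such as tf-idf, the longest common subsequence, translation models, and tree kernels, the system introduces neural network based matching features, which let it measure text similarity beyond the lexical level.

In Subtask A (answer selection, i.e. question-comment similarity) the system took second place, and in Subtask B (question retrieval, i.e. question-question similarity) it took fifth place. The paper's introduction outlines the background and goals of SemEval-2017 Task 3 and the central challenge, namely that sentences with similar meanings are often phrased with different words. The rest of the paper explains how neural matching models are combined with traditional NLP features to bridge that gap.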

Beihang-MSRA at SemEval-2017 Task 3: A Ranking System with Neural Matching Features for Community Question Answering

Wenzheng Feng†, Yu Wu†, Wei Wu‡, Zhoujun Li†∗, Ming Zhou‡

† State Key Lab of Software Development Environment, Beihang University, Beijing, China

‡ Microsoft Research, Beijing, China

{wuyu,lizj,wenzhengfeng}@buaa.edu.cn {wuwei,mingzhou}@microsoft.com

Abstract

This paper presents our system for SemEval-2017 Task 3, Community Question Answering (CQA). We develop a ranking system that is capable of capturing semantic relations between text pairs with little word overlap. In addition to traditional NLP features, we introduce several neural network based matching features which enable our system to measure text similarity beyond lexicons. Our system significantly outperforms baseline methods and holds the second place in Subtask A and the fifth place in Subtask B, which demonstrates its efficacy on answer selection and question retrieval.

1 Introduction

In Task 3 of SemEval-2017, participants are required to address typical problems in modern CQA forums. We participate in two subtasks: question-comment similarity (Subtask A) and question-question similarity (Subtask B). In Subtask A, given a question and 10 comments in its comment thread, one is required to re-rank the 10 comments according to their relevance to the question. Subtask B gives a question and asks participants to re-rank 10 related questions according to their similarity to the input question.

∗ Corresponding author

The challenge of both subtasks is that two natural language sentences often express similar meanings with different but semantically related words, which results in semantic gaps between them. To bridge the semantic gaps, we build a ranking system with a variety of features. In addition to traditional NLP features such as tf-idf (Salton and Buckley, 1988), the longest common subsequence (Allison and Dix, 1986), translation models (Jeon et al., 2005), and tree kernels (Schölkopf et al., 2003; Collins and Duffy, 2002; Moschitti, 2006), which match sentences based on word overlap, syntax (tree kernels), and word-word translations (translation models), we also introduce neural network based matching models into the system as features. The neural matching features, including a long short term memory network (LSTM) (Schuster and Paliwal, 1997) and a 2D matching network which is a variant of our model in (Wu et al., 2016), can extract high level matching signals from distributed representations of the sentences and capture their similarity beyond lexicons. We also design some specific features for each subtask. All the features are combined into a ranking model by a gradient boosted regression tree, implemented with Xgboost (Chen and Guestrin, 2016). Our system significantly outperforms baseline methods on the two subtasks. On Subtask A, it holds the second place and is comparable with the best system. On Subtask B, it holds the fifth place. The results demonstrate that our system can alleviate the semantic gaps in the tasks of CQA and effectively rank relevant comments and similar questions to high positions.
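To make the traditional lexical features mentioned above more concrete, the following is a minimal sketch of two of them, a tf-idf cosine similarity and a length-normalized longest common subsequence, computed for one question-candidate pair. It is an illustration only and not the authors' implementation: the function names, the smoothed idf weighting, and the normalization of the LCS by the longer sequence are all assumptions, and the inputs are assumed to be already tokenized.

```python
import math
from collections import Counter


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def tfidf_vector(tokens, doc_freq, n_docs):
    """Sparse tf-idf vector; the smoothed idf formula is an assumed choice."""
    counts = Counter(tokens)
    return {
        term: tf * (math.log((1 + n_docs) / (1 + doc_freq.get(term, 0))) + 1.0)
        for term, tf in counts.items()
    }


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def lexical_features(question, candidate, corpus):
    """Return [tf-idf cosine, normalized LCS] for one question-candidate pair.

    `corpus` is a list of token lists used only to estimate document
    frequencies (hypothetical helper, not from the paper).
    """
    n_docs = len(corpus)
    doc_freq = Counter(term for doc in corpus for term in set(doc))
    q_vec = tfidf_vector(question, doc_freq, n_docs)
    c_vec = tfidf_vector(candidate, doc_freq, n_docs)
    longest = max(len(question), len(candidate), 1)
    return [cosine(q_vec, c_vec), lcs_length(question, candidate) / longest]


if __name__ == "__main__":
    question = "where can i buy a cheap car".split()
    comments = [
        "you can buy a cheap car downtown".split(),
        "the weather is hot today".split(),
    ]
    corpus = [question] + comments
    for comment in comments:
        print(lexical_features(question, comment, corpus))
```

Both features depend on shared words, so a comment that answers the question with different vocabulary would score near zero here. That is the semantic gap the neural matching features are meant to address.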

2 System Description

Our system is built under a learning to rank framework (Liu et al., 2009). It takes a question and a group of candidates (comments or related questions) as input, and outputs a ranked list of the candidates based on scores of question-candidate pairs. The ranking scores are calculated in three steps: text preprocessing, feature extraction, and feature combination. In preprocessing, we replace special characters and punctuation marks with spaces, convert all letters to lowercase, remove stop-words, and conduct stemming and syntax analysis. Subsequently, we extract a variety of fea-
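The preprocessing and ranking steps described in this section can be sketched as below. This is a rough illustration under stated assumptions, not the authors' code: the stop-word list is a tiny made-up sample, `crude_stem` is a hypothetical stand-in for a real stemmer, syntax analysis is omitted, and `overlap_score` is a toy scorer standing in for the learned model that combines the features.

```python
import re

# Tiny illustrative stop-word list; the paper does not say which list was used.
STOP_WORDS = {"a", "an", "the", "is", "are", "in", "of", "to", "and", "i"}


def crude_stem(token):
    """Hypothetical stand-in for a real stemmer: strip a few common suffixes."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Replace non-alphanumerics with spaces, lowercase, drop stop-words, stem."""
    text = re.sub(r"[^0-9A-Za-z]+", " ", text)
    tokens = text.lower().split()
    return [crude_stem(t) for t in tokens if t not in STOP_WORDS]


def overlap_score(q_tokens, c_tokens):
    """Toy Jaccard overlap; a placeholder for the learned ranking model."""
    q, c = set(q_tokens), set(c_tokens)
    return len(q & c) / len(q | c) if q | c else 0.0


def rank_candidates(question, candidates, score_fn=overlap_score):
    """Score each question-candidate pair and return candidates, best first."""
    q_tokens = preprocess(question)
    scored = [(score_fn(q_tokens, preprocess(c)), c) for c in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [c for _, c in scored]


if __name__ == "__main__":
    ranked = rank_candidates(
        "Where can I buy a cheap car?",
        ["The weather is hot today.", "You can buy a cheap car downtown."],
    )
    for candidate in ranked:
        print(candidate)
```

Any function that maps a pair of token lists to a number can be passed as `score_fn`, so the same loop would work if the toy scorer were replaced by a model trained on extracted features.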
