A Joint Model for Word Alignment and Bilingual Named Entity Recognition Using Dual Decomposition


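This paper, "Joint Word Alignment and Bilingual Named Entity Recognition Using Dual Decomposition", is by Mengqiu Wang and Christopher D. Manning of Stanford University and Wanxiang Che of Harbin Institute of Technology. Its starting point is that translated bi-texts contain complementary language cues that can improve bilingual named entity recognition (NER). Earlier work showed that encouraging the two languages to agree on their tags improves bilingual tagging accuracy. However, most earlier bilingual tagging methods treat the word alignment as fixed input, so alignment mistakes can cascade into tagging errors. The authors observe that NER label information can itself be used to correct alignment mistakes. They therefore propose a graphical model that performs bilingual NER jointly with word alignment. The model combines two monolingual tagging models with two unidirectional word alignment models, and adds cross-lingual edge factors that encourage tagging and alignment decisions to agree. Decoding uses dual decomposition, which lets the model revise word alignments while it tags both languages and so reduces the errors introduced by relying on a fixed alignment. The approach aims to improve the accuracy and robustness of bilingual NER, and it is relevant to multilingual text mining and cross-lingual natural language processing because better entity tags give later analysis steps a more reliable basis.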

Joint Word Alignment and Bilingual Named Entity Recognition Using Dual Decomposition

Mengqiu Wang

Stanford University

Stanford, CA 94305

mengqiu@cs.stanford.edu

Wanxiang Che

Harbin Institute of Technology

Harbin, China, 150001

car@ir.hit.edu.cn

Christopher D. Manning

Stanford University

Stanford, CA 94305

manning@cs.stanford.edu

Abstract

Translated bi-texts contain complementary language cues, and previous work on Named Entity Recognition (NER) has demonstrated improvements in performance over monolingual taggers by promoting agreement of tagging decisions between the two languages. However, most previous approaches to bilingual tagging assume word alignments are given as fixed input, which can cause cascading errors. We observe that NER label information can be used to correct alignment mistakes, and present a graphical model that performs bilingual NER tagging jointly with word alignment, by combining two monolingual tagging models with two unidirectional alignment models. We introduce additional cross-lingual edge factors that encourage agreements between tagging and alignment decisions. We design a dual decomposition inference algorithm to perform joint decoding over the combined alignment and NER output space. Experiments on the OntoNotes dataset demonstrate that our method yields significant improvements in both NER and word alignment over state-of-the-art monolingual baselines.
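To make the idea of dual decomposition concrete, here is a minimal Python sketch of the generic agreement setting: two chain models are decoded separately with Viterbi, and Lagrange multipliers are adjusted until both produce the same tag sequence. This is an illustration under simplifying assumptions, not the model described in this paper: the function names (`viterbi`, `dual_decompose`), the toy scores, and the `step / iteration` learning rate are all hypothetical, and alignment variables and cross-lingual factors are left out.

```python
def viterbi(unary, trans):
    """Return the highest-scoring tag sequence of a chain model.

    unary[i][t] is the score of tag t at position i;
    trans[s][t] is the score of moving from tag s to tag t.
    """
    n, k = len(unary), len(unary[0])
    score = list(unary[0])
    back = []
    for i in range(1, n):
        prev, score, ptr = score, [], []
        for t in range(k):
            best = max(range(k), key=lambda s: prev[s] + trans[s][t])
            score.append(prev[best] + trans[best][t] + unary[i][t])
            ptr.append(best)
        back.append(ptr)
    tag = max(range(k), key=lambda t: score[t])
    path = [tag]
    for ptr in reversed(back):  # follow back-pointers to recover the path
        tag = ptr[tag]
        path.append(tag)
    return path[::-1]


def dual_decompose(unary_a, trans_a, unary_b, trans_b, iterations=50, step=1.0):
    """Subgradient dual decomposition for two models that must agree.

    Returns (tags, converged). When converged is True, both sub-models
    chose the same sequence, which certifies it as the joint optimum.
    """
    n, k = len(unary_a), len(unary_a[0])
    u = [[0.0] * k for _ in range(n)]  # Lagrange multipliers, one per (position, tag)
    ya = []
    for it in range(1, iterations + 1):
        # Each sub-problem is solved exactly, with the multipliers folded
        # into its unary scores (+u for model A, -u for model B).
        ya = viterbi([[unary_a[i][t] + u[i][t] for t in range(k)] for i in range(n)], trans_a)
        yb = viterbi([[unary_b[i][t] - u[i][t] for t in range(k)] for i in range(n)], trans_b)
        if ya == yb:
            return ya, True
        rate = step / it  # decaying step size (an assumption of this sketch)
        for i in range(n):
            if ya[i] != yb[i]:
                u[i][ya[i]] -= rate  # make A's choice less attractive to A, more to B
                u[i][yb[i]] += rate  # make B's choice more attractive to A, less to B
    return ya, False  # no agreement within the budget: fall back to model A


# Toy scores (hypothetical): two positions, two tags.
unary_a = [[0.0, 1.0], [0.5, 0.0]]
trans_a = [[0.2, 0.0], [0.0, 0.2]]
unary_b = [[0.0, -0.4], [0.0, 2.0]]
trans_b = [[0.0, 0.0], [0.0, 0.0]]

tags, converged = dual_decompose(unary_a, trans_a, unary_b, trans_b)
print(tags, converged)
```

On this toy input the two models disagree at first and then settle on a common sequence. The design point is that each sub-model only needs its own exact decoder; the coupling between them is handled entirely through the multipliers.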

1 Introduction

We study the problem of Named Entity Recognition (NER) in a bilingual context, where the goal is to annotate parallel bi-texts with named entity tags. This is a particularly important problem for machine translation (MT) since entities such as person names, locations, organizations, etc. carry much of the information expressed in the source sentence. Recognizing them provides useful information for phrase detection and word sense disambiguation (e.g., “melody” as in a female name has a different translation from the word “melody” in a musical sense), and can be directly leveraged to improve translation quality (Babych and Hartley, 2003). We can also automatically construct a named entity translation lexicon by annotating and extracting entities from bi-texts, and use it to improve MT performance (Huang and Vogel, 2002; Al-Onaizan and Knight, 2002). Previous work such as Burkett et al. (2010b), Li et al. (2012) and Kim et al. (2012) have also demonstrated that bi-texts annotated with NER tags can provide useful additional training sources for improving the performance of standalone monolingual taggers.

Because human translation in general preserves semantic equivalence, bi-texts represent two perspectives on the same semantic content (Burkett et al., 2010b). As a result, we can find complementary cues in the two languages that help to disambiguate named entity mentions (Brown et al., 1991). For example, the English word “Jordan” can be either a last name or a country. Without sufficient context it can be difficult to distinguish the two; however, in Chinese, these two senses are disambiguated: “乔丹” as a last name, and “约旦” as a country name.
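The “Jordan” example can be sketched in a few lines of Python. The probabilities below are invented for illustration, and `joint_tag` is a hypothetical helper that forces the two aligned words to share one tag by maximizing the product of the two monolingual probabilities. This hard agreement is a simplification; it only shows how a confident tagger in one language can resolve an ambiguity in the other.

```python
# Hypothetical monolingual tag distributions for one aligned word pair.
en_probs = {"PERSON": 0.5, "LOCATION": 0.5}    # English "Jordan": ambiguous
zh_probs = {"PERSON": 0.05, "LOCATION": 0.95}  # Chinese "约旦": country sense


def joint_tag(p_en, p_zh):
    """Pick the single tag that maximizes the product of both taggers' probabilities."""
    return max(p_en, key=lambda tag: p_en[tag] * p_zh.get(tag, 0.0))


print(joint_tag(en_probs, zh_probs))
```

With these made-up numbers the English tagger alone cannot choose between the two readings, while the joint decision follows the unambiguous Chinese side.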

In this work, we first develop a bilingual NER model (denoted as BI-NER) by embedding two monolingual CRF-based NER models into a larger undirected graphical model, and introduce additional edge factors based on word alignment (WA). Because the new bilingual model contains many cyclic cliques, exact inference is intractable. We employ a dual decomposition (DD) inference algorithm (Bertsekas, 1999; Rush et al., 2010) for performing approximate inference. Unlike most
