基于HowNet的神经机器翻译未知词处理方法

5 浏览量更新于2024-08-29 收藏 586KB PDF 举报

本文主要探讨了神经机器翻译（NMT）系统中处理未知词的一种创新方法，该方法利用HowNet来改进传统的未知词处理策略。神经机器翻译在处理未在训练语料库中出现的新词汇时，往往表现不足，这限制了其翻译质量。传统的解决方案通常依赖大规模单语语料库训练的词向量，通过相似度匹配替换未知词。然而，这种做法存在两个主要问题：首先，对于未知词的词向量质量不高，可能导致翻译不准确；其次，处理多义词时，词向量方法难以区分不同含义。 HowNet是一个丰富的汉语概念知识库，它包含了丰富的语义信息，包括概念和隐喻关系。作者提出了一种结合HowNet的未知词处理方法。该方法利用HowNet中的概念和语义信息，寻找与输入未知词最相关的词汇作为替代。通过这种方式，可以提高词向量的质量，并更好地处理多义词，因为HowNet能够提供更深层次的语义理解。实验结果显示，这种方法不仅提升了神经机器翻译系统的性能，而且在处理未知词的准确性和多样性方面优于传统方法。通过将领域特定的语义知识与NMT模型相结合，该研究旨在增强机器翻译的鲁棒性，使得模型能够更自然地处理未曾见过但具有类似语义的词汇。这项工作为解决神经机器翻译中的未知词问题提供了一个有前景的解决方案，有望推动机器翻译技术向更高效、准确的方向发展。

A Method of Unknown Words Processing

for Neural Machine Translation Using HowNet

Shaotong Li, JinAn Xu

(&)

, Yujie Zhang, and Yufeng Chen

School of Computer and Information Technology,

Beijing Jiaotong University, Beijing, China

{shaotongli,jaxu,yjzhang,chenyf}@bjtu.edu.cn

Abstract. An inherent weakness of neural machine translation (NMT) systems

is their inability to correctly translate unknown words. Traditional unknown

words processing methods are usually based on word vectors trained on large

scale of monolingual corpus. Replacing the unknown words according to the

similarity of word vectors. However, it suffers from two weaknesses: Firstly, the

resulting vectors of unknown words are not of high quality; Secondly, it is

difﬁcult to deal with polysemous words. This paper proposes an unknown word

processing method by integrating HowNet. Using the concepts and sememes in

HowNet to seek the replacement words of unknown words. Experimental results

show that our proposed method can not only improves the performance of

NMT, but also provides some advantages compared with the traditional

unknown words processing methods.

Keywords: NMT

 Unknown words  HowNet  Concept  Sememe

End-to-End NMT is a kind of machine translation method proposed in recent years [1–

4]. Most of the NMT systems are based on the encoder-decoder framework, the

encoder encodes the source sentence into a vector, and the decoder decodes the vector

into the target sentence. Compared with the traditional statistical machine translation

(SMT), NMT has many advantages, and has shown greatly performance in many

translation tasks.

But NMT still has the problem of unknown words which is caused by the limited

vocabulary scale. In order to control the temporal and spatial expenses of the model,

NMT usually uses small vocabularies in the source side and the target side [5]. The

words that are not in the vocabulary are unknown words, which will be replaced by an

“UNK” symbol. A feasible method to solve this problem is to ﬁnd out the substitute

in-vocabulary words of the unknown words. Li et al. proposed a replacing method

based on word vector similarity [5], the unknown words are replaced by the synonyms

in the vocabulary through the cosine distance of the word vector and the language

model. However, there are some unavoidable problems with this method. Firstly, the

vectors of rare words are difﬁcult to train; Secondly, the trained word vectors cannot

express various semantics of the polysemous words and cannot adapt to the replace-

ment of the polysemous words in different contexts.

To solve these problems, this paper proposes an unknown words processing

method based on HowNet. This met hod uses HowNet’s concept s and sememes as well

D.F. Wong and D. Xiong (Eds.): CWMT 2017, CCIS 787, pp. 20–29, 2017.

https://doi.org/10.1007/978-981-10-7134-8_3

下载后可阅读完整内容，剩余9页未读，立即下载

weixin_38742927

粉丝: 9
资源: 936

基于HowNet的神经机器翻译未知词处理方法

MIRROR-GENERATIVE NEURAL MACHINE TRANSLATION.pdf

Neural Machine Translation_translation_机器翻译_

论文解读：DTMT: A Novel Deep Transition Architecture for Neural Machine Translation

Code to train state-of-the-art Neural Machine Translation

Stronger Baselines for Trustable Results in Neural Machine Translation

MaxSD: A Neural Machine Translation Evaluation Metric Optimized by Maximizing Similarity Distance

Neural machine translation by jointly learning to align and translate

Google's Neural Machine Translation System - Bridging the Gap between Human and Machine Translation - 2016 (1609.08144v1)-计算机科学

Neural Machine Translation by Jointly Learning to Align and Translate阅读笔记

Convergence of gradient method with penalty for Ridge Polynomial neural network

最新资源