MaxSD: An Optimized Evaluation Metric for Neural Machine Translation

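Summary: MaxSD is a neural-network-based metric for evaluating machine translation, optimized by maximizing the similarity distance between high-quality and low-quality hypotheses. During training, the method widens the gap between the scores assigned to good and bad translations; at test time, the trained network scores new translation hypotheses. MaxSD can efficiently incorporate lexical and syntactic metrics as network features and thereby capture linguistic information at different levels. In experiments on the WMT-14 dataset, it reaches state-of-the-art performance on two of five language pairs at the system level.

Detailed description: Machine translation (MT) is a major research direction in natural language processing that aims to translate text automatically from one language into another. Traditional evaluation methods such as BLEU and ROUGE rely mainly on exact n-gram matching, so they often fail to assess the semantic quality and grammatical structure of a translation fully.

MaxSD (maximum similarity distance) is a machine translation evaluation metric that uses a neural network model for finer-grained assessment. Its core idea is to maximize, during training, the difference between the similarity scores of high-quality and low-quality translation samples, so that the model learns the features that separate good translations from poor ones.

In the training phase, the network compares the similarity scores between the human translation (gold reference) and machine-generated translations (hypotheses). This training strategy lets the network capture more linguistic properties, including word choice, syntactic structure, and latent semantic information.

In the testing phase, the trained MaxSD model evaluates new translation hypotheses. Because the model has learned to separate high-quality from low-quality translations, its judgments are not limited to simple n-gram matching and take more complex linguistic structure and meaning into account.

On the WMT-14 dataset, MaxSD outperforms the previous best methods on two of the five language pairs at the system level and on one at the segment level, with comparable results on the remaining pairs. This suggests that combining features from different linguistic levels is useful for MT evaluation.

In short, MaxSD is a neural approach to machine translation evaluation that measures translation quality more comprehensively, particularly at the lexical and syntactic levels, and is therefore of both theoretical and practical interest for improving MT systems.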

MaxSD: A Neural Machine Translation Evaluation Metric Optimized by Maximizing Similarity Distance

Qingsong Ma (1,2), Fandong Meng (1,2), Daqi Zheng (1,2), Mingxuan Wang (1,2), Yvette Graham, Wenbin Jiang (1,2), Qun Liu (1,3)

(1) Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences

(2) University of Chinese Academy of Sciences

(3) ADAPT Centre, School of Computing, Dublin City University, Ireland

{maqingsong, mengfandong, zhengdaqi, wangmingxuan, jiangwenbin}@ict.ac.cn

qun.liu@dcu.ie, graham.yvette@gmail.com

Abstract. We propose a novel metric for machine translation evaluation based on neural networks. In the training phase, we maximize the distance between the similarity scores of high- and low-quality hypotheses. Then, the trained neural network is used to evaluate new hypotheses in the testing phase. The proposed metric can efficiently incorporate lexical and syntactic metrics as features in the network and is thus able to capture different levels of linguistic information. Experiments on WMT-14 show that state-of-the-art performance is achieved in two out of five language pairs at the system level and in one at the segment level. Comparable results are achieved in the remaining language pairs.

Keywords: machine translation evaluation, neural networks, similarity distance, maximization
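
The training objective stated in the abstract (push the similarity score of a high-quality hypothesis away from that of a low-quality one) can be illustrated with a small sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the sentence representations are toy vectors, similarity is cosine similarity, and the objective is written as a hinge loss with a hypothetical margin parameter.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_distance(ref, good_hyp, bad_hyp):
    """Gap between the similarity scores of the good and bad hypothesis."""
    return cosine(ref, good_hyp) - cosine(ref, bad_hyp)


def hinge_loss(ref, good_hyp, bad_hyp, margin=1.0):
    """Assumed loss form: zero once the gap reaches the (hypothetical) margin."""
    return max(0.0, margin - similarity_distance(ref, good_hyp, bad_hyp))


# Toy vectors standing in for learned sentence representations.
ref = [1.0, 0.0]
good = [1.0, 0.0]   # same direction as the reference
bad = [0.0, 1.0]    # orthogonal to the reference

distance = similarity_distance(ref, good, bad)
loss = hinge_loss(ref, good, bad)
swapped_loss = hinge_loss(ref, bad, good)  # hypotheses in the wrong order
```

Minimizing a loss of this shape is one common way to maximize a score gap; in a real system the vectors would come from the trained network rather than being fixed by hand.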

1 Introduction

With the development of machine translation (MT), MT evaluation (MTE) has received increasing attention. Traditional lexical-based metrics such as BLEU [8], Meteor [3], and TERp [11] take n-grams, synonyms, stems, word order, and phrases into account. However, metrics based on lexical and syntactic information are insufficient to evaluate the quality of the hypotheses, due to mismatch errors caused by limited synonyms and references.
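
The mismatch problem can be seen with a toy example. The sketch below computes clipped n-gram precision, the building block of BLEU-style metrics, against a single reference; it is a simplified illustration rather than a full BLEU implementation, and the function names and sentences are made up for this example.

```python
from collections import Counter


def ngrams(tokens, n):
    """All contiguous n-grams of a token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(hyp, ref, n):
    """Fraction of hypothesis n-grams found in the reference (counts clipped)."""
    hyp_counts = Counter(ngrams(hyp, n))
    ref_counts = Counter(ngrams(ref, n))
    total = sum(hyp_counts.values())
    if total == 0:
        return 0.0
    matched = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
    return matched / total


ref = "the film was great".split()
exact = "the film was great".split()
synonym = "the movie was excellent".split()  # same meaning, different words

p1_exact = clipped_precision(exact, ref, 1)
p1_syn = clipped_precision(synonym, ref, 1)
p2_syn = clipped_precision(synonym, ref, 2)
```

The paraphrase keeps the meaning of the reference, yet only half of its unigrams and none of its bigrams match, so a purely surface-based score would rate it poorly.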

Recently, semantic-based metrics have become more feasible with the help of deep learning. This paper presents an effective metric based on neural networks, i.e., a Bidirectional Long Short-Term Memory (Bi-LSTM) network [7, 10], for MTE. To capture the inner connection between hypotheses and references, we also explore the effect of an enhanced Bidirectional Combined LSTM (BiC-LSTM) network, which takes the concatenation of the hypothesis and the reference as
