A Comparison of Pruning Methods for
CYK-based Decoding in Machine Translation
YuZe Gao
Natural Language Processing Lab
Northeastern University
yuze.gao@outlook.com
Natural Language Processing Lab
Northeastern University
xiaotong@mail.neu.edu.cn
Abstract
We present several popular pruning methods for CYK-based decoding in machine translation and describe their implementation. We then report experimental results for these methods and compare them. In addition, we analyze each method in terms of decoding speed and translation accuracy, and on this basis suggest possible optimizations for each method. Lastly, we propose some novel pruning methods for CYK-based decoding.
1 Introduction
In recent years, statistical machine translation (SMT) has been extensively investigated, showing state-of-the-art performance in many translation tasks. In the current SMT paradigm, a core step is to search for the "best" target string for a given source string, namely decoding. Several methods are available for implementing SMT decoders. For instance, we can incrementally add target words in a left-to-right fashion [Ortiz, 2003; Yang, 2010], or build translation hypotheses in a bottom-up fashion [Young, 1996]. One popular method is CYK-based decoding, which originates from monolingual parsing [Cheppalier, 1998].
In CYK decoders, a partial hypothesis is produced by combining hypotheses generated on smaller segments/spans (Fig. 1). The algorithm starts with the smallest spans and proceeds to larger spans once all possible hypotheses for a span have been generated. The final translation is obtained when the computation on the entire span is finished. The appeal of CYK-based decoding comes from its simplicity and from the natural manner in which one can build derivations using linguistically motivated grammars or formally syntactic rules. Therefore, it is widely used in hierarchical machine translation (MT) systems [Vilar, 2012; V. F. López, 2010; Chiang David, 2007].
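To make the bottom-up procedure concrete, the following is a minimal Python sketch of a CYK-style chart decoder. It is illustrative only, not the paper's actual system: `translate_phrase`, `combine`, and the (translation, score) tuple representation of hypotheses are hypothetical placeholders.

```python
from collections import defaultdict

def cyk_decode(words, translate_phrase, combine, beam_size=10):
    """Illustrative CYK-style decoder skeleton.

    translate_phrase(span_words) -> list of (translation, score) candidates
    combine(left, right)         -> candidates built from two sub-spans
    """
    n = len(words)
    chart = defaultdict(list)  # (start, end) -> hypotheses for that span

    for length in range(1, n + 1):              # smallest spans first
        for start in range(n - length + 1):
            end = start + length
            cands = list(translate_phrase(words[start:end]))
            for split in range(start + 1, end):  # chop the span in two
                for left in chart[(start, split)]:
                    for right in chart[(split, end)]:
                        cands.extend(combine(left, right))
            # keep only the best few hypotheses per span
            chart[(start, end)] = sorted(cands, key=lambda h: h[1],
                                         reverse=True)[:beam_size]
    return chart[(0, n)]
```

The per-span truncation to `beam_size` hypotheses is exactly where the pruning methods discussed later intervene.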
One bottleneck of SMT decoding is its speed.
In CYK-based decoding, there are two factors
that can slow down the system.
1) Large Search Space. Given a source string, the number of possible translations is huge. Even for a word-based model, decoding is an NP-complete problem [Knight, 1999]. The situation is worse for modern hierarchical MT models because more ambiguities are introduced by the underlying derivations of rules.
2) Cubic Time Complexity of CYK. The time complexity of the CYK algorithm is O(n³), i.e., the decoding time is cubic in the length of the input sentence. Decoding long sentences is thus a serious problem.
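The cubic cost can be verified by counting the (span, split-point) combinations that the three nested CYK loops visit; the small sketch below (illustrative only, with a hypothetical helper name) shows that the count grows as (n³ − n)/6:

```python
def cyk_work(n):
    # Number of (span, split-point) combinations visited by CYK on a
    # sentence of n words: every span of length >= 2 is chopped at
    # every possible internal split point.
    return sum(1
               for length in range(2, n + 1)
               for start in range(0, n - length + 1)
               for split in range(start + 1, start + length))
```

For n = 10 this yields 165 = (10³ − 10) / 6 combinations, and doubling the sentence length multiplies the work by roughly eight.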
Obviously, pruning is of great importance for speeding up CYK decoders. The simplest method is beam pruning, which keeps the most promising candidates within a certain score distance of the top-1 candidate and discards the rest [Koehn, 2004; Robert C 2007]. The decoder can run faster using cube pruning/growing, which is also popular in MT systems [Gesmundo et al., 2010]. Cube pruning is particularly powerful if one can organize the decoding problem as a search problem in hypergraphs.
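As an illustration of the beam-pruning idea described above, here is a hedged Python sketch; the `beam_width` and `beam_size` parameters and the (translation, score) tuple format are assumptions for this example, not the cited systems' actual settings.

```python
def beam_prune(hyps, beam_width=0.5, beam_size=50):
    """Keep hypotheses whose score lies within `beam_width` of the
    best score, capped at `beam_size` items (histogram pruning)."""
    if not hyps:
        return []
    ranked = sorted(hyps, key=lambda h: h[1], reverse=True)
    best_score = ranked[0][1]
    return [h for h in ranked[:beam_size]
            if best_score - h[1] <= beam_width]
```

A hypothesis scoring 0.6 survives a beam of width 0.5 around a top score of 1.0, while one scoring 0.2 is discarded even though it might have led to a better full translation; this risk of search errors is the usual price of beam pruning.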
In this paper, we empirically compare three pruning methods for CYK-based decoding that try to address or relieve its cubic time complexity (Fig. 1). In particular, we divide the CYK algorithm into two parts – a double loop over spans (O(n²)) and a loop over the segmentations that chop a given span (O(n)). We use the parse tree and punctuation information to prune unlikely spans (the two outermost loops), and use phrase
Proceedings of the 11th China Workshop on Machine Translation, pages 65–73, Sept. 23-25, 2015, Hefei, Anhui, China