Iterative Polishing Framework: Quality-Aware, BERT-Based Chinese Poetry Generation


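"BERT NLP - An Iterative Polishing Framework based on Quality Aware Masked Language Model for Chinese Poetry Generation". This paper presents an iterative polishing framework built on a Quality-Aware Masked Language Model (QA-MLM) for generating high-quality Chinese poetry. Automatic composition of Chinese poetry is challenging in artificial intelligence because of its distinctive literary and aesthetic characteristics, and it usually cannot be achieved directly with a simple end-to-end method. The framework raises the quality of a poem step by step in two main stages.

First, an encoder-decoder structure generates a poem draft. Such a structure typically consists of an encoder that interprets the input and a decoder that generates the output sequence: the encoder converts the input into an intermediate representation, and the decoder generates verses from that representation to form a first draft.

Next comes the focus of the paper, the proposed QA-MLM, which plays the key role in the iterative polishing stage. QA-MLM can both assess the quality of a poem draft and locate the places that need improvement. This follows from its multi-task learning strategy: the model considers the poem's linguistic features and its literary quality together, and from them decides whether polishing is needed and where. As a result, QA-MLM can make targeted improvements to fluency and literary value while preserving the poem's original style.

The masked language modeling technique behind QA-MLM is also worth noting. It is typically used during pre-training: part of the input sequence is randomly masked and the model predicts the masked tokens, thereby learning the regularities of the language. In poetry generation, this technique may be used to identify and correct inappropriate or discordant words, so that the generated poem is both grammatical and poetic.

Through iteration and refinement, the framework proposed in the paper markedly improves the linguistic expression and artistry of machine-generated Chinese poetry. This opens new possibilities for AI in Chinese poetry writing and shows the potential of deep learning for complex text generation tasks. Future research may further optimize the framework so that it applies to other kinds of text generation, such as classical prose or song lyrics, or explore how to incorporate more cultural background and emotional elements to produce more vivid and expressive text.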
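The random-masking step described in the summary above can be illustrated with a minimal sketch. It is a simplification under stated assumptions, not the paper's implementation: `mask_tokens`, `mask_prob`, and the `MASK` placeholder are hypothetical names, each character is treated as one token, and every selected position is simply replaced by the mask symbol (BERT's published recipe also sometimes keeps or randomizes selected tokens, which is omitted here).

```python
import random

MASK = "[MASK]"  # placeholder symbol; the name is an assumption for this sketch


def mask_tokens(tokens, mask_prob=0.15, seed=None):
    """Randomly hide tokens and record what the model should recover.

    Returns (masked, labels): `masked` is the corrupted sequence, and
    `labels[i]` is the original token where position i was masked,
    otherwise None (the position is not scored).
    """
    rng = random.Random(seed)
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)
            labels.append(tok)   # training target: the hidden token
        else:
            masked.append(tok)
            labels.append(None)  # visible context, no prediction needed
    return masked, labels


# Treat each Chinese character as one token (an assumption of this sketch).
line = list("床前明月光")
masked, labels = mask_tokens(line, mask_prob=0.3, seed=0)
print(masked, labels)
```

Because the model sees the unmasked characters on both sides of each hidden position, its prediction can draw on the whole line rather than only on the characters to the left.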

An Iterative Polishing Framework based on Quality Aware Masked Language Model for Chinese Poetry Generation

Liming Deng, Jie Wang, Hangming Liang, Hui Chen, Zhiqiang Xie∗, Bojin Zhuang, Shaojun Wang, Jing Xiao

Ping An Technology

Ping An Insurance (Group) Company of China

University of Science and Technology of China

dengliming777@pingan.com.cn, photonicsjay@163.com

Abstract

Owing to its unique literary and aesthetic characteristics, automatic generation of Chinese poetry remains challenging in Artificial Intelligence and can hardly be realized straightforwardly by end-to-end methods. In this paper, we propose a novel iterative polishing framework for high-quality Chinese poetry generation. In the first stage, an encoder-decoder structure is used to generate a poem draft. Afterwards, our proposed Quality-Aware Masked Language Model (QA-MLM) is employed to polish the draft towards higher quality in terms of linguistics and literariness. Based on a multi-task learning scheme, QA-MLM is able to determine from the poem draft whether polishing is needed. Furthermore, QA-MLM is able to localize improper characters in the poem draft and substitute them with newly predicted ones. Benefiting from the masked language model structure, QA-MLM incorporates global context information into the polishing process, which can yield more appropriate polishing results than unidirectional sequential decoding. Moreover, the iterative polishing process terminates automatically when QA-MLM regards the processed poem as a qualified one. Both human and automatic evaluations have been conducted, and the results demonstrate that our approach effectively improves the performance of the encoder-decoder structure.

Introduction

Chinese poetry, which originated in people's work and daily life, has a long history. It developed from poems with few characters and vague rules into forms with fixed numbers of characters and lines and stable rules. Rules such as tonal patterns and rhyme schemes make poems easy to read and remember. Great poems, which touch the hearts of millions of people across space and time, should unify concise form, refined language and rich content to ensure their lasting appeal. Writing great poems is not easy: it requires a strong desire on the poet's part to express feelings, views or thoughts, followed by careful choice of characters and construction of sentences.

Poets are often regarded as geniuses with great talent who are well trained in writing poems. Writing a poem is hard for ordinary people, let alone for computers. Although many works (Gervás 2001; Ghazvininejad et al. 2016; Yi et al. 2018; Li et al. 2018) have addressed automatic poetry generation, and poetic rules and forms can be partially learned, large gaps remain in the meaningfulness and coherence of generated poems.

∗ This work was done when Zhiqiang Xie was at Ping An Technology.

In this paper, we focus on automatic Chinese poetry generation and aim to fill these gaps. We notice that poets first write a draft and then polish it many times until it is perfected. There is a popular story about poem polishing by Jia Dao, a famous poet of the Tang Dynasty, who influenced many later poets to polish their poems intensively. Motivated by this writing process, we aim to imitate it and improve the coherence and meaningfulness of primitive poems. However, it is challenging for computer algorithms to automatically polish a poem draft into an excellent one. Computer algorithms cannot choose characters and sentences as poets do, with intuition and a comprehensive understanding of the characters; they are only good at calculating the probabilities of characters and picking those with maximum probability from the vocabulary. Three key issues must be addressed for the polishing framework.

• Does the text need to be polished, and when should the iterative polishing process stop?

• Which characters in the text are improper and need to be replaced with better ones?

• How can the better ones be obtained?

To address these key issues and further improve the quality of generated poems, we propose a Quality-Aware Masked Language Model (QA-MLM) to implement an iterative polishing process. To the best of our knowledge, this is the first work to solve the three key issues of the polishing framework with one elegant model.
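The three key issues map naturally onto a simple control loop, sketched below under stated assumptions. This is an illustration of the idea, not the authors' implementation: `polish`, `locate`, `predict`, and the `max_rounds` safety cap are hypothetical names, and the two toy functions merely stand in for a trained model by comparing against a fixed reference line.

```python
from typing import Callable, List, Optional

MASK = "[MASK]"  # placeholder symbol; an assumption of this sketch


def polish(draft: List[str],
           locate: Callable[[List[str]], Optional[int]],
           predict: Callable[[List[str], int], str],
           max_rounds: int = 10) -> List[str]:
    """Iteratively replace improper characters until the poem is accepted.

    locate(poem)       -> index of an improper character, or None when the
                          poem is judged qualified (issues 1 and 2).
    predict(masked, i) -> replacement character for the masked position i,
                          chosen with the full context visible (issue 3).
    max_rounds is a hypothetical cap so the loop always terminates.
    """
    poem = list(draft)  # work on a copy; the draft is left untouched
    for _ in range(max_rounds):
        pos = locate(poem)
        if pos is None:  # quality check passed: stop polishing
            break
        masked = poem[:pos] + [MASK] + poem[pos + 1:]
        poem[pos] = predict(masked, pos)
    return poem


# Toy stand-ins for a trained model (assumptions for illustration only).
REFERENCE = list("床前明月光")


def toy_locate(poem: List[str]) -> Optional[int]:
    for i, (got, want) in enumerate(zip(poem, REFERENCE)):
        if got != want:
            return i
    return None


def toy_predict(masked: List[str], pos: int) -> str:
    return REFERENCE[pos]


draft = list("床前明日光")
polished = polish(draft, toy_locate, toy_predict)
print("".join(polished))
```

In this sketch a single `locate` call answers both "is polishing needed?" and "where?", which is why returning `None` doubles as the stopping signal.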

Our idea originates from BERT (Devlin et al. 2018) with its two-task learning scheme, and we modify the tasks to be aware of text quality and to obtain appropriate characters to replace the low-quality characters in the text. With these two tasks, we can polish the generated poem draft iteratively, and the polishing process terminates automatically. The main contributions of this paper are summa-

arXiv:1911.13182v1 [cs.CL] 29 Nov 2019
