Deep Learning in Automatic Text Summarization
Som Gupta and S.K Gupta
somi.11ce@gmail.com, guptask_biet@rediffmail.com
Research Scholar AKTU Lucknow, Computer Science Department BIET Jhansi
Abstract—The exponential increase in the amount of data has created a need
for automatic text summarization approaches that reduce manual effort and
save the time spent searching for relevant information. Machine learning,
natural language processing and unsupervised approaches have been widely
and successfully applied to the automatic summarization problem. Deep
learning is an emerging data-driven approach that has outperformed the
traditional approaches mentioned above and, in combination with them,
gives good results in terms of redundancy and coverage. Sequence-to-sequence
(Seq2Seq) encoder-decoder models are the most widely used deep learning
models for summarization, but the need for a huge training dataset is one
of the big challenges in this field, and many researchers are working on
this issue. The aim of this paper is to give a brief overview of recent work
using deep learning in the field of text summarization, give a brief
introduction to the various techniques used with deep learning, and list
the challenges and the datasets used for this task.
Keywords—Recurrent Neural Networks; Convolutional Neural Networks;
Deep Learning; Neural Networks; Attention Models
I INTRODUCTION
Automatic text summarization, the task of shortening the text in a
document without losing its essence, is one of the most challenging
and time-consuming activities because of the complexity of the natural
language processing involved with different kinds of data. A number of
techniques are available to perform automatic text summarization, and
they are broadly classified into extractive and abstractive approaches.
Extractive approaches perform summarization by extracting the important
sentences, mostly by computing features from the text and processing
them with soft computing techniques such as fuzzy logic, genetic
programming, machine learning or neural networks. Abstractive
approaches, in contrast, generate new sentences for inclusion in the
summary, mostly by considering the semantic information present in the
sentences of the text.
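As a concrete illustration of the extractive idea, the toy Python sketch below scores each sentence by the document-wide frequency of its words and keeps the top-scoring ones in their original order. The scoring scheme is a simplified assumption for illustration only, not the method of any system surveyed here.

```python
from collections import Counter

def extractive_summary(text, n_sentences=2):
    """Toy extractive summarizer: score each sentence by the average
    document-wide frequency of its words, then keep the top sentences."""
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    words = [w.lower() for s in sentences for w in s.split()]
    freq = Counter(words)

    def score(s):
        tokens = [w.lower() for w in s.split()]
        return sum(freq[t] for t in tokens) / len(tokens)

    ranked = set(sorted(sentences, key=score, reverse=True)[:n_sentences])
    # Preserve the original sentence order in the output summary
    return ". ".join(s for s in sentences if s in ranked) + "."

doc = ("Deep learning helps summarization. "
       "Deep learning uses neural networks. Cats sleep a lot.")
summary = extractive_summary(doc, n_sentences=2)
```

Real extractive systems replace this frequency score with richer features (position, title overlap, named entities) and a learned model, but the select-and-extract pipeline is the same.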
With the increasing amount of data available due to the emergence of
online platforms, the challenge is to produce automatic summaries that
are grammatically correct, less redundant, coherent and good in
coverage. Deep learning is an emerging machine learning approach that
uses neural networks with multiple hidden layers to perform
summarization, which helps address the above-mentioned challenges. In a
deep learning model, the input is fed to the network and passes through
the hidden layers, where mostly non-linear processing is performed,
before the final output is obtained. Deep learning models also help
reduce the semantic space of the text. Researchers have shown that deep
learning models outperform the existing approaches in terms of both
coherency and linguistic quality.
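The forward pass just described, an input vector flowing through non-linear hidden layers to an output, can be sketched in a few lines of NumPy. The layer sizes, the ReLU activation and the single output score are illustrative assumptions, not a specific architecture from the literature.

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(x):
    # Non-linear activation applied at the hidden layers
    return np.maximum(0.0, x)

def forward(x, layers):
    """Feed input x through successive non-linear hidden layers,
    then a linear output layer (e.g. producing a sentence score)."""
    h = x
    for W, b in layers[:-1]:
        h = relu(h @ W + b)       # hidden layers: non-linear processing
    W_out, b_out = layers[-1]
    return h @ W_out + b_out      # output layer

# Hypothetical sizes: 9 input features, two hidden layers, 1 output score
layers = [(rng.standard_normal((9, 16)), np.zeros(16)),
          (rng.standard_normal((16, 8)), np.zeros(8)),
          (rng.standard_normal((8, 1)), np.zeros(1))]
score = forward(rng.standard_normal(9), layers)
```

In an extractive setting such a score per sentence would be trained against labels indicating summary-worthiness; here the weights are random, so only the data flow is meaningful.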
The paper is organized as follows: Section 2 describes the deep
learning techniques that have been used to perform text summarization.
Section 3 reviews the work done using deep learning techniques,
classified into extractive and abstractive approaches. Section 4
describes the evaluation measures used for summarization. Section 5
lists the challenges and future directions for researchers, and is
followed by the conclusion.
II TECHNIQUES USED
Mostly, Restricted Boltzmann Machines, sequence-to-sequence models
using the encoder-decoder approach, and unsupervised approaches have
been used for summarization; they are described below.
II.1 Restricted Boltzmann Machine (RBM)
The RBM-based architectures used for summarization consist of three
kinds of layers: input, hidden and output. The RBM helps remove
redundant information. Verma et al. [1] used feature-based extraction
along with a Restricted Boltzmann Machine with one hidden layer and 9
perceptrons per layer to create extractive summaries. Liu et al. [2]
used Support Vector Machines (SVM) and Deep Belief Networks to perform
multi-document query-specific text summarization, with RBMs in the
hidden layers. Yao et al. [3] also used an RBM, with 1 visible and 4
hidden layers, to perform summarization.
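As a minimal sketch of the RBM mechanics behind these systems, the code below computes the hidden-layer response to a binary sentence-feature vector and performs one contrastive-divergence (CD-1) weight update. The 9-unit sizes loosely echo the 9 features mentioned above; the learning rate and random weights are illustrative assumptions, not the cited systems' implementations.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(1)

# Toy RBM: 9 visible units (binary sentence features) and 9 hidden units
n_visible, n_hidden = 9, 9
W = 0.01 * rng.standard_normal((n_visible, n_hidden))
b_h = np.zeros(n_hidden)   # hidden biases
b_v = np.zeros(n_visible)  # visible biases

def cd1_step(v0, lr=0.1):
    """One contrastive-divergence (CD-1) step on one feature vector v0.
    Returns the weight update and the hidden-unit probabilities."""
    h0 = sigmoid(v0 @ W + b_h)        # positive phase: hidden activations
    v1 = sigmoid(h0 @ W.T + b_v)      # reconstruction of the visible layer
    h1 = sigmoid(v1 @ W + b_h)        # negative phase
    dW = lr * (np.outer(v0, h0) - np.outer(v1, h1))
    return dW, h0

v = (rng.random(n_visible) > 0.5).astype(float)  # toy binary features
dW, hidden = cd1_step(v)
W += dW
```

The hidden activations act as a compressed, less redundant encoding of the sentence features, which is the role the RBM plays in the summarizers cited above.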
II.2 Sequence-to-Sequence Models
When both the input and the output are sequences of words, the task is
called a sequence-to-sequence problem. In text summarization, the input
is a string, i.e., a sequence of words, and the output is a text
summary, which is again a sequence of words. Sequence-to-sequence
models consist of two parts, an encoder and a decoder. Input is fed to the
International Journal of Computer Science and Information Security (IJCSIS),
Vol. 16, No. 11, November 2018
https://sites.google.com/site/ijcsis/
ISSN 1947-5500