Graph Convolutional Networks for Text Classification
Liang Yao, Chengsheng Mao, Yuan Luo∗
Northwestern University
Chicago, IL 60611
{liang.yao, chengsheng.mao, yuan.luo}@northwestern.edu
Abstract
Text classification is an important and classical problem in
natural language processing. There have been a number of
studies that applied convolutional neural networks (convolution on a regular grid, e.g., a sequence) to classification. However, only a limited number of studies have explored the more flexible graph convolutional neural networks (convolution on a non-grid structure, e.g., an arbitrary graph) for the task. In this work, we propose to use graph convolutional networks for text classification. We build a single text graph for a corpus based on word co-occurrence and document-word relations, then learn
a Text Graph Convolutional Network (Text GCN) for the cor-
pus. Our Text GCN is initialized with one-hot representation
for word and document, it then jointly learns the embeddings
for both words and documents, as supervised by the known
class labels for documents. Our experimental results on mul-
tiple benchmark datasets demonstrate that a vanilla Text GCN
without any external word embeddings or knowledge outper-
forms state-of-the-art methods for text classification. On the
other hand, Text GCN also learns predictive word and document embeddings. In addition, experimental results show that the improvement of Text GCN over state-of-the-art comparison methods becomes more prominent as we lower the percentage of training data, suggesting the robustness of Text GCN to limited training data in text classification.
Introduction
Text classification is a fundamental problem in natural lan-
guage processing (NLP). There are numerous applications
of text classification such as document organization, news
filtering, spam detection, opinion mining, and computa-
tional phenotyping (Aggarwal and Zhai 2012; Zeng et al.
2018). An essential intermediate step for text classification
is text representation. Traditional methods represent text
with hand-crafted features, such as sparse lexical features
(e.g., bag-of-words and n-grams). Recently, deep learning
models have been widely used to learn text representa-
tions, including convolutional neural networks (CNN) (Kim
2014) and recurrent neural networks (RNN) such as long
short-term memory (LSTM) (Hochreiter and Schmidhuber
1997). As CNN and RNN prioritize locality and sequential-
ity (Battaglia et al. 2018), these deep learning models can
∗Corresponding Author
Copyright © 2019, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
capture semantic and syntactic information in local consecutive word sequences well, but may ignore global word co-occurrence in a corpus, which carries non-consecutive and long-distance semantics (Peng et al. 2018).
Recently, a new research direction called graph neural
networks or graph embeddings has attracted wide atten-
tion (Battaglia et al. 2018; Cai, Zheng, and Chang 2018).
Graph neural networks have been effective on tasks thought to have rich relational structure and can preserve the global structural information of a graph in graph embeddings.
In this work, we propose a new graph neural network-
based method for text classification. We construct a single
large graph from an entire corpus, which contains words and
documents as nodes. We model the graph with a Graph Con-
volutional Network (GCN) (Kipf and Welling 2017), a sim-
ple and effective graph neural network that captures high-order neighborhood information. Edges between two word nodes are built from word co-occurrence information, and edges between a word node and a document node are built using the word's frequency in the document and the word's document frequency. We then turn the text classification problem into a node classification problem. The method can achieve strong classification performance with a small proportion of labeled documents and learns interpretable word and document node embeddings; a concrete sketch of this construction follows the contribution list below. Our source code is available at https://github.com/yao8839836/text_gcn. To summarize, our contributions are as follows:
• We propose a novel graph neural network method for text
classification. To the best of our knowledge, this is the
first study to model a whole corpus as a heterogeneous
graph and jointly learn word and document embeddings with graph neural networks.
• Results on several benchmark datasets demonstrate that
our method outperforms state-of-the-art text classifica-
tion methods, without using pre-trained word embeddings
or external knowledge. Our method also learns predictive word and document embeddings automatically.
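To make the construction concrete, below is a minimal Python sketch (using NumPy and scikit-learn) of the graph building and of a single GCN propagation step. The helper names and the specific weighting choices, TF-IDF for document-word edges and positive pointwise mutual information (PMI) over sliding windows for word-word edges, are shown here for illustration; the exact definitions we adopt are given in the Method section.

```python
import numpy as np
from collections import Counter
from itertools import combinations
from sklearn.feature_extraction.text import TfidfVectorizer


def build_text_graph(docs, window=20):
    """Build the adjacency matrix over [documents + words] nodes."""
    tfidf = TfidfVectorizer()
    doc_word = tfidf.fit_transform(docs).toarray()   # TF-IDF document-word weights
    widx = tfidf.vocabulary_                         # term -> column index
    n_doc, n_word = doc_word.shape

    # Slide a fixed-size window over each document to count co-occurrences.
    win_count, pair_count, n_win = Counter(), Counter(), 0
    for doc in docs:
        toks = [t for t in doc.lower().split() if t in widx]
        for s in range(max(1, len(toks) - window + 1)):
            win = set(toks[s:s + window])
            n_win += 1
            win_count.update(win)
            pair_count.update(combinations(sorted(win), 2))

    n = n_doc + n_word
    A = np.eye(n)                                    # self-loops on all nodes
    A[:n_doc, n_doc:] = doc_word                     # document-word edges
    A[n_doc:, :n_doc] = doc_word.T
    for (wi, wj), c in pair_count.items():           # word-word edges: keep PMI > 0
        pmi = np.log(c * n_win / (win_count[wi] * win_count[wj]))
        if pmi > 0:
            i, j = n_doc + widx[wi], n_doc + widx[wj]
            A[i, j] = A[j, i] = pmi
    return A


def gcn_layer(A, H, W):
    """One propagation step: ReLU(D^{-1/2} A D^{-1/2} H W)."""
    d = A.sum(axis=1)                                # degrees (>= 1 via self-loops)
    A_norm = A / np.sqrt(np.outer(d, d))             # symmetric normalization
    return np.maximum(A_norm @ H @ W, 0.0)
```

Stacking two such layers over one-hot (identity) input features and applying a softmax over the document rows gives the node-classification setup described above.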
Related Work
Traditional Text Classification
Traditional text classification studies mainly focus on fea-
ture engineering and classification algorithms. For feature