中文临床文本实体识别：注意力机制CNN-LSTM-CRF模型研究

63 浏览量更新于2024-08-27 1 收藏 1.31MB PDF 举报

"基于注意的CNN-LSTM-CRF在中文临床文本中的实体识别" 这篇研究论文主要探讨了在中文临床文本中进行实体识别的一种新颖方法，即基于注意力的卷积神经网络（CNN）-长短时记忆网络（LSTM）-条件随机场（CRF）模型。临床实体识别是医疗文本处理的基础任务，它对于提取医疗记录中的关键信息至关重要，例如疾病、症状、药物等。然而，尽管近年来在英文临床文本的实体识别上取得了显著进步，但针对其他语言，尤其是中文的研究相对较少。方法部分，研究者提出了一种扩展的深度神经网络架构——注意力机制的CNN-LSTM-CRF。这个模型的核心在于将CNN层添加到输入层之后，用于捕获感兴趣的单词的局部上下文信息。同时，在CRF层之前引入了注意力层，目的是在同一个句子中选择相关的关键词，以提高识别准确性和对语境的理解。这种方法结合了CNN的强大特征提取能力、LSTM的记忆与序列建模功能以及CRF的全局依赖关系建模，旨在优化实体边界检测和分类。为了验证所提方法的有效性，研究者将其与其他两种当前流行的方法进行了对比。这种比较通常包括了性能指标的评估，如精确度、召回率和F1分数，这些指标可以全面反映模型在识别实体时的性能。实验结果可能显示了基于注意力的CNN-LSTM-CRF模型在识别中文临床文本实体方面的优势，可能是由于其能更好地捕捉上下文信息和关注关键元素。此外，论文可能会讨论模型的训练过程，包括数据预处理、模型参数调整、训练策略以及可能面临的挑战，如过拟合问题。研究者可能还分析了不同类型的临床实体在识别过程中的表现，以及模型如何适应不同长度和复杂性的句子。最后，论文可能提出了未来的研究方向，如模型的优化、多语言应用或者将此模型应用于更广泛的医疗文本分析任务。这篇论文为中文临床文本的实体识别提供了一个创新的深度学习框架，通过引入注意力机制，提升了模型在理解和处理复杂临床语境中的效能。这对于推动医疗信息的自动化处理，提高医疗决策支持系统的效率和准确性具有重要意义。

RES E A R C H Open Access

Entity recognition in Chinese clinical text

using attention-based CNN-LSTM-CRF

Buzhou Tang

, Xiaolong Wang

, Jun Yan

and Qingcai Chen

From The Sixth IEEE International Conference on Healthcare Informatics (ICHI 2018)

New York, NY, USA. 4-7 June 2018

Abstract

Background: Clinical entity recognition as a fundamental task of clinical text processing has been attracted a great

deal of attention during the last decade. However, most studies focus on clinical text in English rather than other

languages. Recently, a few researchers have began to study entity recognition in Chinese clinical text.

Methods: In this paper, a novel deep neural network, called attention-based CNN-LSTM-CRF, is proposed to recognize

entities in Chinese clinical text. Attention-based CNN-LSTM-CRF is an extension of LSTM-C RF by introducing a

CNN (convolutional neural network) layer after the input layer to capture local context information of words of

interest and an attention layer before the CRF layer to select relevant words in the same sentence.

Results: In order to evaluate the proposed method, we compare it with other two currently popular methods,

CRF (conditional random field) and LSTM-CRF, on two benchmark datasets. One of the datasets is publically

available and only contains contiguous clinical entities, and the other one is constructed by us and contains

contiguous and discontiguous clinical entities. Experimental resul ts show that attent ion-based CNN- LSTM-CRF

outperforms CRF and LSTM-CRF.

Conclusions: CNN and attention mechanism are in dividually beneficial to LSTM-CRF-based Chinese clinical entity

recognition system, no matter whether contiguous clinical entities are considered. The conribution of attention mechanism

is greater than CNN.

Keywords: Chinese clinical entity recognition, Neural network, Convolutional neural network, Long-short term

memory, Condi tional random field

Introduction

With rapid development of electronic medical information

systems, more and more electronic medical records

(EMRs) are available for medical research and application.

In EMRs, plenty of useful information is embedded in

clinical text. The first step to use clinical text is clinical

entity recognition that finds which words form clinical

entities and which type each entity belongs to.

In the last decades, a large number of methods have

been proposed for clinical entity recognitio n. The

methods includes early rule-based methods, machine

learning methods based on manually-crafted features in

past a few years and recently deep neural networks. The

most popular machine learning method used for clinical

entity recognition is conditional random field (CRF) [1],

and the most popular deep neural network is

LSTM-CRF [2]. However, most studies focus on entity

recognition in English clinical text rather than other

languages. It is necessary to investigate the latest

methods for entity recognition in other languages, for

example Chinese.

To promote de velopment of entity recognition in

Chinese clinical tex t, the organizers of China conference

on knowledge graph and semantic computing

(CCKS) launched a c hallenge was launched in 2017

[3]. The challenge organizer provided a dataset

(called CCKS2017_CNER) with only contiguous clinical

entities following the guideline of i2b2 (Informatics for

* Correspondence: qingcai.chen@gmail.com

Key Laboratory of Network Oriented Intelligent Computation, Harbin

Institute of Technology, (Shenzhen), Shenzhen 518055, China

Full list of author information is available at the end of the article

International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and

reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to

the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver

(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Tang et al. BMC Medical Informatics and Decision Making 2019, 19(Suppl 3):74

https://doi.org/10.1186/s12911-019-0787-y

下载后可阅读完整内容，剩余8页未读，立即下载

weixin_38691199

粉丝: 1
资源: 940

中文临床文本实体识别：注意力机制CNN-LSTM-CRF模型研究

通过基于注意的CNN-LSTM-CRF进行中国临床实体识别

LSTM-CNNs-CRF.rar

双向LSTM-CNN的命名实体识别：双向LSTM-CNN的命名实体识别

CNN-LSTM与EnDecoder框架的CNN-LSTM有何区别，优缺点

FL-CNN-LSTM

cnn-lstm相比lstm优势

串联cnn-lstm网络相对于并联在eeg分类中有什么缺点

基于STN-CNN-LSTM-CTC的车牌识别代码

pytorch CNN-LSTM

matlab cnn-lstm

最新资源