Deep Neural Networks and Adversarial Multitask Learning: Challenges and Solutions for Improving Technology Entity Recognition


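Technology entity recognition with adversarial multitask learning is a natural language processing technique of growing importance in the analysis of academic literature. The paper summarized here, "Adversarial Multitask Learning for Technology Entity Recognition", uses an adversarial learning strategy to address problems specific to technology entity recognition (TER). Technology entities are the technical terms, tools, methods, and similar concepts mentioned in papers; they underpin later work such as technology foresight, technology roadmapping, and technological innovation analysis.

TER is harder than ordinary named entity recognition (NER). The first challenge is feature extraction. To address it, the authors use a deep neural network (DNN), whose representation learning captures abstract features of the text and improves recognition accuracy. The second challenge is the lack of annotated data: because technical fields are specialized and diverse, comprehensive and accurate entity annotations are hard to obtain. Adversarial multitask learning trains several related tasks together so that they share feature representations, which improves generalization and reduces dependence on the data of any single task. The third challenge is cross-domain variation. Different technical fields have their own terminology and phrasing, so the model must adapt well across domains. With adversarial learning, the model can be trained to keep its shared features general across domains while still being tuned to a specific domain, which improves recognition accuracy.

In summary, the paper investigates how adversarial multitask learning can improve TER, covering better feature extraction and the handling of data scarcity and domain differences. The authors propose combining deep learning with multitask learning to raise TER performance and to give later high-level technology analysis a firmer basis.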

Adversarial Multitask Learning for Technology Entity Recognition

Hui Gao 1,2,*, Ting Wang, Wei Luo, Lin Gui

1. School of Computer, National University of Defense Technology, Changsha, China

2. Information Research Center of Military Science, Beijing, China

*corresponding author

gaohui_baixiang@163.com

Abstract—When reading scholarly papers, the first thing researchers want to know is which tasks and processes the papers describe, which materials they use, etc. In this paper, these concepts are referred to as technology entities. Technology entity recognition (TER) is the basis for subsequent high-level technology analysis, such as technical foresight, technology roadmapping, and technological innovation. However, the challenges TER faces are much greater than those of normal named entity recognition (NER). Those challenges include the difficulty of feature extraction, the lack of annotated data, and the differences between domains. To deal with the first challenge, we use a deep neural network to extract features from text. For the other two challenges, we propose an adversarial multitask learning method. Existing knowledge from a large dataset in a source domain is transferred to implement TER in a target domain with only a small number of labeled samples. The experiments show that the proposed method significantly outperforms the comparison systems.

Keywords—technology entity recognition, multitask learning, domain adaptation, transfer learning

I. INTRODUCTION

Empirical research requires gaining and maintaining an understanding of the body of work in a specific area. For example, typical questions researchers face are which papers describe which tasks and processes, which materials they use, and how those relate to one another [1]. The key and basic task tackled here is the mention-level identification and classification of technology entities, i.e., technology entity recognition (TER).
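To make "mention-level identification and classification" concrete, the following minimal sketch decodes per-token BIO tags into typed mention spans. The BIO scheme, the `decode_bio` helper, the label names `Task`, `Process`, and `Material` (modeled on the concepts named in the abstract), and the example sentence are all assumptions for illustration; the excerpt does not state which tagging scheme the authors use.

```python
from typing import List, Tuple

Span = Tuple[int, int, str]  # (start token index, end index exclusive, entity type)

def decode_bio(tags: List[str]) -> List[Span]:
    """Turn per-token BIO tags into mention-level (start, end, type) spans."""
    spans: List[Span] = []
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel "O" flushes the last span
        # An "I-" tag continues the open span only if the entity type matches.
        if tag.startswith("I-") and label == tag[2:]:
            continue
        if label is not None:
            spans.append((start, i, label))
            start, label = None, None
        if tag.startswith("B-") or tag.startswith("I-"):
            # A stray "I-" without a matching "B-" is treated as a span start.
            start, label = i, tag[2:]
    return spans

# Hypothetical tagged sentence (labels are illustrative, not from the paper).
tokens = ["We", "use", "a", "deep", "neural", "network", "for", "entity", "recognition"]
tags = ["O", "O", "O", "B-Process", "I-Process", "I-Process", "O", "B-Task", "I-Task"]
mentions = [(" ".join(tokens[s:e]), t) for s, e, t in decode_bio(tags)]
print(mentions)
```

Identification corresponds to finding the span boundaries, and classification to the type attached to each span.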

In the research area of information extraction, there are many expressions similar to "technology entity", such as key phrase, technical concept, and technical term. In recent years, higher-level analysis based on TER has received great interest, such as tracking or predicting influence [2,3], technology forecasting [4], and the study of research communities [5].

However, TER is much more challenging than normal named entity recognition (NER), e.g., person name recognition. TER faces several difficult problems: 1) Technology entities lack regularity, and new and out-of-vocabulary words appear frequently, so it is difficult for traditional feature engineering to extract high-quality features. 2) The definition of “technology” in information extraction is not clear. Related studies interpret technology differently and inconsistently, depending on their research backgrounds and purposes. The result is a lack of authoritative labeled corpora and evaluation criteria, which makes it difficult to use supervised machine learning for TER. 3) Even though several labeled datasets have been released, such as the SemEval-2017 test corpus covering computer science, physics, etc., the domains involved are limited. Since technology entities vary significantly in composition, indicators, and context between domains, it is difficult to train a TER model for a specific domain using corpora from other domains.

In this paper, we present an adversarial multitask neural network for TER. By translating the data into compact intermediate representations akin to principal components, a deep neural network can learn features directly from the data without manual feature extraction. The proposed method uses multitask transfer learning to implement domain adaptation. By studying the distribution of samples in the source and target domains, existing knowledge is transferred to implement TER on the target domain with only a small amount of labeled data, or even without labeled data. Furthermore, an adversarial task is employed to discriminate whether the features of the source and target domains conform to the same distribution in the shared space, which can ensure a better transfer effect.
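One common way to realize such an adversarial task is a gradient reversal step: a domain discriminator is trained to tell source features from target features, while the shared encoder is updated in the opposite direction so that the two become harder to distinguish. The toy sketch below shows a single update on a one-dimensional model. The excerpt does not say which formulation the authors use, so the function `adversarial_step`, the scalar "encoder" weight `w`, the discriminator weight `v`, and the parameters `lr` and `lam` are all hypothetical.

```python
import math

def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))

def adversarial_step(w: float, v: float, x: float, domain: int,
                     lr: float = 0.1, lam: float = 1.0):
    """One update of a toy 1-D model (illustrative assumption, not the paper's model).

    h = w * x          shared feature (stand-in for the shared encoder)
    p = sigmoid(v * h) discriminator's probability that x comes from the target domain
    domain             1 for a target-domain sample, 0 for a source-domain sample
    """
    h = w * x
    p = sigmoid(v * h)
    # Binary cross-entropy of the domain discriminator.
    loss = -(domain * math.log(p) + (1 - domain) * math.log(1 - p))
    grad_v = (p - domain) * h      # d loss / d v
    grad_w = (p - domain) * v * x  # d loss / d w, back through the shared feature
    new_v = v - lr * grad_v        # discriminator descends the domain loss
    new_w = w + lr * lam * grad_w  # encoder ascends it (reversed gradient)
    return new_w, new_v, loss

new_w, new_v, loss = adversarial_step(w=0.5, v=1.0, x=2.0, domain=1)
print(new_w, new_v, loss)
```

In a full model the encoder would also receive the gradients of the entity recognition losses from both domains; the reversed domain gradient shown here is only the adversarial part, weighted by `lam`.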

The main contributions of this paper are:

• We focus on technology entity recognition from scientific literature, which is a challenging information extraction task. We analyze the difficulties faced by TER and possible solutions, and construct a comprehensive architecture for the task.

• We present an adversarial multitask learning model that implements TER with only limited labeled samples.

• We conduct extensive experiments on the SemEval-2017 Task 10 corpus and compare the results against different baselines; the proposed method shows outstanding performance.
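Comparisons of this kind are often scored at the mention level with precision, recall, and F1 over exactly matching spans. The sketch below assumes that scoring scheme; the excerpt does not state the metric the authors report, and the `span_prf` helper and the example spans are hypothetical.

```python
from typing import Iterable, Tuple

Span = Tuple[int, int, str]  # (start token index, end index exclusive, entity type)

def span_prf(gold: Iterable[Span], pred: Iterable[Span]) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and F1 over typed mention spans."""
    gold_set, pred_set = set(gold), set(pred)
    tp = len(gold_set & pred_set)  # spans with identical boundaries and type
    precision = tp / len(pred_set) if pred_set else 0.0
    recall = tp / len(gold_set) if gold_set else 0.0
    f1 = 2 * precision * recall / (precision + recall) if tp else 0.0
    return precision, recall, f1

# Hypothetical gold and predicted spans for one document.
gold = [(3, 6, "Process"), (7, 9, "Task"), (12, 13, "Material")]
pred = [(3, 6, "Process"), (7, 8, "Task")]
precision, recall, f1 = span_prf(gold, pred)
print(precision, recall, f1)
```

Here the second prediction has the right type but the wrong boundary, so under exact matching it counts as both a false positive and a missed gold span.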

II. RELATED WORK

A. Technology Entity

The American Heritage Science Dictionary defines technology as: a. The application of science, especially to industrial or commercial objectives. b. The scientific method and material used to achieve a commercial or industrial objective. It can be seen that the definition of ‘technology’ is very broad, and previous research uses many different expressions for ‘technology’, such as technology concept [6], technology entity [7], technical term [8,9], and key phrase [6]. Eytan[6] thought that the technical

https://www.ahdictionary.com/word/search.html?q=Technology
