Knowledge-Based Systems 104 (2016) 52–61
Multi-label learning with label-specific feature reduction
Suping Xu a,e,f, Xibei Yang a,b,e,f,∗, Hualong Yu a, Dong-Jun Yu c, Jingyu Yang c, Eric C.C. Tsang d
a School of Computer Science and Engineering, Jiangsu University of Science and Technology, Zhenjiang 212003, PR China
b School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094, PR China
c Key Laboratory of Intelligent Perception and Systems for High-Dimensional Information, Nanjing University of Science and Technology, Ministry of Education, Nanjing 210094, PR China
d Faculty of Information Technology, Macau University of Science and Technology, 519020, Macau
e Intelligent Information Processing Key Laboratory of Shanxi Province, Shanxi University, Taiyuan 030006, PR China
f Key Laboratory of Oceanographic Big Data Mining and Application of Zhejiang Province, Zhejiang Ocean University, Zhoushan 316022, PR China
Article info
Article history:
Received 22 December 2015
Revised 10 March 2016
Accepted 13 April 2016
Available online 25 April 2016
Keywords:
Feature reduction
Fuzzy rough set
Label-specific feature
Multi-label learning
Sample selection
Abstract
In multi-label learning, since different labels may have some distinct characteristics of their own, a multi-label learning approach with label-specific features, named LIFT, has been proposed. However, the construction of label-specific features may increase the feature dimensionality and leave a large amount of redundant information in the feature space. To alleviate this problem, a multi-label learning approach called FRS-LIFT is proposed, which implements label-specific feature reduction with the fuzzy rough set. Furthermore, based on the idea of sample selection, another multi-label learning approach, FRS-SS-LIFT, is also presented, which effectively reduces the computational complexity of label-specific feature reduction. Experimental results on 10 real-world multi-label data sets show that, compared with LIFT, our methods can not only reduce the dimensionality of the label-specific features, but also achieve satisfactory performance against several popular multi-label learning approaches.
©2016 Elsevier B.V. All rights reserved.
1. Introduction
Nowadays, the multi-label learning problem has received increasing attention in real-world applications. For example, in semantic annotation of images [3,16,26,49], a picture can be annotated as camel, desert and landscape. In text categorization [5,11,17,29], a document may belong to several given topics, such as economics, finance or GDP. In bioinformatics [6,13,50], each gene may be associated with a set of functional classes, such as metabolism, transcription and protein synthesis. In all of the cases above, each sample may be associated with more than one label simultaneously, and the predefined labels for different samples are not mutually exclusive but may overlap. This situation is distinct from traditional single-label learning, where the predefined labels are mutually exclusive and each sample belongs to exactly one label.
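The distinction above can be made concrete by encoding each sample's label set as a row of a binary indicator matrix, a representation commonly used in multi-label learning. A minimal sketch follows; the sample contents and label names are illustrative only, not taken from any of the cited data sets:

```python
# Multi-label data: each sample carries a *set* of labels, encoded as a
# row of a binary indicator matrix Y (1 = label present, 0 = absent).
labels = ["camel", "desert", "landscape", "economics"]

# Label sets for three hypothetical samples; the sets may overlap,
# unlike the mutually exclusive classes of single-label learning.
label_sets = [
    {"camel", "desert", "landscape"},   # an image with three labels
    {"economics"},                      # a document with a single topic
    {"desert", "economics"},            # labels shared across domains
]

# Build the indicator matrix row by row.
Y = [[1 if lab in s else 0 for lab in labels] for s in label_sets]

for row in Y:
    print(row)
# -> [1, 1, 1, 0]
# -> [0, 0, 0, 1]
# -> [0, 1, 0, 1]
```

In this form, single-label learning is simply the special case where every row of Y sums to one.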
Over the last decade, many multi-label learning approaches have been proposed [12,28,58]. Generally, the existing methods can be grouped into two main categories [43], i.e., algorithm
∗ Corresponding author at: School of Computer Science and Engineering, Jiangsu University of Science and Technology, Zhenjiang 212003, PR China.
E-mail addresses: supingxu@yahoo.com (S. Xu), yangxibei@hotmail.com (X. Yang), yuhualong@just.edu.cn (H. Yu), njyudj@njust.edu.cn (D.-J. Yu), yangjy@mail.njust.edu.cn (J. Yang), cctsang@must.edu.mo (E.C.C. Tsang).
adaptation methods and problem transformation methods. Algorithm adaptation methods extend specific single-label learning algorithms to directly handle multi-label data by modifying some constraint conditions; examples include AdaBoost.MH [40], ML-kNN [59], MLNB [60], and RankSVM [9]. Problem transformation methods transform the multi-label task into one or more corresponding single-label tasks and then handle them one by one with traditional methods. Well-known problem transformation methods include binary relevance (BR), label power set (LP) and pruned problem transformation (PPT). BR [3] learns a binary classifier for each label independently and predicts each label separately, so it severs the relationships among different labels. LP [44] treats each unique set of labels that occurs in a multi-label training set as a new single-label multi-value class. Although this method considers the correlations among different labels, it easily leads to high time consumption, since the number of new classes increases exponentially with the number of labels. Meanwhile, some new classes supported by only a few samples may lead to a class imbalance problem. PPT [34] either abandons the new classes associated with extremely small numbers of samples, or reassigns those samples to new labels that form acceptable classes; however, the abandoned classes cause a loss of multi-label information. Although the above methods have achieved good performance in multi-label learning, they make use of the same features to achieve
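The BR and LP transformations described above can be sketched in a few lines of plain Python; this is a hedged illustration of the two transformations on a toy binary label matrix, not the implementations used in the cited works:

```python
# Problem transformation sketches on a binary label matrix Y
# (rows = samples, columns = labels).
Y = [
    [1, 0, 1],   # sample 0: label set {0, 2}
    [1, 0, 1],   # sample 1: same label set as sample 0
    [0, 1, 0],   # sample 2: label set {1}
]

# Binary relevance (BR): the j-th binary task is simply column j of Y,
# learned independently, so correlations among labels are discarded.
br_targets = [[row[j] for row in Y] for j in range(len(Y[0]))]

# Label power set (LP): map every distinct label set to a fresh class id.
# The number of classes can grow exponentially with the number of labels,
# and rare label sets yield tiny classes -- the imbalance problem that
# PPT addresses by pruning or reassigning them.
class_of = {}
lp_targets = [class_of.setdefault(tuple(row), len(class_of)) for row in Y]

print(br_targets)  # -> [[1, 1, 0], [0, 0, 1], [1, 1, 0]]
print(lp_targets)  # -> [0, 0, 1]: two distinct label sets, two classes
```

Each list in `br_targets` would be fed to its own single-label binary learner, while `lp_targets` would be fed to one multi-class learner.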
http://dx.doi.org/10.1016/j.knosys.2016.04.012
0950-7051/© 2016 Elsevier B.V. All rights reserved.