Multi-Label Lazy Associative Classification: Exploiting Label Dependencies and Covering Small Disjuncts

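Summary of "Multi-Label Lazy Associative Classification". Most current classification research focuses on single-label classification, in which each instance is associated with exactly one label. In practice, applications such as gene functional prediction and text categorization often allow an instance to be associated with several labels at once, which is the multi-label classification setting. Multi-label classification generalizes single-label classification, and that generality makes it harder to solve. Despite its importance, research on the problem is still lacking. A common approach learns an independent binary classifier for each label, which ignores dependencies among labels. In addition, when the number of label combinations is large, several small disjuncts may appear, and neglecting them can degrade classification accuracy.

The paper proposes a multi-label lazy associative classifier that progressively exploits dependencies among labels. Because its lazy strategy induces the classification model on an instance basis, it provides better coverage of small disjuncts. Gains of up to 24% are observed when it is compared against state-of-the-art multi-label classifiers. The authors are Adriano Veloso, Wagner Meira Jr., Marcos Gonçalves, and Mohammed Zaki, from the Computer Science Department of the Universidade Federal de Minas Gerais, Brazil, and the Computer Science Department of Rensselaer Polytechnic Institute, USA. The approach is relevant to real-world multi-label data sets in areas such as bioinformatics and text mining, particularly when many label combinations occur.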

Multi-Label Lazy Associative Classification ⋆

Adriano Veloso, Wagner Meira Jr., Marcos Gonçalves, and Mohammed Zaki

Computer Science Department, Universidade Federal de Minas Gerais, Brazil
{adrianov,meira,mgoncalv}@dcc.ufmg.br

Computer Science Department, Rensselaer Polytechnic Institute, USA
zaki@cs.rpi.edu

Abstract. Most current work on classification has been focused on learning from a set of instances that are associated with a single label (i.e., single-label classification). However, many applications, such as gene functional prediction and text categorization, may allow the instances to be associated with multiple labels simultaneously. Multi-label classification is a generalization of single-label classification, and its generality makes it much more difficult to solve.

Despite its importance, research on multi-label classification is still lacking. Common approaches simply learn independent binary classifiers for each label, and do not exploit dependencies among labels. Also, several small disjuncts may appear due to the possibly large number of label combinations, and neglecting these small disjuncts may degrade classification accuracy. In this paper we propose a multi-label lazy associative classifier, which progressively exploits dependencies among labels. Further, since in our lazy strategy the classification model is induced in an instance-based fashion, the proposed approach can provide a better coverage of small disjuncts. Gains of up to 24% are observed when the proposed approach is compared against the state-of-the-art multi-label classifiers.

1 Introduction

The classification problem is to build a model, which, based on external observations, assigns an instance to one or more labels. A set of examples is given as the training set, from which the model is built. A typical assumption in classification is that labels are mutually exclusive, so that an instance can be mapped to only one label. However, due to ambiguity or multiplicity, it is quite natural that most of the applications violate this assumption, allowing instances to be mapped to multiple labels simultaneously. For example, a movie being mapped to action or adventure, or a song being classified as rock or ballad, could all lead to violations of the single-label assumption.

Multi-label classification consists in learning a model from instances that may be associated with multiple labels, that is, labels are not assumed to be mutually exclusive. Most of the proposed approaches [7,1,3] for multi-label classification employ heuristics, such as learning independent classifiers for each label, and employing ranking and thresholding schemes for classification. Although simple, these heuristics do not deal with important issues such as small disjuncts and correlated labels.
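The per-label heuristic described above can be illustrated with a minimal sketch. This is not the classifier proposed in the paper; it is an assumed toy version of the baseline being criticized, in which every label is scored independently and a threshold decides which labels are assigned. The function names (`train`, `score`, `predict`), the averaging scorer, the 0.5 threshold, and the movie keywords are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def train(instances):
    """Count feature and (feature, label) occurrences.

    `instances` is a list of (features, labels) pairs, both sets of strings.
    Every label is counted on its own; no label-to-label statistics are kept.
    """
    labels = set()
    feature_count = defaultdict(int)
    joint_count = defaultdict(int)
    for features, instance_labels in instances:
        labels |= instance_labels
        for f in features:
            feature_count[f] += 1
            for label in instance_labels:
                joint_count[(f, label)] += 1
    return labels, feature_count, joint_count


def score(model, features, label):
    """Average, over features seen in training, of the estimated P(label | feature)."""
    _, feature_count, joint_count = model
    known = [f for f in features if feature_count.get(f, 0) > 0]
    if not known:
        return 0.0
    total = sum(joint_count.get((f, label), 0) / feature_count[f] for f in known)
    return total / len(known)


def predict(model, features, threshold=0.5):
    """Assign every label whose independent score reaches the threshold."""
    labels = model[0]
    return {label for label in labels if score(model, features, label) >= threshold}


# Hypothetical toy training set: movie keywords mapped to genre labels.
training = [
    ({"explosion", "chase"}, {"action"}),
    ({"explosion", "jungle"}, {"action", "adventure"}),
    ({"jungle", "treasure"}, {"adventure"}),
    ({"treasure", "map"}, {"adventure"}),
]
model = train(training)

print(sorted(predict(model, {"explosion", "jungle"})))  # ['action', 'adventure']
print(sorted(predict(model, {"treasure", "map"})))      # ['adventure']
```

In this sketch the score for one label never looks at the scores or presence of other labels, so a correlation between labels cannot influence the prediction, and a label combination that occurs only once in training contributes very little evidence. These are the two limitations the paragraph attributes to such heuristics.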

⋆ This research was sponsored by UOL (www.uol.com.br) through its UOL Bolsa Pesquisa program, process number 20060519184000a.
