Weakly Supervised Learning of Part Selection Model with
Spatial Constraints for Fine-Grained Image Classification
Xiangteng He, Yuxin Peng∗
Institute of Computer Science and Technology, Peking University
Beijing 100871, China
pengyuxin@pku.edu.cn
Abstract
Fine-grained image classification, which aims to recognize hundreds of
sub-categories belonging to the same basic-level category, is challenging
due to the large intra-class variance and small inter-class variance.
Since two different sub-categories are distinguished only by the subtle
differences in some specific parts, semantic part localization is crucial for
fine-grained image classification. Most previous works improve
the accuracy by localizing the semantic parts, but rely
heavily on object or part annotations of images,
whose labeling is costly. Recently, some researchers
have begun to recognize sub-categories via weakly supervised
part detection instead of using the expensive annotations.
However, these works ignore the spatial relationship
between the object and its parts as well as the interaction among
the parts, both of which are helpful for part selection.
Therefore, this paper proposes a weakly supervised part selection
method with spatial constraints for fine-grained image
classification, which is free of any bounding box or
part annotations. We first learn a whole-object detector automatically
to localize the object by jointly using saliency
extraction and co-segmentation. Then two spatial constraints
are proposed to select the discriminative parts. The first spatial
constraint, called the box constraint, defines the relationship
between the object and its parts, ensuring that the
selected parts lie inside the object region and
have the largest overlap with it. The second
spatial constraint, called the parts constraint, defines the relationship
among the object's parts, and reduces the parts' overlap with
each other to avoid information redundancy and ensure
that the selected parts are the most discriminative ones compared with
other sub-categories. Combining the two spatial constraints promotes part
selection significantly and achieves a notable improvement
in fine-grained image classification. Experimental results
on the CUB-200-2011 dataset demonstrate the superiority
of our method even compared with methods that use
expensive annotations.
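To make the two constraints concrete, the following sketch is our own illustration, not the paper's implementation: the box representation, the greedy selection scheme, and the trade-off weight `alpha` are all assumptions. It scores each candidate part box by its overlap with the detected object box (box constraint) and penalizes overlap with already-selected parts (parts constraint):

```python
# Hypothetical sketch of the two spatial constraints; boxes are (x1, y1, x2, y2).
def area(box):
    return max(0, box[2] - box[0]) * max(0, box[3] - box[1])

def intersection(a, b):
    # Area of the overlapping region of two boxes (0 if disjoint).
    return area((max(a[0], b[0]), max(a[1], b[1]),
                 min(a[2], b[2]), min(a[3], b[3])))

def box_constraint(part, obj):
    # Fraction of the part lying inside the object region;
    # 1.0 means the part is entirely within the object box.
    return intersection(part, obj) / area(part)

def parts_constraint(part, selected):
    # Redundancy penalty: total overlap with already-selected parts.
    return sum(intersection(part, p) / area(part) for p in selected)

def select_parts(candidates, obj, k=2, alpha=1.0):
    # Greedily pick k parts: high overlap with the object,
    # low overlap with each other (alpha trades off the two terms).
    selected = []
    for _ in range(k):
        best = max(
            (c for c in candidates if c not in selected),
            key=lambda c: box_constraint(c, obj)
                          - alpha * parts_constraint(c, selected),
        )
        selected.append(best)
    return selected
```

Under this sketch, a candidate outside the object box scores 0 on the box constraint, and two heavily overlapping candidates cannot both be selected, which mirrors the redundancy-avoidance goal described above.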
Introduction
Fine-grained image classification is an extremely challenging
task, which aims to distinguish objects in subordinate
classes, such as bird types (Wah et al. 2011), dog species
(Khosla et al. 2011), plant breeds (Angelova and Zhu 2013)
∗Corresponding author.
Copyright © 2017, Association for the Advancement of Artificial
Intelligence (www.aaai.org). All rights reserved.
and aircraft models (Maji et al. 2013), etc. An inexperienced
person can easily recognize basic-level categories such as
birds, horses and dogs, since they vary a lot in appearance.
Such a person may know several kinds of birds, but would find it
very difficult to recognize 200 or even more sub-categories. For
example, it is extremely hard for an inexperienced person
to distinguish between Herring Gull and Slaty-backed Gull
whose appearances are very similar, as both have a
gray back and pink legs. These subordinate classes share the
same global appearance, and are often distinguished by the
subtle differences in their parts (e.g. Herring Gull and Slaty-backed
Gull are distinguished by the color of the back, the
latter's being darker). Therefore, the object and its salient parts
are crucial for fine-grained image classification.
Since the discriminative features are mainly localized on
the object and its parts, most existing works follow the
pipeline: first localizing the object or its parts, and then
extracting discriminative features for fine-grained image
classification. As the fine-grained image classification datasets
(e.g. CUB-200-2011 (Wah et al. 2011)) mostly have the
detailed annotations like bounding box and part locations,
early works directly use the detailed annotations at both the
training and testing stages. The works of (Chai, Lempitsky,
and Zisserman 2013; Yang et al. 2012) use the provided
bounding box to learn part detectors in an unsupervised or
latent manner. Several methods even use the part annota-
tions (Berg and Belhumeur 2013; Xie et al. 2013). Since
the annotations of testing images are not available in
practical applications, some works use the object or part
annotations only at the training stage and no annotations
at the testing stage. Bounding box and part annotations
are directly used in the training phase to learn a strongly
supervised deformable part-based model (Zhang et al. 2013;
Azizpour and Laptev 2012), or directly used to fine-tune a
pre-trained Convolutional Neural Network (CNN) (Branson et al.
2014). Furthermore, Krause et al. (2015) use the bounding
box only at the training stage to learn the part detectors,
then localize the parts automatically at the testing
stage. Recently, some promising works attempt
to learn the part detectors under the weakly supervised condition,
i.e., the bounding box and part annotations are not
used at either the training or testing stage. These works make it
possible to put fine-grained image classification into practical
applications. Neural Activation Constellations Part Model
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17)