Asymmetric Co-Teaching: Unsupervised Cross-Domain Person Re-Identification

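Asymmetric Co-Teaching is a method for unsupervised cross-domain person re-identification (re-ID). It targets the problem that deep models perform well on the source domain but generalize poorly to an unseen target domain. Person re-ID is challenging because of the high variance among samples of the same identity and across imaging conditions. The authors note that although deep learning has reached remarkable accuracy in fixed scenes (the source domain), existing methods often fall short on an unlabeled, unseen target domain, i.e., the domain adaptation problem.

A common remedy is to assign pseudo labels to the unlabeled target images by clustering and then retrain the model with those labels. However, clustering tends to introduce noisy labels and to discard low-confidence samples as outliers, which can hinder retraining and limit generalization.

To address this, the authors propose the Asymmetric Co-Teaching strategy, which adds a sample filtering step after clustering. This step separates out samples whose pseudo labels are likely to be noisy, reducing the effect of label noise on training. Two models cooperate asymmetrically: each selects samples with probably clean labels for the other, while one model receives samples that are as pure as possible and the other receives samples that are as diverse as possible. The selected training samples can therefore be both clean and varied, and the two models improve each other iteratively.

According to the abstract, extensive experiments show that the framework consistently benefits most clustering-based methods and improves the state-of-the-art adaptation accuracy. The method is relevant to practical applications such as surveillance and intelligent security systems, which often need to identify and track pedestrians across different environments and conditions.

In short, Asymmetric Co-Teaching improves how samples are handled after clustering, and thereby improves the generalization of deep models on cross-domain person re-ID when the target domain has no labels.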

Asymmetric Co-Teaching for Unsupervised Cross-Domain Person Re-Identification

Fengxiang Yang,1∗ Ke Li, Zhun Zhong, Zhiming Luo,2† Xing Sun,3† Hao Cheng, Xiaowei Guo, Feiyue Huang, Rongrong Ji, Shaozi Li

1 Artificial Intelligence Department, Xiamen University, China

2 Post Doctoral Mobile Station of Information and Communication Engineering, Xiamen University, China

3 Tencent Youtu Lab, Shanghai, China

Abstract

Person re-identification (re-ID) is a challenging task due to the high variance within identity samples and imaging conditions. Although recent advances in deep learning have achieved remarkable accuracy in settled scenes, i.e., the source domain, few works can generalize well on the unseen target domain. One popular solution is assigning unlabeled target images with pseudo labels by clustering, and then retraining the model. However, clustering methods tend to introduce noisy labels and discard low confidence samples as outliers, which may hinder the retraining process and thus limit the generalization ability. In this study, we argue that by explicitly adding a sample filtering procedure after the clustering, the mined examples can be much more efficiently used. To this end, we design an asymmetric co-teaching framework, which resists noisy labels by cooperating two models to select data with possibly clean labels for each other. Meanwhile, one of the models receives samples as pure as possible, while the other takes in samples as diverse as possible. This procedure encourages that the selected training samples can be both clean and miscellaneous, and that the two models can promote each other iteratively. Extensive experiments show that the proposed framework can consistently benefit most clustering based methods, and boost the state-of-the-art adaptation accuracy. Our code is available at https://github.com/FlyingRoastDuck/ACT_AAAI20.
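The selection scheme described in the abstract can be illustrated with a minimal sketch. It assumes the small-loss criterion of Co-Teaching (samples on which a model has low loss are treated as probably clean), and it assumes that the "pure" model is fed from clustered inliers while the "diverse" model is fed from clustering outliers. The function names, the `keep_ratio` parameter, and the toy loss values are hypothetical and are not taken from the authors' code.

```python
def select_small_loss(losses, keep_ratio):
    """Return indices of the keep_ratio fraction of samples with the smallest loss."""
    if not losses:
        return []
    n_keep = max(1, int(round(keep_ratio * len(losses))))
    # sorted() is stable, so ties keep their original order.
    order = sorted(range(len(losses)), key=lambda i: losses[i])
    return order[:n_keep]


def asymmetric_exchange(inlier_losses, outlier_losses, keep_ratio=0.5):
    """One selection round between two models (assumed roles, see lead-in).

    inlier_losses:  per-sample losses on clustered inliers, computed by the
                    model that will NOT train on them; its small-loss picks
                    become the "pure" input of the other model.
    outlier_losses: per-sample losses on clustering outliers, computed by the
                    other model; its picks become the "diverse" input.
    """
    for_pure_model = select_small_loss(inlier_losses, keep_ratio)
    for_diverse_model = select_small_loss(outlier_losses, keep_ratio)
    return for_pure_model, for_diverse_model


# Toy per-sample losses (hypothetical values).
inlier_losses = [0.2, 1.5, 0.1, 0.9]
outlier_losses = [2.0, 0.4, 1.1, 0.3]

for_pure, for_diverse = asymmetric_exchange(inlier_losses, outlier_losses, keep_ratio=0.5)
print(for_pure)     # indices into the inlier set
print(for_diverse)  # indices into the outlier set
```

The two inputs are deliberately drawn from different pools, which is the asymmetry: a symmetric scheme would score and exchange samples from the same set for both models.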

1 Introduction

Person re-identification (re-ID) (Sun et al. 2018; Zheng, Yang, and Hauptmann 2016; Li, Zhu, and Gong 2018b) aims to locate the target person in surveillance videos with a given probe image. With the rapid evolution of deep learning models, the accuracy of person re-ID has been greatly boosted on public datasets. However, models trained on the source domain often suffer from domain shifts, leading to a performance decline on a different target domain.

To alleviate this issue, recent works (Zhong et al. 2019b; Zhong et al. 2018b) focus on unsupervised domain adaptation (UDA), which aims to transfer knowledge from the labeled source domain to the unlabeled target domain. These works mainly fall into two categories: distribution aligning (Wei et al. 2018; Deng et al. 2018; Chang et al. 2019; Lin et al. 2018; Wang et al. 2018) and target pseudo label discovering (Fan, Zheng, and Yang 2018; Song et al. 2018; Li, Zhu, and Gong 2018a). The former aims to reduce the distribution gap between domains in a common space, such as image-level (Wei et al. 2018; Deng et al. 2018) and attribute-level (Chang et al. 2019; Lin et al. 2018; Wang et al. 2018) spaces. The latter attempts to leverage the underlying relations among target samples and predict pseudo labels for model retraining, e.g., assigning pseudo labels based on clustering (Fan, Zheng, and Yang 2018; Song et al. 2018; Li, Zhu, and Gong 2018a) and k-nearest neighbors (Zhong et al. 2019a; Yang et al. 2018). Among them, clustering based methods have reported very competitive accuracy for UDA in person re-ID. These methods usually employ an iterative process of predicting pseudo identities for unlabeled target samples according to the clusters and fine-tuning the model with those predicted samples.

∗ This work was done when Fengxiang Yang was an intern at Youtu Lab (yangfx@stu.xmu.edu.cn).

† Corresponding Author (zhiming.luo@xmu.edu.cn, winfredsun@tencent.com)

© 2020, Association for the Advancement of Artificial Intelligence

Despite their promising results, clustering based methods are restricted by two main drawbacks. On the one hand, the clustering accuracy cannot be guaranteed even with modern approaches, so the pseudo labels assigned by clusters can be noisy. Training the model with noisy labels that are assigned to wrong identities will undoubtedly damage the re-ID performance. On the other hand, most clustering methods, e.g., DBSCAN (Ester et al. 1996), tend to leave low-confidence samples as outliers and do not assign cluster labels to them. These outliers are usually hard samples that exhibit high image variations. Without considering such samples during training, the model may have trouble discriminating high-variation test samples. However, directly assigning them to the nearest cluster will bring more noisy labels, hindering the retraining of the model.
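The second drawback can be reproduced on toy data. The sketch below is a minimal pure-Python DBSCAN written only for illustration (it is not the implementation used in the paper). The 2-D points stand in for re-ID feature vectors, and `eps`, `min_samples`, and the helper `assign_outliers_to_nearest` are hypothetical choices.

```python
import math

NOISE = -1  # label used for outliers


def dbscan(points, eps, min_samples):
    """Minimal DBSCAN: one cluster id per point, NOISE for outliers."""
    n = len(points)
    # Each neighborhood includes the point itself.
    neighbors = [
        [j for j in range(n) if math.dist(points[i], points[j]) <= eps]
        for i in range(n)
    ]
    labels = [None] * n  # None means "not visited yet"
    cluster_id = 0
    for i in range(n):
        if labels[i] is not None:
            continue
        if len(neighbors[i]) < min_samples:
            labels[i] = NOISE  # may still be claimed later as a border point
            continue
        labels[i] = cluster_id
        queue = list(neighbors[i])
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point: joins, but is not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster_id
            if len(neighbors[j]) >= min_samples:
                queue.extend(neighbors[j])  # core point: keep growing the cluster
        cluster_id += 1
    return labels


def assign_outliers_to_nearest(points, labels):
    """Force each outlier into the cluster with the closest centroid.

    Assumes at least one cluster exists. This is the shortcut the text warns
    about: every outlier gets a label, whether or not it is correct.
    """
    members = {}
    for p, lab in zip(points, labels):
        if lab != NOISE:
            members.setdefault(lab, []).append(p)
    centroids = {
        lab: tuple(sum(c) / len(pts) for c in zip(*pts))
        for lab, pts in members.items()
    }
    return [
        lab if lab != NOISE
        else min(centroids, key=lambda k: math.dist(p, centroids[k]))
        for p, lab in zip(points, labels)
    ]


# Two tight groups plus one isolated "hard sample" at (4, 4).
points = [(0, 0), (0, 1), (1, 0), (4, 4), (10, 10), (10, 11), (11, 10)]
labels = dbscan(points, eps=1.5, min_samples=3)
forced = assign_outliers_to_nearest(points, labels)
print(labels)
print(forced)
```

The isolated point is labeled as noise and would be dropped from retraining. Forcing it into the nearest cluster gives it a pseudo identity that nothing in the data supports, which is how the extra label noise arises.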

Co-Teaching (CT) (Han et al. 2018) is a commonly used algorithm for training models with noisy labels, which learns two networks by feeding the small-loss samples of one network to the other. However, most co-teaching frameworks utilize symmetric inputs for both networks, which do not effectively apply to the context of clustering based cross-

arXiv:1912.01349v1 [cs.CV] 3 Dec 2019
