In this paper, to perform adaptation to the target domain without source data, we first propose Local Structure Clustering (LSC), which clusters each target feature together with its nearest neighbors. The motivation is that a target feature should have a prediction similar to those of its semantically close neighbors. To preserve source performance, we propose sparse domain attention (SDA), applied to the output of the feature extractor, which activates different feature channels depending on the domain. The source domain attention is used to regularize the gradient during target adaptation to prevent forgetting of source information. With LSC and SDA, the adapted model achieves excellent performance on both source and target domains. In the experiments, we show that our target performance is on par with or better than existing DA and SFDA methods on several benchmarks, achieving state-of-the-art performance on VisDA (85.4%), while simultaneously maintaining good source performance. We also extend our method to Continual Source-free Domain Adaptation, where there is more than one target domain, further demonstrating the effectiveness of our method.
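To make the neighborhood-based clustering concrete, the following PyTorch-style sketch illustrates one possible form of the LSC objective. The memory banks feat_bank and score_bank, the number of neighbors k, and the exact dot-product loss are illustrative assumptions, not the paper's exact implementation.

```python
import torch
import torch.nn.functional as F

# Illustrative sketch of Local Structure Clustering (LSC).
# feat_bank (N, D): L2-normalized target features stored in a memory bank (assumed).
# score_bank (N, C): softmax predictions for the same target samples (assumed).
def lsc_loss(features, scores, feat_bank, score_bank, k=3):
    """Encourage each target sample to agree with the predictions of its k nearest neighbors."""
    feat = F.normalize(features, dim=1)                      # (B, D)
    sim = feat @ feat_bank.t()                               # cosine similarity to the bank, (B, N)
    _, idx = sim.topk(k + 1, dim=1)                          # +1: the sample itself is also in the bank
    neighbor_scores = score_bank[idx[:, 1:]]                 # (B, k, C), drop the self-match
    # maximize the dot product between each prediction and its neighbors' predictions
    agreement = torch.bmm(neighbor_scores, scores.unsqueeze(2)).squeeze(2)  # (B, k)
    return -agreement.sum(dim=1).mean()
```

In such a scheme, the banks would typically be refreshed with the current mini-batch's features and predictions after each step, and a diversity term is commonly added to avoid degenerate solutions where all samples collapse into one class.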
We summarize our contributions as follows:
• We propose a new domain adaptation paradigm denoted as Generalized Source-free Domain Adaptation (G-SFDA), where the source-pretrained model is adapted to target domains while keeping the performance on the source domain, in the absence of source data.
• We propose local structure clustering (LSC), which utilizes local neighborhood information in feature space to achieve source-free domain adaptation.
• We propose sparse domain attention (SDA), which activates different feature channels for different domains and regularizes the gradients during backpropagation in target adaptation to retain source-domain information (see the sketch after this list).
• In experiments, we show that where existing methods suffer from forgetting and perform poorly on the source domain, our method is able to maintain source-domain performance. Furthermore, when focusing on the target domain, our method is on par with or better than existing methods; in particular, we achieve state-of-the-art target performance on VisDA.
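As referenced in the SDA contribution above, the following sketch shows one way such per-domain channel attention and gradient masking could be wired up, assuming the classifier is a single linear layer on top of the gated features. The binary masks A_s and A_t, the sparsity level, and the form of the gradient masking are illustrative assumptions rather than the exact formulation used in the paper.

```python
import torch
import torch.nn as nn

# Illustrative sketch of Sparse Domain Attention (SDA): a (near-)binary attention
# vector per domain gates the channels of the feature extractor's output.
class SparseDomainAttention(nn.Module):
    def __init__(self, num_channels, sparsity=0.5):
        super().__init__()
        # Hypothetical fixed masks; in practice they could be learned or derived from source data.
        self.register_buffer('A_s', (torch.rand(num_channels) > sparsity).float())  # source mask
        self.register_buffer('A_t', (torch.rand(num_channels) > sparsity).float())  # target mask

    def forward(self, feat, domain='target'):
        mask = self.A_s if domain == 'source' else self.A_t
        return feat * mask  # activate only the channels assigned to this domain


# During target adaptation, gradients on channels important to the source domain
# can be suppressed so that source knowledge is not overwritten (sketch):
def mask_classifier_grad(classifier_weight, A_s):
    if classifier_weight.grad is not None:
        classifier_weight.grad *= (1.0 - A_s)  # broadcasts over the class dimension
```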
2. Related Work
Here we discuss related domain adaptation settings.
Domain Adaptation.
Early domain adaptation methods such as [21, 37, 39] adopt moment matching to align feature distributions. Inspired by adversarial learning, DANN [7] formulates domain adaptation as an adversarial two-player game. CDAN [22] trains a deep network conditioned on several sources of information. DIRT-T [35] performs domain adversarial training with an added term that penalizes violations of the cluster assumption. Domain adaptation has also been tackled from other perspectives. MCD [31] adopts prediction diversity between multiple learnable classifiers to achieve local or category-level feature alignment between source and target domains. DAMN [3] introduces a framework where each domain undergoes a different sequence of operations. AFN [44] shows that the erratic discrimination of target features stems from their much smaller norms compared to source features. SRDC [38] proposes to directly uncover the intrinsic target discrimination via discriminative clustering to achieve adaptation. The most relevant work to our LSC is DANCE [29], which targets universal domain adaptation and is based on neighborhood clustering. However, DANCE relies on instance discrimination [43] between all features, while our method applies consistency regularization to only a few semantically close neighbors.
Source-free Domain Adaptation.
Standard domain adaptation methods require access to source data during adaptation. Recently, several methods have investigated source-free domain adaptation. USFDA [14] and FS [15] explore source-free universal DA [48] and open-set DA [32], and DECISION [2] addresses multi-source DA. Most related to our work are SHOT [20] and 3C-GAN [18], both for closed-set DA. SHOT proposes to fix the source classifier and match the target features to the fixed classifier by maximizing mutual information and using pseudo-labeling. 3C-GAN synthesizes labeled target-style training images based on a conditional GAN. Recently, BAIT [46] extends diverse-classifier-based domain adaptation methods to also be applicable to SFDA. Though achieving good target performance, these methods cannot maintain source performance after adaptation. Unlike these methods, we aim to maintain source-domain performance after adaptation.
Continual Domain Adaptation.
Continual learning (CL) [13, 19, 23, 25] specifically focuses on avoiding catastrophic forgetting when learning new tasks, but it is not tailored for DA since new tasks in CL usually have labeled data. Recently, a few works [4, 26, 36] have emerged that aim to tackle the Continual Domain Adaptation (CDA) problem. [4] uses sample replay together with domain adversarial training to avoid forgetting, [26] builds a domain relation graph, and [36] builds a domain-specific memory buffer for each domain to regularize the gradient on both the target data and the memory buffer. Although these methods achieve good performance, they all demand access to source data. [16] is source-free, but it focuses on class-incremental single-target domain adaptation where there is only one-shot labeled target data per class, while our method is related to