Fast-MoCo: Accelerating Self-Supervised Contrastive Learning with Combinatorial Patches

Self-supervised learning


Fast-MoCo: Boost Momentum-based Contrastive Learning

[Figure 2 diagram. Labels: Online Branch, Target Branch, Divide, encoder, Combine, Grad., Momentum update, Contrastive Loss.]

Fig. 2: Overview of the Fast-MoCo framework. It consists of four steps: 1) the Divide step, where the input image in the online branch is divided into multiple patches; 2) the Encode step, where the encoder f encodes the features of the patches separately; 3) the Combine step, which combines the encoded features (at the last layer of the neural network); and 4) the combined features are fed into the projector g, the predictor q, and the contrastive loss for contrastive learning. Compared with MoCo, we add the Divide step and the Combine step in the online branch, with details in Section 3.2. The target branch is the same as in MoCo.

3.2 Fast-MoCo

In this section, we introduce Fast-MoCo, a simple method that can greatly improve the training efficiency of self-supervised learning with negligible extra cost. An overview of Fast-MoCo is shown in Fig. 2. With MoCo v3 as the baseline, Fast-MoCo makes only three modifications: 1) it adds a Divide step that divides an image into multiple patches before sending the patches to the encoder‡ of the online branch; 2) it inserts a Combine step immediately after the encoder to combine patches; and 3) it slightly modifies the definition of positive and negative pairs to match the divide and combine operations. In the following, we describe the Divide step, the Combine step, and the modified loss function in detail.

Divide Step. For the online branch, instead of directly feeding the given augmented image $x$ into the encoder, we first divide it into an $m \times m$ grid of patches $\{x_p \mid p \in \{1, \dots, m^2\}\}$ as shown in Fig. 2, where $\mathbf{p}$ denotes the set of patch indices $\{p\}$. The influence of $m$ is analyzed in Section 5.4.
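The Divide step can be illustrated with a minimal NumPy sketch. The function name `divide_into_patches`, the (H, W, C) array layout, the row-by-row patch order, and the requirement that the image size be divisible by m are all assumptions made for illustration; they are not taken from the paper. Patch indices here start at 0, whereas the text uses 1, ..., m^2.

```python
import numpy as np


def divide_into_patches(image: np.ndarray, m: int) -> np.ndarray:
    """Split an (H, W, C) image into an m x m grid of patches.

    Hypothetical helper: returns an array of shape (m*m, H//m, W//m, C),
    with patches ordered row by row (an assumed convention).
    """
    h, w, c = image.shape
    if h % m or w % m:
        raise ValueError("height and width must be divisible by m")
    ph, pw = h // m, w // m
    # (H, W, C) -> (m, ph, m, pw, C): split rows and columns into grid cells.
    grid = image.reshape(m, ph, m, pw, c)
    # Bring the two grid axes together: (m, m, ph, pw, C).
    grid = grid.transpose(0, 2, 1, 3, 4)
    # Flatten the grid axes into a single patch index.
    return grid.reshape(m * m, ph, pw, c)


# Toy 4x4 RGB "image" divided into a 2x2 grid of 2x2 patches.
image = np.arange(4 * 4 * 3).reshape(4, 4, 3)
patches = divide_into_patches(image, m=2)
```

In this toy case `patches[1]` is the top-right quadrant of `image`. Each patch would then be passed through the encoder separately.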

Combine Step. Instead of using the encoded embedding of each patch individually in the next step, we combine multiple (fewer than $m^2$) patch embeddings $v_p$ to form combined embeddings $c$ before sending them to the next step, i.e., the projector.

To form a combined embedding, we take a subset of $n$ indices from the patch index set $\mathbf{p}$, denoted $\mathbf{p}_n$ ($\subseteq \mathbf{p}$), and collect their corresponding features $v_{\mathbf{p}_n} = \{v_p \mid p \in \mathbf{p}_n\}$. While there could be diverse options for combining multiple embeddings (e.g., concatenation, sum), we empirically found that simply averag-

‡ In this paper, we explore only ResNet50 as the encoder and leave the evaluation of the ViT version of MoCo v3 as future work.
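The Combine step described above can be sketched as follows. This is a rough illustration under two stated assumptions. First, the combination is taken to be a plain mean over the selected patch embeddings, which is what the truncated sentence in the text appears to point to. Second, every n-element subset of the patch indices is enumerated; the excerpt does not say how subsets are chosen. The function name `combine_patch_embeddings` and the (num_patches, dim) layout are hypothetical.

```python
from itertools import combinations

import numpy as np


def combine_patch_embeddings(v: np.ndarray, n: int):
    """Average every n-subset of patch embeddings.

    v has shape (num_patches, dim), one row per encoded patch.
    Returns (subsets, c), where c[i] is the mean of the rows in subsets[i].
    Assumptions: mean as the combine operation, all subsets enumerated.
    """
    num_patches = v.shape[0]
    # The text combines "multiple (fewer than m^2)" embeddings.
    if not 2 <= n < num_patches:
        raise ValueError("n must satisfy 2 <= n < number of patches")
    subsets = list(combinations(range(num_patches), n))
    c = np.stack([v[list(s)].mean(axis=0) for s in subsets])
    return subsets, c


# Toy embeddings for a 2x2 grid (4 patches), each of dimension 2.
v = np.arange(8.0).reshape(4, 2)
subsets, c = combine_patch_embeddings(v, n=2)
```

With 4 patches and n = 2 this yields 6 combined embeddings from one encoded image, so a single forward pass over the patches supplies several inputs to the projector.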
