Zero-shot Metric Learning
Xinyi Xu, Huanhuan Cao, Yanhua Yang, Erkun Yang and Cheng Deng∗
School of Electronic Engineering, Xidian University, Xi'an 710071, China
xyxu.xd@gmail.com, hhcao@stu.xidian.edu.cn, yanhyang@xidian.edu.cn,
{erkunyang, chdeng.xd}@gmail.com
∗Corresponding author.
Abstract
In this work, we tackle the zero-shot metric learning problem and propose a novel method, abbreviated as ZSML, whose purpose is to learn a distance metric that measures the similarity of unseen categories (and even unseen datasets). ZSML achieves strong transferability by capturing multi-nonlinear yet continuous relations among data. It is motivated by two facts: 1) relations can essentially be described from various perspectives; and 2) traditional binary supervision is insufficient to represent continuous visual similarity. Specifically, we first reformulate a collection of specific-shaped convolutional kernels to combine data pairs and generate multiple relation vectors. Furthermore, we design a new cross-update regression loss to discover continuous similarity. Extensive experiments, including intra-dataset and inter-dataset transfer on four benchmark datasets, demonstrate that ZSML achieves state-of-the-art performance.
1 Introduction
Metric learning aims to find appropriate similarity measurements of data points, whose core intuition is to preserve the distances between data points in an embedding space. This topic is of great practical importance due to its wide applications in many related areas, such as face recognition [Guillaumin et al., 2009], clustering [Davis et al., 2007; Xing et al., 2003], and retrieval [Zhou et al., 2004].
The Euclidean distance is one of the most common similarity metrics since it requires neither prior information nor a training process. However, it may yield unsatisfactory results because it treats all feature dimensions equally and independently, and thus fails to capture the idiosyncrasies of the data. In contrast, the parametric Mahalanobis distance, which can model the importance of different dimensions, has been adopted in many works. Some representative Mahalanobis approaches [Hoi et al., 2006; Xing et al., 2003] project data linearly and minimize the Euclidean distance between positive pairs while maximizing it between negative pairs. Alternatively, one may also directly optimize the Mahalanobis metric for nearest neighbor classification; representative works include, but are not limited to, Neighborhood Component Analysis (NCA) [Roweis et al., 2004], Large Margin Nearest Neighbor (LMNN) [Weinberger and Saul, 2009], and Nearest Class Mean (NCM) [Mensink et al., 2013]. Prior information plays a pivotal role in the success of these metric learning schemes; therefore, unsatisfactory results can be produced when such a prior is not available.
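For reference, the two distances discussed above take the following standard forms (the notation here is generic rather than specific to this paper):
$$
d_{\mathrm{E}}(\mathbf{x}_i,\mathbf{x}_j)=\lVert\mathbf{x}_i-\mathbf{x}_j\rVert_2,
\qquad
d_{\mathbf{M}}(\mathbf{x}_i,\mathbf{x}_j)=\sqrt{(\mathbf{x}_i-\mathbf{x}_j)^{\top}\mathbf{M}\,(\mathbf{x}_i-\mathbf{x}_j)},
$$
where $\mathbf{M}\succeq 0$ is a learned positive semi-definite matrix. Setting $\mathbf{M}=\mathbf{I}$ recovers the Euclidean case, which is why the latter can weight feature dimensions differently while the former cannot.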
In this paper, we are committed to a more challenging task: zero-shot metric learning, whose ambition is to learn an effective metric for unseen categories and datasets. That is, the learned metric must measure similarity without access to the target data. Powerful transferability can be obtained by capturing the multi-nonlinear and continuous relations, which is consistent with the innate character of the data. Particularly, we first reformulate a set of specific-shaped convolutional kernels to discover various kinds of relations. It is well known that convolutional neural networks (CNNs) have great power in feature embedding [Lecun et al., 1998; Donahue et al., 2013; Toshev and Szegedy, 2014], while in this paper they are employed to reveal the correlations among data.
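To make this idea concrete, the sketch below stacks the two embeddings of a pair into a two-row map and applies several differently shaped kernels, each producing one relation vector. The kernel widths, dimensions, and pooling used here are illustrative placeholders rather than the exact configuration of ZSML.

```python
import torch
import torch.nn as nn

class PairRelationSketch(nn.Module):
    """Illustrative sketch: combine a data pair with a bank of
    differently shaped convolutional kernels, each yielding one
    relation vector. Shapes and sizes are placeholders."""

    def __init__(self, feat_dim=512, out_dim=128, widths=(1, 3, 5, 7)):
        super().__init__()
        # Each branch spans both rows of the stacked pair (height 2)
        # but uses a different width, relating the pair at several scales.
        self.branches = nn.ModuleList([
            nn.Conv2d(1, out_dim, kernel_size=(2, w), padding=(0, w // 2))
            for w in widths
        ])

    def forward(self, x_i, x_j):
        # x_i, x_j: (batch, feat_dim) embeddings of the two samples in a pair
        pair = torch.stack([x_i, x_j], dim=1).unsqueeze(1)  # (B, 1, 2, D)
        relations = []
        for conv in self.branches:
            r = conv(pair)          # (B, out_dim, 1, D)
            r = r.mean(dim=(2, 3))  # pool to one relation vector per branch
            relations.append(r)
        return relations            # list of (B, out_dim) relation vectors
```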
Then, we design a cross-update regression loss, which relaxes the binary supervision imposed on positive pairs (PPs) and negative pairs (NPs) to extend generalization capability. Specifically, we initialize a coarse continuous label as weak supervision of the predicted similarity, and update the coarse label and the predicted similarity alternately until convergence. By doing so, we can learn the similarity order and improve transferability.
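A minimal sketch of one such alternating step is shown below; the mean-squared regression objective, the momentum-style label update, and the clamping that keeps PPs above NPs are our own illustrative assumptions, not the exact update rules of ZSML.

```python
import torch
import torch.nn.functional as F

def cross_update_step(pred_sim, coarse_label, is_positive, momentum=0.9):
    """One illustrative cross-update iteration (assumed rules).

    pred_sim:     (N,) similarities predicted by the relation model
    coarse_label: (N,) current continuous targets in [0, 1]
    is_positive:  (N,) boolean mask marking positive pairs
    """
    # Regression loss: the prediction chases the continuous target ...
    loss = F.mse_loss(pred_sim, coarse_label)

    # ... while the target drifts toward the prediction, but stays
    # consistent with the binary supervision (positives >= negatives).
    with torch.no_grad():
        new_label = momentum * coarse_label + (1 - momentum) * pred_sim
        new_label = torch.where(is_positive,
                                new_label.clamp(min=0.5),
                                new_label.clamp(max=0.5))
    return loss, new_label
```

In training, such a step would alternate with gradient updates of the model until the predicted similarities and the continuous labels agree.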
To better demonstrate the superiority of ZSML, we present multi-level transfer tasks, which cover transferring to unseen categories within one dataset (intra-dataset ZSML) and to unseen datasets (inter-dataset ZSML). In a nutshell, the main contributions of our work can be summarized as follows:
• Departing from the traditional single and linear relation representation, we reformulate a family of specific-shaped convolutional kernels which can capture the multi-nonlinear relations among data points.
• We devise a cross-update regression loss for learning continuous similarity to improve generalization capability, which is verified in our empirical study.
• Extensive transfer experiments demonstrate that our model can better measure the similarity of unseen categories and even unseen datasets.