• Promising experimental results, comparable to the state of the art, have been obtained on the Caltech101, Pascal VOC2007, and Scene15 data sets, and significant improvements have been achieved over several existing MKL methods across the four data sets. A new bound is established for the performance of the state-of-the-art MKL method on object recognition.
The remainder of this paper is organized as follows. Section II
reviews related work. In Section III, the GS-MKL framework
is introduced for object recognition. The learning algorithm of
GS-MKL is presented in Section IV. Section V presents two
sample grouping strategies for GS-MKL. The experimental re-
sults are given in Section VI. Finally, Section VII concludes this
paper.
A preliminary version of this work has been published in
[53]. The main extensions include two grouping strategies in which sample grouping interacts with GS-MKL training, a comparison of these grouping strategies, comparisons of GS-MKL with other MKL methods, and more extensive experiments.
II. RELATED WORK
In the past decade, considerable research effort has been devoted to characterizing visual statistics for a number of object categories [2], [7], [13], [14], [19], [27]. Among these efforts, the kernel method [3], [5], [15], [16], [18] has been one of the most attractive research directions. Generally speaking, the kernel method offers two advantages in learning object categories: (1) a kernel explicitly defines a visual similarity measure between image pairs and implicitly maps the input space to the feature space [13], thereby avoiding an explicit feature representation and the curse of dimensionality; (2) combined with SVM, the kernel method can efficiently find the optimal separating hyperplane between positive and negative samples. Hence, the SVM-based kernel method has been applied to many recognition problems beyond object recognition (e.g., object detection [40] and image and video annotation [41]–[43]). Generally, SVM-based kernel methods used in object recognition can be categorized into four types, i.e., individual kernel design, canonical MKL, SS-MKL, and SVM ensemble. We briefly review these lines of work as follows.
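As a concrete illustration of advantage (2), the following minimal sketch trains an SVM on a precomputed kernel matrix. It assumes scikit-learn and uses a generic RBF similarity merely as a stand-in for any image-pair kernel; it is illustrative and does not reproduce any cited method.

    # Minimal sketch: SVM with a precomputed kernel matrix (assumes scikit-learn).
    # The RBF similarity is only a placeholder for an image-pair kernel.
    import numpy as np
    from sklearn.svm import SVC

    def kernel_matrix(A, B, gamma=0.5):
        # K[i, j] = exp(-gamma * ||A[i] - B[j]||^2)
        sq = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2)
        return np.exp(-gamma * sq)

    X_train = np.random.rand(20, 8)           # toy feature vectors
    y_train = np.random.randint(0, 2, 20)     # toy binary labels
    X_test = np.random.rand(5, 8)

    clf = SVC(kernel="precomputed")
    clf.fit(kernel_matrix(X_train, X_train), y_train)

    # At test time, each kernel row holds similarities to the training samples.
    print(clf.predict(kernel_matrix(X_test, X_train)))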
A. Individual Kernel Design
Recently, many efforts have been made to carefully design individual kernels that measure the similarity of an image pair. A kernel based on multiresolution histograms is introduced in [15] to measure image similarity at different granularities. A spatial pyramid matching kernel (PMK) is introduced in [3] to enforce loose spatial information by matching images in spatial coordinates. A kernel based on the local feature distribution is presented in [16] to model the local context of an image. A chi-squared kernel based on the pyramid histogram of oriented gradients (PHOG) is presented in [33] to capture shape similarity with spatial layout.
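As one example of this style of kernel, a chi-squared kernel over normalized histograms (of the general kind used with PHOG descriptors) can be sketched as follows; the bandwidth parameter and the toy histograms are placeholders, and the sketch is not the exact formulation of [33].

    # Minimal sketch of an exponential chi-squared kernel between two
    # L1-normalized histograms (e.g., PHOG-like shape histograms).
    import numpy as np

    def chi2_kernel(h1, h2, gamma=1.0, eps=1e-10):
        h1 = h1 / (h1.sum() + eps)            # L1-normalize each histogram
        h2 = h2 / (h2.sum() + eps)
        chi2 = np.sum((h1 - h2) ** 2 / (h1 + h2 + eps))
        return np.exp(-gamma * chi2)          # similarity in (0, 1]

    a = np.array([4.0, 1.0, 0.0, 3.0])
    b = np.array([2.0, 2.0, 1.0, 3.0])
    print(chi2_kernel(a, b))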
All these methods rely on features that represent particular visual characteristics. However, not all kernels play the same role in differentiating object categories. Hence, kernel selection/fusion over a set of available kernels is usually desired for generic object recognition. It is worth noting that individual kernels can be incorporated into the proposed GS-MKL framework to investigate their respective contributions to object recognition.
B. Canonical MKL
Recently, instead of using a single kernel, classifiers based on multikernel combination have been introduced into object recognition, yielding promising results [5], [18], [38], [45]. In [5] and [18], multiple features (e.g., appearance and shape) and kernels [e.g., PMK and spatial pyramid kernels (SPKs) with different hyper-parameters] are employed and combined in the MKL framework. Bosch et al. [45] strengthen MKL with a cross-validation strategy: the initial weights of multiple kernels are learnt by an extended MKL [5] and then refined by an exhaustive search that minimizes the classification error over a validation set. In [44], kernel alignment is utilized to optimize the multikernel combination over color, shape, and appearance features. Basically, these methods adopt a uniform multikernel combination over the whole input space. Hence, when training data exhibit high intraclass variation and interclass correlation on local training samples, these methods may suffer degraded performance because of the globally uniform multikernel combination.
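In other words, canonical MKL shares one set of kernel weights across all samples, roughly as in the sketch below (assuming scikit-learn; the base kernels and the hand-fixed weights are placeholders, whereas in MKL the weights would be learned jointly with the classifier).

    # Minimal sketch of a uniform multikernel combination: a single weight
    # vector beta is shared across the whole input space.
    import numpy as np
    from sklearn.svm import SVC

    def linear_kernel(A, B):
        return A @ B.T

    def rbf_kernel(A, B, gamma=0.5):
        sq = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2)
        return np.exp(-gamma * sq)

    X = np.random.rand(30, 6)
    y = np.random.randint(0, 2, 30)

    beta = np.array([0.3, 0.7])               # global kernel weights (hand-fixed here)
    K = beta[0] * linear_kernel(X, X) + beta[1] * rbf_kernel(X, X)

    clf = SVC(kernel="precomputed").fit(K, y)
    print(clf.score(K, y))                     # training accuracy on toy data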
C. SS-MKL
More recently, SS-MKL methods have been proposed in [23], [27], and [29] by using sample-specific kernel weighting strategies. The basic idea is that kernel weights depend not only on the kernel functions but also on the samples themselves. Compared with canonical MKL, SS-MKL tends to reflect the relative importance of different kernels at the level of individual samples rather than at the level of object categories. Despite some performance improvements, learning so many parameters may lead to expensive computation and a risk of overfitting.
It has to be noted that, although the proposed GS-MKL and the methods [5], [18], [23], [27], [45] reviewed above all extend the MKL framework, GS-MKL provides a mechanism for evaluating multiple kernels over sample groups. From this view, GS-MKL is a more flexible framework that subsumes canonical MKL and SS-MKL as special cases obtained by changing the number of groups. GS-MKL thus provides a tractable way to adapt the multikernel combination to the local data distribution of each sample group.
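The distinction among the three weighting schemes can be summarized informally as follows; the notation is illustrative rather than taken verbatim from the cited formulations:
\[
\text{canonical MKL: } K(x_i, x_j) = \sum_{m} \beta_m \, K_m(x_i, x_j),
\]
\[
\text{SS-MKL: } K(x_i, x_j) = \sum_{m} \beta_m(x_i)\,\beta_m(x_j)\, K_m(x_i, x_j),
\]
\[
\text{GS-MKL: } K(x_i, x_j) = \sum_{m} \beta_m^{g(x_i)}\,\beta_m^{g(x_j)}\, K_m(x_i, x_j),
\]
where the $K_m$ are base kernels, $\beta$ denotes kernel weights, and $g(x)$ is the group to which sample $x$ belongs. With a single group, GS-MKL reduces to canonical MKL; with one group per sample, it reduces to SS-MKL.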
D. Learning With Classifier Ensemble
Instead of relying on a single classifier, classifier ensembles have been proposed as an alternative technique to improve classification accuracy. Classifier ensembles can take place at the data, feature, and classifier levels [46]. To cope with the diversity of data, a straightforward classifier ensemble method employs a data partitioning strategy in which each base classifier is trained over a distinct subset of the training data. Such divide-and-conquer methods train multiple base classifiers that are experts in their specific parts of the data space. However, the base classifiers are trained independently, leaving out the other partitions of the data. When this independence assumption does not hold, it cannot be assured that the decisions of the base classifiers will improve the final classification performance.
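A minimal sketch of such a divide-and-conquer ensemble is given below; it is a generic illustration (not a particular method from [46]) that partitions the training data by clustering, trains one base classifier per partition, and combines their predictions by majority vote.

    # Minimal sketch of a data-partitioning classifier ensemble (assumes
    # scikit-learn): each base classifier sees only its own partition, and
    # test predictions are combined by majority vote.
    import numpy as np
    from sklearn.cluster import KMeans
    from sklearn.svm import SVC

    X = np.random.rand(60, 5)
    y = np.random.randint(0, 2, 60)
    X_test = np.random.rand(10, 5)

    # Partition the training data into disjoint groups (here via k-means).
    groups = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

    base_clfs = []
    for g in np.unique(groups):
        idx = groups == g
        if len(np.unique(y[idx])) < 2:        # skip partitions with a single class
            continue
        base_clfs.append(SVC(kernel="rbf").fit(X[idx], y[idx]))

    # Majority vote over the independently trained base classifiers.
    votes = np.stack([clf.predict(X_test) for clf in base_clfs])
    print((votes.mean(axis=0) >= 0.5).astype(int))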