多核学习驱动的审美图像分类方法

60 浏览量更新于2024-08-31 收藏 349KB PDF 举报

本文主要探讨了"基于多核学习的审美图像分类"这一主题，针对图像美学质量预测这一具有挑战性的问题，提出了一个创新的方法。在当今数字时代，自动识别照片是否能引发大多数人的强烈喜爱或冷漠反应变得越来越重要，尤其是在社交媒体和在线内容创作领域。审美图像分类涉及两个关键任务：一是构建与图像美感相关的特异性特征，二是设计有效的模型来弥合情感概念与视觉特征之间的"语义鸿沟"。作者Ningning Liu、Xin Jin、Hui Lin 和 De Zhang分别来自中国对外经济贸易大学信息科技与管理学院、北京电子科技学院计算机科学技术系以及北京建筑大学自动化系，他们共同合作，将多核学习（Multiple Kernel Learning, MKL）方法应用于审美图像分类的研究中。多核学习是一种机器学习技术，它允许模型利用多个不同的特征子空间，通过组合它们的优点来提高分类性能。这种方法的优势在于，能够适应不同的特征表示，从而增强对图像美学特征的捕捉和理解。在他们的研究中，提出了一种基于MKL的审美图像分类框架。首先，通过深度学习或传统的视觉特征提取技术（如SIFT、HOG或CNN），从图片中抽取一组丰富的视觉特征。然后，这些特征被映射到不同的核函数空间，每个空间代表了图像美感的不同维度。接下来，MKL算法会动态地选择和组合这些核函数，形成一个优化的权重向量，以最大化分类的准确性。该方法通过解决不同特征空间之间的协同作用，有效地解决了传统单一核函数可能无法捕捉到的复杂美感特征。此外，通过多核学习，模型可以更好地处理非线性和复杂关系，从而提高了审美分类的精度和鲁棒性。最后，实验部分展示了所提方法在公开数据集上的性能评估，与其他经典方法进行了对比，证明了其在审美图像分类任务中的优越性。总结来说，这篇文章的核心贡献在于将多核学习理论与图像美学特性相结合，为图像分类问题提供了一个新的解决方案，有助于推动计算机视觉和人工智能领域的研究进展，特别是在艺术和视觉媒体分析领域有着广泛的应用前景。

Aesthetic Image Classification based on

Multiple Kernel Learning

Ningning Liu

, Xin Jin

, Hui Lin

, De Zhang

University of International Business and Economics, School of Information technology and

Management, BJ, 100029, P.R.China

Beijing Electronic Science and Technology Institute, Department of Computer Science and

Technology, BJ, 100070, P.R.China

Beijing University of Civil Engineering and Architecture, Department of Automation, BJ,

100044, China

{ningning.liu@uibe.edu.cn, jinxin@besti.edu.cn, linhuivicky@foxmail.com, zhang-

de@bucea.edu.cn,}

Abstract

Aesthetic image classification aims at predicting the aesthetic quality of photos au-

tomatically, i.e. whether the photo elicits a high or low level of affection in a majority

of people. To solve the problem, one challenge is to build features specific to image

aesthetic perceptions, and another one is to build effective learning models to bridge

the “semantic gap” between the emotion related concepts and the extracted visual

features. In this paper, we present an approach for aesthetic image classification based

on Multiple Kernel Learning (MKL) method, which seeks for maximizing the classi-

fication performance without explicit feature selection steps. The experiments are

conducted on a large diverse database built from online photo sharing website, and

the results demonstrated the advantages of MKL in terms of feature selection, classi-

fication performance, and interpretation, for the aesthetic image classification task.

Keywords: aesthetic quality, image classification, multiple kernel learning

1 Introduction

Aesthetics is a sub discipline of philosophy and axiology dealing with the nature of

beauty, art, and taste. The assessment or prediction of aesthetic value in images is

considered to be of subjectivity and universality. The subjective feature suggests that

the judgment relies on individual personal feelings, and there is no single agreement

on what it exactly belongs to. In contrast, the universality indicates that certain fea-

tures in photographic images are believed to please humans more than others. In con-

clusion, though the evaluation of beauty and other aesthetic qualities of photographs

is highly subjective, still they have certain stability and generality across different

people and cultures as a universal validity to classify images in terms of aesthetic

quality [2]. Figure 1 shows two photos from an online website, and according to the

下载后可阅读完整内容，剩余8页未读，立即下载

weixin_38693657

粉丝: 0
资源: 926

多核学习驱动的审美图像分类方法

基于多核学习的GIST全局和SIFT局部特征融合遥感图像检索方法.pdf

图像卷积的国内外研究现状

多核cpu应用场景_一“芯”多用——基于多核处理器的汽车软件开发应用

多核学习 matlab

帮我写一篇基于多核异构处理器系统带宽调整策略专利

不是，是多视图学习里的多核学习

多视图分类主要包括四种学习范式：多核学习、子空间学习、稀疏学习和加权学习

多视图分类以什么分成多核学习、子空间学习、稀疏学习和加权学习

shogun多核学习

fpga图像处理最新的算法

最新资源