深度学习解析美学图像剪裁：ASM-NET模型

图片美学

深度学习

需积分: 9 51 浏览量更新于2024-09-05 1 收藏 3.85MB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

“ASM-NET可解释的美学评分及图像剪裁技术主要关注图像美学和深度学习在图像裁剪中的应用。ASM-NET是一种创新的深度学习模型，它旨在揭示美学评估的内在机制，并能对图像的各个部分进行解释性的美学评分。” 在图像处理领域，图像美学是一个重要的研究方向，它涉及到如何评价一张图片的视觉吸引力和艺术价值。ASM-NET（Aesthetic Score Map Network）是针对这一问题提出的一种深度学习模型，特别关注于图像的裁剪任务。图像裁剪是为了寻找图像中最具美学价值的部分，创建出最吸引人的小图像窗口。这一过程在许多应用场景中都非常重要，如社交媒体分享、广告设计等。 ASM-NET的核心在于其生成的美学评分地图。这是一个全卷积网络（Fully Convolutional Network）的产物，它可以对输入图像的每一个像素区域进行评分，生成一个覆盖整个图像的评分图。这个评分图对于所有可能的裁剪候选区域都是共享的，意味着在裁剪评估阶段，模型可以考虑不同裁剪方式对整体美学质量的影响。为了提高评分的准确性和解释性，ASM-NET引入了两个关键概念：构图感知（Composition-Aware）和显著性感知（Saliency-Aware）。构图感知是指模型在评估时会考虑到图像的布局、颜色、对比度等构图元素，这些元素对于图像的整体美感有着重要影响。显著性感知则意味着模型会关注图像中的显著区域，这些区域通常包含画面的重点或兴趣点，它们的保留程度对美学质量有直接影响。在ASM-NET的框架下，同一区域可能因为不同的构图关系和显著性特征而获得不同的美学评分。通过这种方式，模型不仅能够决定最佳裁剪位置，还能解释为什么选择这个位置，为用户提供了一种理解模型决策的途径。这在深度学习模型中是相对少见的，大多数模型往往被视为“黑箱”，而ASM-NET的可解释性特征有助于提升模型的可信度和实用性。 ASM-NET是深度学习在图像美学领域的一个重要进展，它的贡献在于提供了一种新的、解释性强的图像裁剪方法，这对于理解和优化图像美学评价模型具有重要意义。通过构图和显著性相结合的方式，ASM-NET能够在保持高美学标准的同时，确保裁剪结果的合理性和可解释性。

资源详情

资源推荐

Image Cropping with Composition and Saliency Aware Aesthetic Score Map

Yi Tu,

Li Niu,

1∗

Weijie Zhao,

Dawei Cheng,

Liqing Zhang

1∗

MoE Key Lab of Artiﬁcial Intelligence, Department of Computer Science and Engineering

Shanghai Jiao Tong University, Shanghai, China

{tuyi1991, ustcnewly, dawei.cheng}@sjtu.edu.cn, zhang-lq@cs.sjtu.edu.cn

Versa Inc, Shanghai, China

weijie.zhao@versa-ai.com

Abstract

Aesthetic image cropping is a practical but challenging task

which aims at ﬁnding the best crops with the highest aesthetic

quality in an image. Recently, many deep learning methods

have been proposed to address this problem, but they did not

reveal the intrinsic mechanism of aesthetic evaluation. In this

paper, we propose an interpretable image cropping model to

unveil the mystery. For each image, we use a fully convo-

lutional network to produce an aesthetic score map, which

is shared among all candidate crops during crop-level aes-

thetic evaluation. Then, we require the aesthetic score map

to be both composition-aware and saliency-aware. In par-

ticular, the same region is assigned with different aesthetic

scores based on its relative positions in different crops. More-

over, a visually salient region is supposed to have more sensi-

tive aesthetic scores so that our network can learn to place

salient objects at more proper positions. Such an aesthetic

score map can be used to localize aesthetically important re-

gions in an image, which sheds light on the composition rules

learned by our model. We show the competitive performance

of our model in the image cropping task on several bench-

mark datasets, and also demonstrate its generality in real-

world applications.

1 Introduction

Given an image, the image cropping task aims at ﬁnding the

crops with the best aesthetic quality. It is an important task

that can be widely used in a lot of down-stream applications,

e.g., photo post-processing (Chen et al. 2017b), view rec-

ommendation (Li et al. 2018; Wei et al. 2018), and image

thumbnailing (Esmaeili, Singh, and Davis 2017). In order to

ﬁnd the best crop, an image cropping model will ﬁrst gen-

erate a large number of candidate crops and then determine

the best crop based on crop-level aesthetic evaluation. So an

image cropping model is usually composed of two stages,

candidate generation and aesthetic evaluation. A good image

crop is achieved by selecting important contents and placing

them with a good composition. The required knowledge for

such a task can be categorized into two parts, i.e., content

preference and composition preference. Therefore, a good

∗

Corresponding author.

 2020, Association for the Advancement of Artiﬁcial

Figure 1: Images crop with composition rules. The orange

box in each image denotes a good crop found based on

human-deﬁned composition rules. The white dotted lines de-

note the auxiliary lines used in these composition rules.

image cropping model should be able to learn and leverage

such preferences when searching for the best crop.

Early methods achieve this goal by explicitly utilizing

some photography knowledge like human-deﬁned compo-

sition rules, e.g., Rule of Thirds and Rule of Central (See

Figure 1). With the development of deep learning, recent

researchers learn image cropping in a data-driven manner

and many aesthetic datasets are constructed to encode the

aesthetic preference of humans. Recent methods (Wang and

Shen 2017; Chen et al. 2017b; Wei et al. 2018; Lu et al.

2019c) treat it as an object detection task. They used aes-

thetic datasets to train an aesthetic evaluation model and ap-

plied it to compare candidate crops. Due to the power of

deep learning, these methods have brought progresses in this

ﬁeld, but the intrinsic mechanism remains unrevealed.

In this paper, we propose an interpretable image cropping

model to produce both composition-aware and saliency-

aware Aesthetic Score Maps, called ASM-Net. Our ap-

proach was ﬁrst inspired by the Class-Activation-Map

(CAM) method (Zhou et al. 2016), which uses a class activa-

tion map to localize the most discriminative image regions

in image classiﬁcation task. Similarly, we expect to use an

aesthetic score map to localize aesthetically important image

regions. The aesthetic score of a region can be obtained via

average pooling and the regions with larger aesthetic scores

are of higher aesthetic quality. However, direct application

of CAM has been proven ineffective because the aesthetic

evaluation task is more complicated than classiﬁcation and

one region cannot be simply represented by a single score.

arXiv:1911.10492v1 [cs.CV] 24 Nov 2019

下载后可阅读完整内容，剩余7页未读，立即下载

cq_liyo

粉丝: 0
资源: 9

深度学习解析美学图像剪裁：ASM-NET模型

asm-util.jar

redhat/centos6.9 kmod-oracleasm/oracleasm-support/oracleasm rpm包

kmod-oracleasm-2.0.8-26.el7.x86_64.rpm

kmod-oracleasm-2.0.8-15.el6_9.x86_64安装包

oracleasm-support-2.1.8-1.el6.x86_64.rpm

oracleasm-support-2.1.8-3.el7.x86_64.rpm 下载

kmod-oracleasm-2.0.8-13.el6_8.x86_64.rpm

oracleasm-support-2.1.8-3.el7.x86_64.rpm

oracleasm-support 安装

asm-3.2.jar

arch/x86/Makefile:184: *** Compiler lacks asm-goto support.. Stop.

kernel < 2.6.32-359.el6 is needed by kmod-oracleasm-2.0.6.rh1-2.el6.x86_64

arch/x86/makefile:184: *** compiler lacks asm-goto support.。 停止。

ln -s asm-$2 asm

ubuntu /usr/include/asm-generic/目录没有上面的文件和文件夹asm/sysreg-defs.h

解读E：\oracle\product\10.2.0\db_1\bin\oradim.exe-startup -sid +asm -usrpwd systrqxv "" -log oradim.log -nocheck 0

Caused by: java.nio.file.NoSuchFileException: C:\Users\86198\.m2\repository\org\glassfish\hk2\hk2\2.6.1\asm-commons.jar

./configure --prefix=/home/lw/opencv_4/zwork --host=arm-linux-gnueabihf --cross-prefix=arm-linux-gnueabihf- --disable-asm --enable-shared

oracleasmlib-2.0.12-1.el7.x86_64.rpm下载

crs-34927: cannot stop resource 'ora.asm' outside of its resource group 'ora

最新资源

arch/x86/makefile:184: *** compiler lacks asm-goto support.。停止。