Crime Scene Investigation Image Retrieval: CNN and Low-Level Feature Fusion


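"Image Retrieval using CNN and Low-level Feature Fusion for Crime Scene Investigation Image Database" examines how convolutional neural networks (CNNs) and low-level feature fusion can be applied to image retrieval in crime scene investigation (CSI) image databases. With advances in deep learning, CNNs have shown excellent performance on large-scale image retrieval tasks. CSI image datasets, however, are usually small, which can cause a CNN model to overfit during training. To address this, the paper proposes cascading two CNN models obtained through transfer learning and fusing the CNN features with low-level features.

Transfer learning reuses the knowledge in a pre-trained model to improve performance on a new task. Here, the researchers likely start from CNNs pre-trained on a large public dataset such as ImageNet (for example VGG or ResNet) and fine-tune them on CSI images so that they adapt to the characteristics of those images. Low-level features typically include color, texture, and shape. They matter for identifying key elements in an image, particularly when searching for clues in crime scene investigation. Fusing the high-level semantic features of the CNN with low-level features can strengthen the model's ability to capture image detail and improve retrieval accuracy. The approach may involve a feature pyramid network (FPN) or a multi-scale feature fusion strategy, so that the model can use global and local information together.

In CSI image retrieval, finding relevant evidence accurately and quickly is essential. Traditional content-based image retrieval methods tend to rely on hand-crafted features, which may not cope with complex and highly variable crime scenes. Deep learning, and CNNs in particular, can learn and extract richer image features automatically, which improves retrieval efficiency and accuracy.

The experimental section likely describes how the two CNN models are trained and evaluated, including the evaluation metrics used (such as precision, recall, and F1 score) and comparisons with existing methods. It may also discuss how different feature fusion strategies affect the results, along with the potential and the limitations of the method in real crime scene investigation. Overall, the paper offers a new image retrieval solution for crime scene investigation: by combining deep learning with low-level features, it aims to make the search for crime evidence more efficient and more accurate.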

Image Retrieval using CNN and Low-level Feature Fusion for Crime Scene Investigation Image Database

Ying Liu 1,2,3, Yanan Peng 1,*, Dan Hu, Daxiang Li 1,2,3, Keng-Pang Lim, Nam Ling

1 Center for Image and Information Processing, Xi’an University of Posts and Telecommunications, Xi’an 710121, China

2 Key Laboratory of Electronic Information Application Technology for Scene Investigation, Ministry of Public Security, Xi’an, 710121, China

3 International Joint Research Center for Wireless Communication and Information Processing, Shaanxi, Xi’an, 710121, China

4 Department of Computer Engineering, Santa Clara University, California, 95053, USA

Abstract: Crime scene investigation (CSI) image retrieval is used to search for crime evidence and is critical in helping to solve various crimes. In recent years, Convolutional Neural Networks (CNNs) have demonstrated outstanding performance in large-scale image database retrieval. However, the limited number of CSI images makes a CNN model prone to over-fitting during training. To prevent this, this paper proposes to cascade two CNN models obtained through transfer learning and to combine CNN features with low-level image features to better describe CSI images. First, two pre-trained CNN models are fine-tuned using the target image set. CNN features are extracted from the fully connected layer of each model and concatenated as high-level features for the image. These concatenated CNN features are then fused with the low-level image features of the target image set. The final fused image features are used for image retrieval. Experimental results on a CSI image database demonstrate the effectiveness of the proposed algorithm when the training set is limited. In addition, experiments carried out on the GHIM-10K database demonstrate the generalizability of the proposed algorithm.
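The pipeline described in the abstract (concatenate the features of two CNNs, fuse them with low-level features, then retrieve by similarity) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the tiny feature vectors stand in for real fully connected layer outputs, `cnn_weight` is a hypothetical balancing parameter, and the abstract does not say which fusion rule or similarity measure the paper uses, so weighted concatenation and cosine similarity are assumed here.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit Euclidean length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse_features(cnn_a, cnn_b, low_level, cnn_weight=0.5):
    """Concatenate two CNN feature vectors and fuse them with low-level features.

    cnn_weight is a hypothetical parameter (not from the paper) that balances
    the high-level part against the low-level part of the fused vector.
    """
    high = l2_normalize(list(cnn_a) + list(cnn_b))  # concatenated CNN features
    low = l2_normalize(low_level)                   # low-level descriptor
    w_high = math.sqrt(cnn_weight)
    w_low = math.sqrt(1.0 - cnn_weight)
    return [w_high * x for x in high] + [w_low * x for x in low]


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0


def retrieve(query, database, top_k=3):
    """Return the ids of the top_k database images most similar to the query."""
    scored = [(cosine_similarity(query, feat), image_id)
              for image_id, feat in database.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [image_id for _, image_id in scored[:top_k]]


# Toy data: 2-D "CNN" features from each model and a 3-D low-level descriptor.
database = {
    "img_a": fuse_features([0.9, 0.1], [0.8, 0.2], [1.0, 0.0, 0.0]),
    "img_b": fuse_features([0.1, 0.9], [0.2, 0.8], [0.0, 1.0, 0.0]),
    "img_c": fuse_features([0.7, 0.3], [0.6, 0.4], [0.5, 0.5, 0.0]),
}
query = fuse_features([0.85, 0.15], [0.75, 0.25], [0.9, 0.1, 0.0])
ranking = retrieve(query, database, top_k=3)
print(ranking)
```

Normalizing each part before concatenation keeps either feature type from dominating only because it has more dimensions or a larger scale. With the square-root weights the fused vector also has unit length, so the cosine similarity reduces to a weighted sum of the two per-part similarities.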

I. INTRODUCTION

Crime scene investigation (CSI) images are an important part of the information collected at crime scenes. Classification and retrieval of CSI images provide important clues and play an important role in solving serial crimes [1]. There is therefore an urgent need for an automatic and effective image classification and retrieval system that can quickly find relevant images among a large number of CSI images, improving the efficiency of an investigation while saving manpower and material resources.

Currently, there are few studies on CSI image retrieval. Existing CSI image retrieval technologies can be divided into two categories: those based on low-level features and those based on high-level semantics. Retrieval based on low-level features uses a content-based image retrieval (CBIR) framework to extract low-level features of the image (such as the color histogram, gray-level co-occurrence matrix, Gabor features, and wavelet texture features) or to fuse different low-level features, and this work confirms the feasibility of CBIR technology in CSI image retrieval [2, 3]. In [4], the authors propose combining several low-level features to improve CSI image retrieval performance: dominant color descriptors as color features, the gray-level co-occurrence matrix as texture features, and edge features obtained by gradient vector flow. The disadvantage is that the computation is complex and slow. In [5], an image retrieval method based on regional semantic templates is proposed. First, the user submits the query image and the region of interest, from which a regional semantic template is constructed and pre-classification is performed. Finally, the images are ranked. Experiments show that the algorithm effectively improves the accuracy of CSI image retrieval. Ref. [6] proposes a two-layer CSI image retrieval framework. First, the feature database corresponding to the CSI image database is computed, and a support vector machine (SVM) classifier capable of multi-semantic classification is pre-trained. After the investigator submits a query image, the SVM automatically determines its semantic category from the image features, and matching is then performed only on the part of the image library that belongs to that category. Experimental results show that this method outperforms the Query By Example (QBE) method on multiple retrieval metrics and cuts retrieval time by half. Introducing relevance feedback (RF) into CSI image retrieval is also effective. In [7], RF is used to automatically adjust the weights of shoe print features to improve precision. Although the above methods achieve some good results, they do not bridge the “semantic gap” between low-level features and high-level semantics; bridging it may improve the accuracy of image retrieval significantly.
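Two of the low-level features named above, the color histogram and the gray-level co-occurrence matrix (GLCM), can be illustrated with a short pure-Python sketch. The quantization settings, the single horizontal pixel offset, and the choice of contrast and energy as GLCM statistics are assumptions made for illustration; the cited works may use different parameters and statistics.

```python
def color_histogram(pixels, bins_per_channel=2):
    """Normalized joint RGB histogram; pixels is a list of (r, g, b) values in 0..255."""
    bins = bins_per_channel
    hist = [0.0] * (bins ** 3)
    for r, g, b in pixels:
        # Map each channel value to a bin index in 0..bins-1.
        rq, gq, bq = r * bins // 256, g * bins // 256, b * bins // 256
        hist[(rq * bins + gq) * bins + bq] += 1.0
    total = len(pixels)
    return [count / total for count in hist] if total else hist


def glcm(gray, levels, dx=1, dy=0):
    """Normalized (non-symmetric) co-occurrence matrix for one pixel offset (dx, dy)."""
    matrix = [[0.0] * levels for _ in range(levels)]
    rows, cols = len(gray), len(gray[0])
    pairs = 0
    for y in range(rows):
        for x in range(cols):
            ny, nx = y + dy, x + dx
            if 0 <= ny < rows and 0 <= nx < cols:
                matrix[gray[y][x]][gray[ny][nx]] += 1.0
                pairs += 1
    return [[v / pairs for v in row] for row in matrix] if pairs else matrix


def glcm_contrast(p):
    """Sum of (i - j)^2 * p[i][j]: large when neighboring gray levels differ strongly."""
    n = len(p)
    return sum(((i - j) ** 2) * p[i][j] for i in range(n) for j in range(n))


def glcm_energy(p):
    """Sum of squared entries: large when the texture is uniform."""
    return sum(v * v for row in p for v in row)


# Toy 4x4 image already quantized to 4 gray levels.
gray = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]
p = glcm(gray, levels=4)
contrast = glcm_contrast(p)
energy = glcm_energy(p)

# Toy color image given as a flat list of RGB pixels.
pixels = [(255, 0, 0), (250, 10, 5), (0, 0, 255), (0, 0, 0)]
hist = color_histogram(pixels)

# One simple low-level descriptor: histogram bins followed by texture statistics.
low_level = hist + [contrast, energy]
print(len(low_level), contrast, energy)
```

A descriptor built this way is the kind of vector a CBIR system compares between a query image and database images. In practice the GLCM is usually computed for several offsets and directions and the resulting statistics are averaged or concatenated.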

Following the pioneering work of Hinton et al. [8] in 2006, deep learning has developed rapidly over the past decade. Several types of deep learning models, such as convolutional neural networks (CNN) and deep belief networks (DBN), have been applied to digit recognition [9], image classification [10], face recognition [11], and other applications with unprecedented success. Deep learning also has a wide range of applications in image classification and retrieval. For example, in the ImageNet competition, the top-5 accuracy achieved with traditional classifiers was 71.8% in 2010 and 74.3% in 2011. In 2012, Hinton and his student Alex Krizhevsky et al. used deep learning to raise the accuracy to 84.7%. In the 2017 competition, the final accuracy was as high as 97.3%. A. Babenko and J. Donahue [12, 13] extracted features from the CNN fully connected layer as high-level semantic features for retrieval; image features have also been extracted from the convolutional layer for retrieval [14], with good results. Juan A. Carvajal


Proceedings, APSIPA Annual Summit and Conference 2018, 12-15 November 2018, Hawaii
