UAV影像实时车辆检测：单次拍摄与多尺度特征融合

6 浏览量更新于2024-08-25 收藏 1.22MB PDF 举报

"这篇研究论文探讨了通过无人机图像进行实时车辆检测的技术挑战和解决方案，重点关注在无人驾驶航空器（UAV）图像中实现快速且准确的车辆检测。文章出自2018年IEEE第四届多媒体大数据国际会议（BigMM），由西电大学人工智能学院的学者们共同撰写。" 在当前的技术背景下，通过无人机图像进行实时车辆检测是具有重要意义但又极具挑战性的任务，广泛应用于交通监控、智能城市、公共安全等多个领域。然而，由于无人机视角下车辆的尺寸小、特征少、尺度变化大以及样本不平衡问题，现有的深度学习方法在准确性和速度上都无法达到理想效果，这构成了一个经典的性能与速度之间的权衡问题。论文提出了一个新的单一框架车辆检测器，旨在解决上述问题。首先，他们设计了一个多尺度特征融合模块，该模块结合了高分辨率但语义较弱的特征和低分辨率但语义较强的特征。这一融合策略旨在引入更丰富的上下文信息，增强对小目标（如无人机图像中的车辆）的识别能力。这种融合方式可以有效弥补不同尺度下的特征缺陷，提高检测精度。其次，为了实现实时检测，论文可能还涉及了优化网络结构和推理速度的策略。可能包括轻量级网络架构的设计，如使用MobileNet或YOLO系列的网络，以减少计算复杂度，同时保持良好的检测性能。此外，可能还利用了数据增强技术来处理样本不平衡问题，通过模拟不同光照、角度和遮挡条件的图像，增强模型对多样化场景的适应性。此外，论文可能会探讨训练策略，比如采用迁移学习，利用预训练在大规模数据集（如COCO或VID）上的模型权重，来初始化网络，从而加速收敛并提高检测效果。同时，可能会提到在线数据平衡策略，动态调整训练过程中的类权重，以改善小类别（如无人机图像中的车辆）的检测性能。最后，论文可能还评估了提出的车辆检测方法在多个公开的UAV数据集上的性能，对比了其他现有的车辆检测算法，展示了其在准确率、召回率和F1分数等指标上的优越性，并分析了在不同环境条件下的鲁棒性。通过这些实验结果，论文为无人机图像中的实时车辆检测提供了一种有效而实用的解决方案。这篇研究论文为无人机图像中的车辆检测提供了一个新的视角，通过创新的特征融合模块和优化的网络设计，实现了在保证检测准确性的同时，提高检测速度，对于推动无人机监控和智能交通等领域的发展具有重要的理论和实践意义。

展开

2018 IEEE Fourth International Conference on Multimedia Big Data (BigMM)

Real-time Vehicle Detection from UAV Imagery

Xuemei Xie*, Wenzhe Yang, Guimei Cao, Jianxiu Yang, Zhifu Zhao, Shu Chen, Quan Liao, Guangming Shi

School of Artiﬁcial Intelligence

Xidian University, Xi’an, China 710071

xmxie@mail.xidian.edu.cn

Abstract—Fast and accurate vehicle detection in unmanned

aerial vehicle (UAV) imagery is a meaningful but challenging task,

playing an important role in a wide range of applications. Due to

its tiny size, few features, variable scales and imbalance vehicle

sample problems in UAV imagery, current deep learning methods

used in this task cannot achieve a satisfactory performance both

in accuracy and speed, which is obvious a classical trade-off

problem. In this paper, we propose a single-shot vehicle detector,

which focuses on accurate and real-time vehicle detection in UAV

imagery. We make contributions in the following two aspects:

1) presenting a multi-scale feature fusion module to combine

the high resolution but semantically weak features with the low

resolution but semantically strong features, aiming to introduce

context information to enhance the feature representation of

the small vehicles; 2) proposing a dynamic training strategy

(DTS) which constructs the network to learn more discriminative

features of hard examples, via using cross entropy and focal loss

function alternately. Experimental results show that our method

can achieve 90.8% accuracy in UAV images and can run at

59 FPS on a single NVIDIA 1080Ti GPU for the small vehicle

detection in UAV images.

Index Terms—vehicle detection, unmanned aerial vehicle im-

agery, feature fusion, dynamic training strategy

I. INTRODUCTION

Nowadays, vehicle detection in unmanned aerial vehicles

(UAV) imagery plays a signiﬁcant role for a wide range of

applications [1]–[3]. However, there are some negative char-

acteristics in real-time vehicle detection from UAV imagery,

tiny objects, various orientation of the targets, and imbalance

samples, which lead to unsatisfactory performance both in

speed and accuracy.

Traditional methods are mainly based on the handcrafted

features [4], [5] and sliding window search algorithms [6], [7].

The handcrafted features cannot extract good semantic repre-

sentation. Some following studies [8], [9] exploit deep learning

methods to improve the feature representation capability com-

pared with handcrafted ones, bringing certain improvement

in detection accuracy. But there is still a gap to real-time

detection. Faster R-CNN [10], one of CNN-based detectors,

has achieved a good performance in UAV imagery [11]–

[14]. While, it has a limitation in speed due to its detection

mechanism. Subsequently, YOLOs [15], [16] are employed to

achieve real-time detection with lower accurate [17]. Due to

the wide range of view of UAV images, the vehicle objects

*This work is supported by Natural Science Foundation (NSF) of China

(Nos.61472301, 61632019), the Foundation for Innovative Research Groups of

the National Natural Science Foundation of China (No. 61621005), Ministry

of Education project (No. 6141A02011601).

are usually small, occluded and with complex background. In

the context of the situations, accurately detecting the vehicles

from UAV imagery is quite difﬁcult.

In this paper, we propose a single shot network using multi-

level feature fusion method which utilizes context information

efﬁciently and effectively, make a certain progress in accu-

racy and achieve real-time vehicle detection simultaneously.

Moreover, the extremely hard-easy class imbalance in UAV

dataset causes two problems as follows: 1) model training

is insufﬁcient for the categories which with a small amount

of examples, so that it is hard for the network to extract

representative features [18], [19]; 2) most easy samples will

overwhelm the total loss and gradients computation so the

network cannot learn the discriminative features well [20]. To

solve these, we design a dynamic training strategy (DTS)

to solve the imbalance problem and improve the network

detection performance.

To summarize, we present a single-shot detector, which

focuses on accurate and real-time vehicle detection from UAV

imagery. Speciﬁcally, our main contributions are as follows:

• We present a multi-scale feature fusion module to com-

bine the high resolution but semantically weak features

with the low resolution but semantically strong features,

which aims to introduce context information to enhance

feature representation of the small vehicles;

• We propose a dynamic training strategy (DTS) which

instruct the network to learn more discriminative features

of hard examples, via using cross entropy and focal loss

function alternately;

Experimental results show that our method can achieve

90.8% accuracy which is 7.5% and 3.1% higher than SSD

[21] and ReﬁneDet [22] respectively in UAV images. And the

proposed network can run at 59 FPS on a single NVIDIA

1080Ti GPU for the small vehicle detection.

II. RELATED WORK

A. UAV Vehicle Detector

Vehicle detection from UAV imagery has attracted extensive

research attention in past years. Moranduzzo et al. [23], Shao

et al. [4] and Kembhavi et al. [6] explore the vehicle detection

by using handcrafted features (e.g., Haar, HOG, SIFT, local

binary pattern, etc.) and intersection kernel SVM, which

make some progress. Xu et al. [14] improves original Viola-

Jones object detection scheme for better performance from

low-altitude UAV imagery. However, traditional handcrafted

下载后可阅读完整内容，剩余4页未读，立即下载

身份认证购VIP最低享 7 折!

30元优惠券

weixin_38704386

粉丝: 3

UAV影像实时车辆检测：单次拍摄与多尺度特征融合

无人机图像行人车辆检测：Yolo与SSD算法源码及部署指南

空中小目标检测数据集：无人机图像与YOLO目标检测

国内自制无人机航拍车辆检测数据集发布

人工智能大作业基于yolo和ssd算法实现无人机图像行人车辆检测源码+项目部署说明.zip

无人机拍摄车辆检测图像数据集

原创基于MATLAB图像处理的车辆检测与识别pdf-基于MATLAB图像处理的车辆检测与识别.pdf

三七出品--自制国内无人机航拍视角下车辆检测数据集

人工智能 无人机图像目标检测 .zip

基于图像空间金字塔检测模型的航空图像鲁棒车辆检测

人工智能大作业-无人机图像目标检测.zip

最新资源

人工智能无人机图像目标检测 .zip