Real Time Human Detection by Unmanned Aerial
Vehicles
Walid Guettala
Computer Science Department,
Biskra University, Algeria
walidguettala@gmail.com
Ali Sayah
Computer Science Department,
Biskra University, Algeria
Sayah.Ali@hotmail.com
Laid Kahloul
LINFI Laboratory, Computer Science Department,
Biskra University, Algeria
l.kahloul@univ-biskra.dz
Ahmed Tibermacine
LESIA Laboratory, Computer Science Department,
Biskra University, Algeria
ahmed.tibermacine@univ-biskra.dz
Abstract—Object detection, which locates and identifies instances of particular categories in images, is one of the most important problems in computer vision and remote sensing. Thermal infrared (TIR) remote sensing images and videos captured by unmanned aerial vehicles (UAVs) across diverse scenarios are a crucial data source for public security. Object detection in such data remains difficult due to the small scale of the targets, complex scene content, low resolution relative to visible-spectrum videos, and the scarcity of publicly available labeled datasets and pretrained models. This study proposes a UAV TIR object detection framework for images and videos. The CNN-based “You Only Look Once” (YOLO) model is trained on ground-based TIR images and videos collected with Forward-Looking Infrared (FLIR) cameras. On the validation task, human detection with the state-of-the-art YOLOv7 (YOLO version 7) model [1] reached an average precision of 72.5% at IoU (Intersection over Union) = 0.5, with a detection speed of about 161 frames per second (FPS). The application further demonstrates the usefulness of the YOLO architecture by evaluating the cross-detection performance of the YOLOv7 model on people in UAV TIR videos captured from different UAV observation angles. This work provides favorable qualitative and quantitative support for deep-learning-based object detection in TIR images and videos.
Index Terms—Human detection, Human tracking, Thermal Imaging, YOLOv7, UAV
I. INTRODUCTION
Unmanned aerial vehicle (UAV) object detection is a developing technology with a wide range of uses, including aerial image analysis, intelligent surveillance, and route inspection [2, 3]. Object detection has advanced considerably in recent years: deep neural networks (DNNs), in particular convolutional neural networks (CNNs) [4], have achieved record-breaking performance in computer vision applications such as object recognition [5], driven by the introduction of large-scale visual datasets and greater computing power. Given the UAV’s unique viewpoint, however, detection remains a difficult task.
Object detection approaches fall into two families: “deep-learning-based object detection” [6] and “conventional manual feature-based object detection” [7]. The latter centers on hand-designing target-feature extraction techniques; because such features struggle to satisfy varied constraints, these approaches are mostly confined to specific environments [8]. Deep-learning-based techniques, on the other hand, now achieve real-time detection while continuing to improve in accuracy as computing technology advances.
Although deep-learning-based techniques have significantly advanced object recognition, missed detections still occur in UAV imagery. Two key factors contribute to these problems: (1) the network’s receptive field is not sufficiently robust to small objects and thermal imaging, and (2) training datasets rarely cover the UAV viewpoint. In general, the object feature representation and the associated training dataset are crucial for improving object detection performance. Additionally, the trade-off between accuracy and processing speed is decisive for real-world applications.
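Concretely, accuracy in this work is reported as average precision at IoU (Intersection over Union) = 0.5: a predicted box counts as a correct detection only when it overlaps a ground-truth box with IoU of at least 0.5. The minimal Python sketch below illustrates this matching criterion; the (x1, y1, x2, y2) corner format and the example coordinates are illustrative assumptions, not values from our dataset.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes in (x1, y1, x2, y2) form."""
    # Corners of the intersection rectangle.
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    # Union = sum of the two box areas minus the intersection.
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# Hypothetical predicted person box vs. its ground-truth annotation.
pred, gt = (48, 120, 84, 210), (50, 118, 88, 205)
print(iou(pred, gt))         # ~0.79
print(iou(pred, gt) >= 0.5)  # True: counted as a true positive at IoU = 0.5
```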
Motivated by these issues, we develop an object detection technique based on the “You Only Look Once” (YOLO) principle [9], focusing on the detection of small objects. To improve detection performance on small objects, we gather data from UAV viewpoints and adapt the YOLOv7 network to our dataset through transfer learning. Our study’s contributions are: (1) a UAV-perspective dataset for person detection that can be used to improve human detection; and (2) an enhanced YOLO network architecture that expands the receptive field and further improves small-person detection via transfer learning, as sketched below.
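A minimal sketch of this transfer-learning step is given below, assuming PyTorch and a COCO-pretrained checkpoint file named yolov7.pt whose layout exposes the model under a “model” key (as in the official YOLOv7 release checkpoints); the fraction of frozen layers and the optimizer settings are illustrative, not our exact training configuration.

```python
# Illustrative transfer-learning sketch, not the exact YOLOv7 training
# pipeline: keep the pretrained backbone features frozen and fine-tune
# the remaining layers on the single-class (person) UAV TIR dataset.
import torch

# Load a COCO-pretrained checkpoint; the "model" key holding an nn.Module
# follows the layout of the official YOLOv7 release checkpoints (assumed).
ckpt = torch.load("yolov7.pt", map_location="cpu")
model = ckpt["model"].float()

# Freeze roughly the first half of the parameters (the backbone) so its
# generic features are preserved; the exact split point is an assumption.
params = list(model.parameters())
for p in params[: len(params) // 2]:
    p.requires_grad = False

# Optimize only the layers left trainable (neck and detection head).
optimizer = torch.optim.SGD(
    (p for p in params if p.requires_grad),
    lr=1e-3,
    momentum=0.9,
)
```

The full training run goes through the YOLOv7 repository’s own training pipeline; the sketch only isolates the idea of preserving pretrained features while adapting the detector to the UAV TIR domain.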
The remainder of this article is structured as follows: Section 2 introduces the related work; Section 3 explains the experimental setup, presents the experimental findings, and discusses a detailed comparative analysis; and Section 4 offers concluding remarks.
II. RELATED WORK
A considerable number of works in the literature address the challenging task of object detection. This section briefly discusses notable approaches and methods.