Improving Low-Resolution Pothole Detection with YOLOv7 and ESRGAN


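This paper examines how deep learning and computer vision techniques can be combined to make pothole detection faster and more accurate. It focuses on a challenge that is common in practice: object detection on images from low-resolution cameras or on low-resolution images. The authors propose combining an Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) with the You Only Look Once (YOLO) v7 network to address pothole detection on low-quality images.

Convolutional neural networks (CNNs) have become a key tool in road safety, particularly for object detection. CNN-based detectors such as YOLOv7 perform well in object recognition thanks to their efficiency and real-time speed, but their performance can degrade when the input resolution is low. The paper therefore studies how ESRGAN can enhance image quality so that pothole features can still be captured under low-resolution conditions.

ESRGAN is an image super-resolution technique built on the generative adversarial network (GAN) principle: adversarial training produces a high-resolution approximation of the input image. Applied to pothole detection, it helps recover detail in low-quality images and thereby improves detection accuracy. The advantage of this approach is that, even with limited devices or resources, it supplies a relatively sharp input image so that YOLOv7 can better identify and localize potholes.

The authors validate the method experimentally. They first run YOLOv7 as a baseline on low-quality and high-quality dashcam images, then compare speed and accuracy after the different processing steps. According to the reported results, ESRGAN preprocessing improved detection speed and markedly improved detection accuracy under low-resolution conditions. This suggests that, despite the additional super-resolution step, the overall system keeps a good balance between runtime and performance.

The key contribution of the study is the integration of YOLOv7 and ESRGAN to improve automatic pothole detection on low-quality visual input. This is of practical value for road monitoring and autonomous driving, especially in resource-constrained settings where it can improve driving safety. The work shows how deep learning and image enhancement can complement each other, and it points to a research direction for intelligent transportation systems that operate in real time and in complex environments.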
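The baseline comparison described above (detection accuracy and speed with and without upscaling) can be illustrated with a small scoring sketch. The excerpt does not say which metrics the authors used, so everything here is an assumption: the intersection-over-union (IoU) matching rule, the 0.5 threshold, and the helper names `iou`, `precision_recall`, and `mean_latency_seconds` are hypothetical and chosen only because they are a common convention for object detection.

```python
import time
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) in pixels


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def precision_recall(
    predicted: List[Box], ground_truth: List[Box], iou_threshold: float = 0.5
) -> Tuple[float, float]:
    """Greedy one-to-one matching of predictions to ground-truth boxes.

    Assumption: a prediction counts as correct when its best remaining
    ground-truth box overlaps it with IoU >= iou_threshold.
    """
    unmatched = list(ground_truth)
    true_positives = 0
    for pred in predicted:
        best = max(unmatched, key=lambda gt: iou(pred, gt), default=None)
        if best is not None and iou(pred, best) >= iou_threshold:
            unmatched.remove(best)  # each ground-truth box is matched at most once
            true_positives += 1
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(ground_truth) if ground_truth else 0.0
    return precision, recall


def mean_latency_seconds(run_once: Callable[[], object], repeats: int = 10) -> float:
    """Average wall-clock time of `run_once`, e.g. one upscale-plus-detect call."""
    start = time.perf_counter()
    for _ in range(repeats):
        run_once()
    return (time.perf_counter() - start) / repeats
```

Running the same two helpers on the low-quality, high-quality, and upscaled image sets would give comparable accuracy and latency figures for each configuration.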

Improved Pothole Detection Using YOLOv7 and ESRGAN

Nirmal Kumar Rout, Gyanateet Dutta, Varun Sinha, Arghadeep Dey, Subhrangshu Mukherjee, Gopal Gupta

Abstract- Potholes are common road hazards that damage vehicles and pose a safety risk to drivers. Convolutional Neural Networks (CNNs) are widely used in industry for object detection based on Deep Learning methods and have seen significant progress through hardware improvements and software implementations. In this paper, we propose an algorithm that enables the use of low-resolution cameras, or low-resolution images and video feeds, for automatic pothole detection by applying Super Resolution (SR) through Super Resolution Generative Adversarial Networks (SRGANs). We first establish a baseline pothole detection performance on low-quality and high-quality dashcam images using a You Only Look Once (YOLO) network, namely the YOLOv7 network. We then illustrate and examine the speed and accuracy gained over this baseline after upscaling the low-quality images.

Index Terms- CNN, Deep Learning, Pothole Detection, YOLOv7, ESRGAN, Transfer Learning.



1 INTRODUCTION

POTHOLES are a major issue on roads worldwide, causing damage to vehicles and posing a safety risk to drivers. Automated pothole detection systems can help identify and repair potholes more efficiently, but the use of low-resolution cameras or low-quality video feeds can be a challenge. In this paper, we propose a novel approach for improving the performance of pothole detection with low-resolution cameras or low-quality images and video feeds. Our approach uses an Enhanced Super Resolution Generative Adversarial Network (ESRGAN) [1] to enhance the resolution of low-quality images and video feeds, and then applies the You Only Look Once (YOLOv7) [2] object detection algorithm to detect potholes in the enhanced images. We compare the speed and accuracy of our approach with a baseline pothole detection system that uses YOLOv7 on high-quality images and show that it provides a significant improvement in both areas. We also demonstrate that our approach can be applied to a range of different road conditions and pothole types.

One of the major advantages of our approach is its cost-effectiveness. ESRGAN can improve the resolution of low-quality images and video feeds from low-cost cameras, removing the need for high-resolution cameras with expensive sensors. This can greatly reduce the cost of implementing pothole detection systems, especially in resource-constrained settings.

To validate the effectiveness of our approach, we conduct a series of experiments on a medium-sized dataset of dash-cam images and video feeds from a variety of international locations, which represent real-life scenarios. Our results show that the use of ESRGAN and YOLOv7 can significantly improve the performance of pothole detection systems and provide a reliable solution for detecting potholes in low-resolution images and video feeds. This has the potential to greatly enhance the efficiency and effectiveness of pothole repair efforts and improve road safety for drivers worldwide.
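The two-stage flow described in this section (enhance the frame first, then detect on the enhanced frame) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `nearest_neighbor_upscale` is a placeholder standing in for a trained ESRGAN generator, `dark_region_detector` is a toy stand-in for YOLOv7, and all function names are hypothetical. The one point the sketch is meant to show is that boxes found on the upscaled frame have to be divided by the scale factor to land in the original frame's coordinates.

```python
from typing import Callable, List, Tuple

Image = List[List[int]]                  # grayscale image as rows of pixel values
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) in pixels


def nearest_neighbor_upscale(image: Image, scale: int) -> Image:
    """Placeholder upscaler: repeats each pixel `scale` times along both axes.

    Assumption: a real pipeline would call a trained ESRGAN generator here.
    """
    upscaled: Image = []
    for row in image:
        wide_row = [pixel for pixel in row for _ in range(scale)]
        upscaled.extend(list(wide_row) for _ in range(scale))
    return upscaled


def dark_region_detector(image: Image, threshold: int = 50) -> List[Box]:
    """Toy detector (stand-in for YOLOv7): one box around all dark pixels."""
    coords = [
        (x, y)
        for y, row in enumerate(image)
        for x, pixel in enumerate(row)
        if pixel < threshold
    ]
    if not coords:
        return []
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return [(min(xs), min(ys), max(xs) + 1, max(ys) + 1)]


def detect_on_upscaled(
    image: Image,
    scale: int,
    upscale: Callable[[Image, int], Image],
    detect: Callable[[Image], List[Box]],
) -> List[Box]:
    """Upscale, detect, then map boxes back to the original image coordinates."""
    enlarged = upscale(image, scale)
    boxes = detect(enlarged)
    return [tuple(coord / scale for coord in box) for box in boxes]


if __name__ == "__main__":
    # 4x4 bright frame with a two-pixel dark patch in row 2, columns 1 and 2.
    frame = [[200] * 4 for _ in range(4)]
    frame[2][1] = frame[2][2] = 10
    print(detect_on_upscaled(frame, 4, nearest_neighbor_upscale, dark_region_detector))
    # prints [(1.0, 2.0, 3.0, 3.0)]
```

Passing the upscaler and detector as callables keeps the sketch independent of any particular model library; real ESRGAN and YOLOv7 wrappers could be substituted as long as they match the same signatures.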

• Nirmal Kumar Rout is with School of Electronics Engineering, KIIT University, Bhubaneswar, India. E-mail: nkrout@kiit.ac.in.

• Gyanateet Dutta, Varun Sinha, Subhrangshu Mukherjee, Arghadeep Dey, Gopal Gupta are with School of Electronics Engineering, KIIT University, Bhubaneswar, India. E-mail: {1930198, 1930055, 1930053, 1930069, 1930020}@kiit.ac.in.

2 RELATED WORKS

A number of approaches have been proposed in the literature for automated pothole detection. Earlier approaches [3] required 3-D equipment, which can be very expensive and is not suitable for all purposes. These techniques frequently use image data captured by digital cameras [4, 5], depth cameras, thermal technology, and lasers. Recent approaches rely on machine learning and deep learning algorithms to process images and detect potholes. Techniques based on convolutional neural networks (CNNs) are widely used to extract pothole features from images because they can accurately model non-linear patterns, perform automatic feature extraction, and are robust to unnecessary noise and other varying conditions in road images [6]. Although CNNs have been used in many approaches [7, 8, 9], they are ineffective in certain scenarios, such as detecting objects that are small relative to the image. This can be addressed by using high-resolution images for detection, but the computational cost of processing them is then too high, because CNNs consume a lot of memory and require significant computation time. To address this issue, Chen et al. [10] suggested using smaller input images or image patches from HR images to train the network. The first method is a two-phase system in which a localization network (LCNN) first locates the frame segment containing the pothole, and a part-based classification network (PCNN) then predicts the classes. A recent study by Salcedo et al. [11] developed a road maintenance prioritization system for India using deep learning models such as UNet (with ResNet34 as the encoder), EfficientNet, and YOLOv5 on the Indian Driving Dataset (IDD). The study by Silva et al. [11] employed YOLOv4 to detect road damage in a dataset of images taken from the overhead view of an airborne drone. It experimentally evaluated the accuracy and applicability of YOLOv4 for recognizing highway road damage and reported an accuracy of 95%. The work of Mohammad et al. [12] comprised a system running on an edge platform, the AI kit (OAK-D), with frameworks such as YOLOv1, YOLOv2, YOLOv3, YOLOv4, Tiny-YOLOv5, and SSD-MobileNet V2. In the work Anup et al. [13] proposed a 1D Convolutional Neural
