DiffYOLO: Object Detection for Anti-Noise via YOLO
and Diffusion Models
Yichen Liu
liuyichen21@mails.ucas.ac.cn
Huajian Zhang
zhanghj@impcas.ac.cn
Daqing Gao
gaodq@impcas.ac.cn
Abstract
Object detection models represented by the YOLO series have been widely used and have achieved excellent results on high-quality datasets, but not all working conditions are ideal. To solve the problem of locating targets on low-quality datasets, existing methods either train a new object detection network or require a large collection of low-quality data for training. In this paper, we propose a framework applied to YOLO models, called DiffYOLO. Specifically, we extract feature maps from denoising diffusion probabilistic models to enhance well-trained models, which allows us to fine-tune YOLO on high-quality datasets and test on low-quality datasets. The results show that this framework can not only improve performance on noisy datasets, but also improve detection results on high-quality test datasets. We will supplement more experiments later (with various datasets and network architectures).
1 Introduction
YOLO has become prevalent in object detection tasks, from autonomous driving to medical image processing. Alice Froidevaux et al. used YOLO to detect vehicles in satellite images [3]; Sudipto Paul et al. applied YOLO to brain cancer recognition on MRI images [13]; Ethan Grooby et al. explored automated facial landmark detection using YOLO [7]. Although YOLO has achieved great success in object detection tasks, capturing objects in noisy images remains a great challenge. Object detection models are normally trained on high-quality images, but test conditions may not be so ideal. Fig. 1 shows that a YOLO model well trained on high-quality datasets yields poor detection results on noisy test images. If models trained on high-quality datasets could perform well on noisy test sets with simple enhancements, the trained models could be better utilized.
Transfer learning from pretrained models is an important way to make full use of them. It first appeared in language models as fine-tuning [9], bringing many benefits, such as making training more efficient and less dependent on high-quality training sets. We therefore hope to find a method that leverages other well-trained models to improve the performance of YOLO models.
Denoising diffusion probabilistic models (DDPMs), put forward by Sohl-Dickstein et al., have shown great advantages in many generation tasks [15, 8]. Othmane Laousy et al. demonstrated that the diffusion method is not susceptible to perturbations [10], so we decided to incorporate the diffusion model into the YOLO model.
Therefore, we propose a framework in this paper, called DiffYOLO, for improving the noise resistance of models already trained on high-quality datasets. We first extract features from the U-Net of an already trained diffusion model, fuse them, and then splice them into the neck module of YOLO. The features extracted by such a diffusion model can improve the YOLO model to obtain
Preprint. Work in progress.
arXiv:2401.01659v1 [cs.CV] 3 Jan 2024
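The extract-fuse-splice step described above can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the nearest-neighbour upsampling, and the toy feature shapes are all our assumptions, standing in for U-Net activations at several decoder scales that must be brought to one resolution before concatenation into the YOLO neck.

```python
import numpy as np

def upsample_nearest(fmap, size):
    """Nearest-neighbour upsampling of a (C, H, W) feature map to (C, size, size).

    Assumes the target size is an integer multiple of the input size.
    """
    c, h, w = fmap.shape
    assert size % h == 0 and size % w == 0
    return fmap.repeat(size // h, axis=1).repeat(size // w, axis=2)

def fuse_unet_features(feature_maps, out_size):
    """Fuse multi-scale diffusion U-Net features (hypothetical helper):
    upsample every map to a common resolution, then concatenate along
    the channel axis so the result can be spliced into a detector neck.
    """
    upsampled = [upsample_nearest(f, out_size) for f in feature_maps]
    return np.concatenate(upsampled, axis=0)

# Toy feature maps standing in for U-Net activations at three scales.
rng = np.random.default_rng(0)
feats = [rng.standard_normal((8, s, s)) for s in (8, 16, 32)]
fused = fuse_unet_features(feats, out_size=32)
print(fused.shape)  # (24, 32, 32): 8 + 8 + 8 channels at the finest scale
```

In a real network the concatenation would typically be followed by a learned 1x1 convolution to project the fused channels to whatever width the YOLO neck expects; the sketch stops at the fusion step itself.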