EFPN：针对小目标检测的扩展特征金字塔网络

特征融合

目标检测

需积分: 10 145 浏览量更新于2024-07-16 收藏 5.39MB PDF 举报

身份认证购VIP最低享 7 折!

领优惠券(最高得80元）

本文主要探讨了"小目标检测"这一领域的研究挑战，特别是在仅凭少量像素提取小目标信息时的困难。作者针对现有的特征金字塔网络（Feature Pyramid Network, FPN）进行了扩展，提出了一种新型的深度学习模型——增强型特征金字塔网络（Extended Feature Pyramid Network, EFPN）。EFPN特别关注于解决小目标检测中的性能问题，通过引入新的设计元素来优化处理能力。首先，EFPN的关键创新在于引入了一个额外的高分辨率金字塔层级，专为小目标检测而设计。这个设计允许模型在不同尺度层次上更精细地捕捉和解析小物体的细节，从而提高检测精度。这与传统FPN相比，更加注重特征的精确度而非简单的一维融合。其次，为了进一步提升性能，文中提出了一个名为“特征纹理转移”（Feature Texture Transfer, FTT）的模块。这个模块的作用是通过对特征进行超分辨率处理，同时提取出可信的区域细节，增强了对小目标特征的敏感性和区分度。超分辨率技术在此发挥了重要作用，帮助模型在小目标的边缘和形状上获得更清晰的表示。此外，针对小目标检测任务中前景和背景区域不平衡的问题，文章设计了一种新的损失函数，旨在平衡前景和背景的检测效果。这种平衡策略有助于减少误报和漏报，提高了整体检测的稳健性。在实验部分，作者证明了EFPN在计算效率和内存占用方面具有优势，同时在交通标志（Traffic Sign）等小目标数据集上取得了最先进的性能。这表明EFPN不仅提升了小目标检测的准确性，还兼顾了实际应用中的资源管理需求。总结来说，这篇论文深入探讨了小目标检测中面临的技术挑战，并通过EFPN和FTT模块的创新设计，有效地解决了小目标的识别和定位问题。通过结合高分辨率特征和平衡的损失函数，EFPN为当前的计算机视觉领域提供了有效的解决方案，为未来的小目标检测研究奠定了坚实的基础。

资源详情

资源推荐

4 Chunfang Deng, Mengmeng Wang, Liang Liu, and Yong Liu

pre-deﬁned anchor boxes. Recently, anchor-free frameworks [13,38,31,39] also be-

come increasingly popular. Despite of the development of deep object detectors,

small object detection remains an unsolved challenge. Dilated convolution [34]

is introduced in [23,17,16] to augment receptive ﬁelds for multi-scale detection.

However, general detectors tend to focus more on improving the performance

of easier large instances, since the metric of general object detection is average

precision of all scales. Detectors specialized for small objects still need more

exploration.

2.2 Cross-Scale Features

Utilizing cross-scale features is an eﬀective way to alleviate the problem arising

from object scale variation. Building image pyramids is a traditional approach to

generating cross-scale features. Use of features from diﬀerent layers of network

is another kind of cross-scale practice. SSD [24] and MS-CNN [4] detect objects

of diﬀerent scales on diﬀerent layers of CNN backbone. FPN [19] constructs

feature pyramids by merging features from lower layers and higher layers via

a top-down pathway. Following FPN, FPN variants explore more information

pathways in feature pyramids. PANet [22] adds an extra down-top pathway to

pass shallow localization information up. G-FRNet [1] introduces gate unit on

the pathway, which passes crucial information and block ambiguous information.

NAS-FPN [6] delves into optimal pathway conﬁguration using AutoML. Though

these FPN variants improve the performance of multi-scale object detection, they

continue to use the same number of layers as original FPN. But these layers are

not suitable for small object detection, which leads to still poor performance of

small objects.

2.3 Super-Resolution in Object Detection

Some studies introduce SR to object detection, since small object detection al-

ways beneﬁts from large scales. Image-level SR is adopted in some speciﬁc situa-

tions where extremely small objects exist, such as satellite images [15] and images

with crowded tiny faces [2]. But large-scale images are burdensome for subse-

quent networks. Instead of super-resolving the whole image, SOD-MTGAN [3]

only super-resolves the area of RoIs, but large quantities of RoIs still need con-

siderable computation. The other way of SR is to directly super-resolve features.

Li et al. [14] use Perceptual GAN to enhance features of small objects with the

characteristics of large objects. STDN [37] employs sub-pixel convolution on top

layers of DenseNet [12] to detect small objects and meanwhile reduce network

parameters. Noh et al. [25] super-resolve the whole feature map and introduce

supervision signal to training process. Nevertheless, above-mentioned feature SR

methods are all based on restricted information from a single feature map. Re-

cent reference-based SR methods [35,36] have capacity of enhancing SR images

with textures or contents from reference images. Enlightened by reference-based

SR, we design a novel module to super-resolves features under the reference of

剩余15页未读，继续阅读

Activewaste

粉丝: 1w+
资源: 1

EFPN：针对小目标检测的扩展特征金字塔网络

Feature Pyramid Networks for Object Detection.pdf

Pyramid Feature Attention Network for Saliency detection.ppt

推荐100个以上比较好的目标检测模型

feature pyramid networks for object detection

1612.03144.pdf

用中文说一下Pyramid Feature Attention Network for Saliency detection针对的问题和解决方法

yolox+BiFPN

yolov7+bifpn

帮我列举十五篇左右的近五年来欧美人关于基于深度学习的目标检测以及YOLOv3的参考文献

FPN特征金字塔网络文献引用

feature pyramid networks for o

计算机视觉——多尺度模型架构 参考文献

超高分辨率图像目标检测的相关参考文献

可以写改进的YOLO:AF-FPN替换金字塔模块提升目标检测精度方法的PYTHON代码吗

CE-FPN代码复现

润色并优化：SPP-Net（Spatial Pyramid Pooling Network）是一种用于图像分类的卷积神经网络架构，主要思想是在卷积神经网络中添加空间金字塔池化层，提高网络的感受野 ，从而适应不同大小的输入图像。

给我推荐20个比较流行的目标检测算法模型

推荐40个以上比较好的目标检测模型

推荐20个以上比较好的目标检测模型

最新资源

计算机视觉——多尺度模型架构参考文献

润色并优化：SPP-Net（Spatial Pyramid Pooling Network）是一种用于图像分类的卷积神经网络架构，主要思想是在卷积神经网络中添加空间金字塔池化层，提高网络的感受野，从而适应不同大小的输入图像。