IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, VOL. 14, NO. 8, AUGUST 2021 4
nature of transformer models and detection accuracy remains a
crucial research scope in the current field of computer vision.
C. Combination of CNN and Transformer
In object detection, CNNs and transformers have distinct
applications and advantages. CNNs are known for their strong
image feature extraction, efficient multichannel processing, and capacity to learn spatial correlations.
However, CNN-based models have limitations in handling
objects of different sizes and proportions due to fixed window
sizes and strides. On the other hand, transformers exhibit
excellent performance in capturing long-range dependencies
within input sequences without prior knowledge, albeit at a
slower speed and requiring substantial amounts of training
data. The two architectures are thus complementary along several dimensions, and researchers have already explored numerous methodologies to combine them.
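The contrast in receptive-field growth can be made concrete with a back-of-the-envelope calculation (a hedged sketch; the kernel sizes and layer counts below are illustrative and not drawn from the paper):

```python
def conv_receptive_field(kernel_size: int, layers: int) -> int:
    """Receptive field of `layers` stacked stride-1 convolutions
    with a fixed kernel size (no dilation)."""
    return 1 + layers * (kernel_size - 1)

def layers_to_span(n_positions: int, kernel_size: int = 3) -> int:
    """Stacked conv layers needed before one output sees all positions."""
    return -(-(n_positions - 1) // (kernel_size - 1))  # ceiling division

# A single self-attention layer relates all N tokens pairwise, whereas a
# stack of 3x3 convolutions grows its receptive field only linearly:
print(conv_receptive_field(3, 5))   # 11
print(layers_to_span(224))          # 112
```

This linear growth is why CNNs with small fixed kernels struggle with long-range dependencies that one attention layer captures directly.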
The pioneering DETR model replaces fully connected and
convolutional layers with transformers while using ResNet
as the feature extractor, improving accuracy and efficiency.
Huawei’s CMTBlock combines depthwise separable convolu-
tion and the transformer’s multihead self-attention module for
local and global information fusion. The CMT model [44]
stacks the CMTBlock in a hybrid CNN-transformer structure.
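The local-plus-global pattern behind such hybrid blocks can be sketched in a few lines of numpy (a minimal toy, not Huawei's CMTBlock: single-head attention instead of multihead, plain depthwise convolution, and illustrative shapes throughout):

```python
import numpy as np

def depthwise_conv3x3(x, w):
    """Depthwise 3x3 convolution, stride 1, zero padding.
    x: (C, H, W); w: (C, 3, 3), one kernel per channel."""
    C, H, W = x.shape
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros_like(x)
    for c in range(C):
        for i in range(H):
            for j in range(W):
                out[c, i, j] = np.sum(xp[c, i:i+3, j:j+3] * w[c])
    return out

def self_attention(x, wq, wk, wv):
    """Single-head self-attention over spatial tokens.
    x: (N, C) token matrix; wq/wk/wv: (C, C) projections."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(x.shape[1])
    scores -= scores.max(axis=-1, keepdims=True)   # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=-1, keepdims=True)
    return attn @ v

def hybrid_block(x, conv_w, wq, wk, wv):
    """Local depthwise conv, then global attention, with residuals."""
    C, H, W = x.shape
    local = x + depthwise_conv3x3(x, conv_w)       # local information fusion
    tokens = local.reshape(C, H * W).T             # (H*W, C) token sequence
    out = tokens + self_attention(tokens, wq, wk, wv)  # global interaction
    return out.T.reshape(C, H, W)

rng = np.random.default_rng(0)
C, H, W = 8, 6, 6
y = hybrid_block(rng.normal(size=(C, H, W)),
                 rng.normal(size=(C, 3, 3)) * 0.1,
                 *(rng.normal(size=(C, C)) * 0.1 for _ in range(3)))
print(y.shape)  # (8, 6, 6)
```

The design choice the sketch captures is that the convolution enriches each token with its neighborhood before attention mixes all spatial positions, so the two operators specialize in local and global modeling, respectively.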
The Conformer [45] adopts a dual-network structure, where
the CNN branch enhances local perception of the transformer
branch. Mobile-Former [46] features parallel CNN and
transformer modules with bidirectional bridges, leveraging
MobileNet [47] for local processing and the transformer for
global interaction. However, networks or models employing
such hybrid structures face challenges in effectively balancing
accuracy and lightweight design. For instance, detectors such
as DETR, lacking FPN structures, exhibit suboptimal perfor-
mance in small object detection. While the CMT and Con-
former networks have proven effective in classification tasks,
their application to downstream tasks such as object detection
deviates from the realm of lightweight design. In contrast to the aforementioned models, which combine the two structures directly, an alternative approach applies transformer-style improvements within a pure CNN framework. ConvNeXt
[48] implements novel architectures and optimization strate-
gies similar to those of transformers, achieving competitive
results without attention structures. RepLKNet [49] employs
large convolutional kernels to widen the receptive field, thus
emulating the transformer-like capability for global feature
extraction. By investigating the computational principles of
transformers, ACMix [50] maps their operation process onto
convolutional operators, thereby combining them with tra-
ditional convolution operations to construct a novel CNN
architecture. ParC-Net [51] introduces circular convolution
for global information extraction within a pure convolutional
structure. Although these innovative networks may not achieve
SOTA performance, their greater significance lies in exploring
the factors contributing to the success of transformers from
a CNN perspective, providing inspiration for subsequent re-
search endeavors. The fusion of transformers and CNNs offers
a flexible and diverse range of integration methods. Future
research should strive to deepen the understanding of their
interactions to improve design and optimization.
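The circular-convolution idea can be illustrated with a toy 1-D numpy example (a hedged sketch under simplified assumptions; ParC-Net's actual operator is position-aware and two-dimensional):

```python
import numpy as np

def circular_conv1d(x, w):
    """1-D convolution with circular (wrap-around) padding.
    x: (L,) input sequence; w: (K,) kernel with K odd."""
    pad = len(w) // 2
    xp = np.pad(x, pad, mode="wrap")  # wrap-around padding closes the loop
    return np.array([np.dot(xp[i:i + len(w)], w) for i in range(len(x))])

# When the kernel spans the whole sequence, every output position mixes
# every input position -- global interaction from convolution alone.
x = np.arange(7, dtype=float)        # global mean is 3.0
w_global = np.ones(7) / 7.0          # kernel as long as the input
y = circular_conv1d(x, w_global)
print(y)  # every entry equals the global mean, 3.0
```

Because the padding wraps around, a kernel as long as the input gives each output a full view of the sequence, which is how a purely convolutional structure can emulate the transformer's global receptive field.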
D. Object Detection of Antenna Interference Sources
Regularly monitoring and mitigating antenna interference
sources has become one of the most critical tasks in the
wireless communication field. In the past, detecting antenna
interference sources mainly relied on traditional techniques
such as spectrum analysis, signal recognition, and positioning.
However, these methods have many limitations. For example,
when detection personnel identify a radio interference signal
through a spectrum analyzer, they can determine only the
approximate direction of the interference source based on the
strength of the received signal and cannot accurately determine
its position.
The rapid advancement of deep learning and computer
vision has facilitated the successful application of object detection to assist tasks in various industries. Examples include defect detection in industrial settings, pest and weed detection in agriculture, and vehicle and pedestrian detection in transportation [52]–[56]. These solutions
provide effective ideas for our antenna interference source
detection task. When investigators confirm the approximate
direction of the interference source antenna through a signal
receiver and spectrum analyzer, they can use drones with
cameras and related object detection algorithms to replace
manual accurate positioning work. Unfortunately, antenna interference source detection based on object detection has remained largely unexplored. Owing to the lack of training samples and dedicated models for this task, existing detection methods are ill suited to antennas. Therefore, it is urgent and meaningful
to create a professional dataset and train a model suitable for
this detection task to address the difficulty of locating antenna
interference sources in the wireless communication field.
III. PROPOSED DETECTION FRAMEWORK
A. Overall Model Structure
The overall idea of the network (Fig. 2) lies in the combination of a CNN and a transformer, exploiting both the inductive bias of the convolutional operation and the transformer's ability to extract global information, while also meeting the needs of a lightweight model with low computational complexity. YOLO-Ant adopts DSLKNet, which is composed
of DSLK-Blocks, as the backbone for downsampling and
feature extraction in images. In DSLKNet, four DSLK-Layers
employ convolutional kernels of varying sizes to sequentially
extract rich features from different receptive fields of the
image. To address the challenge of detecting small objects, we
incorporate FPN and PAN neck structures for multiscale feature learning. For the neck component, we pruned the structure based on YOLOv5-s (detailed data are provided in Section IV). In comparison to the baseline model, the
pruned neck model features an increased number of module
stacks and a reduced number of channels in each module.
This structural modification effectively alleviates redundancy