YOLO-CIANNA：深度学习在射电天文星系检测中的应用——以SKAO SDC1为例

版权申诉

176 浏览量更新于2024-06-13 收藏 4.27MB PDF 举报

"YOLO-CIANNA是针对天文数据集的深度学习目标检测器，特别设计用于处理射电天文图像中的源检测。此方法基于流行的YOLO（You Only Look Once）框架，旨在应对平方公里阵列（SKA）等大型天文项目产生的海量数据。YOLO-CIANNA在SKAO SDC1数据集上的表现超越了所有已发表的结果，显著提高了源检测的准确性和效率。" YOLO-CIANNA是受到YOLO启发的深度学习算法，专门针对天文数据进行了优化。YOLO是一种实时目标检测系统，以其速度和准确性而著名。在YOLO-CIANNA中，研究人员针对射电天文图像的特性进行了定制，以解决该领域的独特挑战，例如噪声、复杂背景和源的形状变化。在射电天文学中，源检测是识别和测量天空中射电源的关键步骤。随着SKA这样的大型射电望远镜即将投入运营，其产生的数据量将对现有的数据分析方法提出巨大挑战。YOLO-CIANNA的目的是利用深度学习的强大力量，有效处理这些大数据集，并实现高效的目标检测。文章中提到，YOLO-CIANNA在SKAO SDC1数据集上的性能测试中表现出色，不仅提高了源检测的纯度，还检测到了比先前最佳结果多40%至60%的源。此外，即使在强制后处理以达到99%的纯度后，YOLO-CIANNA仍能检测到比其他高分方法多10%至30%的源，显示了其在保持检测质量的同时，具有高效率和鲁棒性。 YOLO-CIANNA的成功在于它能够实时处理图像，这对于处理SKA等设施产生的数据流至关重要。在单个GPU上，它可以每秒处理多个图像，这在处理天文数据的速度方面是一个显著的改进。 YOLO-CIANNA的开发标志着深度学习在天文数据分析中的一个重要里程碑，特别是在射电天文学领域。通过结合现代计算机视觉技术与天文科学的特定需求，该方法有望成为未来处理大规模天文数据的标准工具，极大地提升了源检测的效率和准确性。

Cornu et al.: YOLO-CIANNA: Galaxy detection with deep learning in radio data









 !"#!

$%

$&'#!(

$ !

$ $#!

$#!

$)*

$#

++

$&

$,-".#/

Go To

+

0

Target loop

Detection unit loop

Grid loop

Fig. 4: Summary of the target-prediction association algorithm of YOLO-CIANNA. All the steps are done independently by each

grid cell. All the elements are executed in order from top to bottom, with the left column representing the general case for the

association. All the reﬁnements corresponding to the B.2 block are described in Sect. A.6.

Article number, page 7 of 40

A&A proofs: manuscript no. yolo_sdc1_paper

Target boxes Raw pred. boxes Current best pred. Boxes to remove Final pred. boxes

(a) (b) (c) (d)

Fig. 5: Illustration of the NMS process in a given image. The dashed boxes are the target, and the solid boxes are the predictions.

The line widths of the boxes are scaled on their respective objectness score. The colors indicate the state of the box in the NMS

process at diﬀerent steps. Frame (a) represents the targets and the raw network predictions that remain after the objectness ﬁltering.

Frames (b) and (c) represent two successive steps of the NMS process with a diﬀerent best current box. Frame (d) represents the

remaining boxes after completion of the NMS.

2.8. Prediction ﬁltering and non-maximum suppression

We expect a properly trained detector to order its predictions by

quality based on the predicted objectness score for each detec-

tion unit. The raw detector output is always a static list of boxes

of size

, g

, N

×(6 + N

+ N

)

regardless of the input con-

tent. Consequently, the predicted boxes must be ﬁltered based on

their objectness score to remove those that are unlikely to repre-

sent an object. By design, the number of actually detectable ob-

jects in the image should be low compared to the total number of

detection units in the grid. Therefore, most of the predicted boxes

belong to the background type with a low objectness score.

While the continuous objectness score is the best direct rep-

resentation of the detector’s inner workings, it is incompatible

with some ﬁnal metric that needs a list of considered "good" de-

tections. Visualizing the predicted boxes also requires ﬁltering

to preserve only the plausible detection. In such cases, an object-

ness threshold can be used to remove low-conﬁdence detections.

This is usually done on a validation or test dataset not used for

training but for which the targets are known. The threshold can

be optimized to maximize a detection metric on this test dataset.

The main diﬃculty is that all detection units have ﬁtted

their objectness independently. Due to the fully convolutional

structure of the network, a given detection unit represents the

same objectness ﬁtting over the full grid, meaning that the same

threshold can be used. On the contrary, the predicted objectness

between two independent detection units is not comparable as

it depends on the type and frequency of targets associated with

each of them at training time. The classical solution is to ﬁt an

individual objectness threshold for each of them. However, it

considers that predictions from diﬀerent detection units are in-

dependent, which is not true for most applications. Still, ﬁtting

a per-detection-unit objectness threshold removes the vast ma-

jority of false positives. To achieve the best results, the object-

ness regimes must be homogenized between the diﬀerent detec-

tion units from the start by adjusting the individual λ

void

factors

(Sect. 2.5). While this is mainly an empirical and iterative pro-

cess, the main principle is to balance the ratio between detection

and background cases based on how the objects from the training

sample are expected to distribute over the detection units.

With most false positives removed, there can still be multi-

ple high-objectness predictions that represent the same object.

To preserve only the best-detected box for each object, we use a

classical post-processing step called non-maximum suppression

(NMS, Felzenszwalb et al. 2010; Girshick et al. 2013). It con-

sists of an iterative search for the box with the highest objectness

score in the image to remove all the overlapping predicted boxes.

To consider that there is an overlap, the two boxes must verify

fIoU > L

fIoU

NMS

with the fIoU being computed between the two

predicted boxes. All boxes with a lower objectness score than

the highest-scoring box are removed, and the best box is added

to a static prediction list. This process is repeated until no boxes

are left in the raw-prediction list. It is illustrated in Fig. 5.

The NMS is done regardless of what detection unit generated

the predicted boxes, demonstrating that they are not independent

as they can remove each other based on their respective object-

ness. This is one of the main reasons to force all detection units to

have similar objectness distributions. The detection quality can

only be evaluated after the NMS, so searching for the best λ

void

factors is dependent on the L

fIoU

NMS

and respectively.

3. Dataset description and network training

In this section, we present the main aspects of the SDC1 data

along with the expected products and the associated metrics. A

complete description of the SDC1 challenge and its data prod-

ucts can be found in Bonaldi et al. (2021), while the under-

lying T-RECS simulation is detailed in (Bonaldi et al. 2019).

We also present the pre-processing of the data to construct our

training sample. From this, we describe our best-performing

network architecture and specify the corresponding setup and

hyper-parameters for our YOLO-CIANNA detector.

3.1. Sub-challenge deﬁnition

The SDC1 is a source detection and characterization task in sim-

ulated SKA-like data products (Sect. 1) that comprises nine 4GB

images (three frequencies, with three integration times each) of

the same ﬁeld. The SDC1 is only modestly challenging regard-

ing data volume, especially compared to the SDC2 with its 1TB

data cube. Still, it represents signiﬁcant challenges for detec-

tion methods in many other aspects. All the images have the

same size of 32768 square pixels. As the frequency increases,

the angular resolution improves while the ﬁeld of view reduces.

Therefore, images at diﬀerent frequencies only partially over-

lap, meaning that the problem to solve varies with the position

in the ﬁeld. In addition, the number of detectable sources varies

signiﬁcantly with the integration time and frequency (Table 2 in

Article number, page 8 of 40

剩余39页未读，继续阅读

人工智能_SYBH

粉丝: 4w+
资源: 222

YOLO-CIANNA：深度学习在射电天文星系检测中的应用——以SKAO SDC1为例

基于深度学习的YOLO目标检测综述，具有研究学习价值

yolo网络 深度学习 源码

YOLO介绍（深度网络物体检测）

YOLO-CL：深度学习在SDSS中检测星系团

yolo-hand:using使用YOLO进行手部检测

yolo-tiny-v1-mobile：适用于Android和iOS的Yolo-用Kotlin和Swift编写的实时移动深度学习对象检测

YOLO-Nano:新版YOLO-Nano

YOLO-V5:使用对象检测模型YOLO-V5对图像进行定位和分类

YOLO-Tutorials:YOLO对象检测教程

yolo-unity:适用于Unity的YOLO游戏中对象检测（Windows）

最新资源

yolo网络深度学习源码