Real-Time Pedestrian Detection and Tracking at Night: An Infrared Vision System


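"This paper presents real-time pedestrian detection and tracking with an infrared pedestrian recognition system for night-time driver assistance. The system uses a near-infrared (NIR) camera and integrates three modules in a cascade (region-of-interest (ROI) generation, object classification, and tracking), using complementary visual features to separate targets 20 to 80 m away from cluttered background. Relying on the common observation that targets in night-time NIR images are brighter than the nearby background, it uses a dual-threshold segmentation algorithm for efficient ROI generation. Because the pedestrian class has large intraclass variability, a tree-structured two-stage detector is proposed, which addresses the problem by training separate classifiers on non-overlapping subsets."

The article examines infrared pedestrian recognition in detail, in particular its use in driver assistance systems for night-time driving. Infrared imaging provides usable visual information under poor lighting, which matters for road safety. The system described above has three main parts:

1. **ROI (region of interest) generation**: This module quickly locates the parts of the image that may contain pedestrians. Using a dual-threshold segmentation algorithm, the system identifies objects that are brighter than the background, which is how pedestrians commonly appear in infrared images.
2. **Object classification**: To cope with the diversity of pedestrian appearance, the system uses a tree-structured two-stage detector. This design lets the system partition and learn different pedestrian characteristics, so that it adapts to changes in pose, size, and illumination and classifies more accurately.
3. **Tracking**: Finally, a tracking module follows each detected pedestrian continuously and keeps the target consistent even under brief occlusion or fast motion.

The article may also cover the following topics:

- **Deep learning and feature extraction**: It may involve deep neural networks (such as convolutional neural networks, CNNs) for extracting and learning pedestrian features that help distinguish pedestrians from other objects.
- **Datasets and training**: Training the classifiers usually requires a large annotated dataset of infrared pedestrian images. How that dataset is built and used is critical to system performance.
- **Computational efficiency**: Because the system must run in real time, its design has to account for computational cost so that it responds quickly without degrading the driving experience.
- **False-alarm and miss rates**: Balancing false positives against false negatives is a key problem in pedestrian detection, and the system may include strategies for reducing both.
- **Real-time performance**: How the processing pipeline is optimized to run stably at high frame rates, for example through parallel computation or hardware acceleration.

Infrared pedestrian recognition combines computer vision, machine learning, and infrared imaging to make night-time driving safer: real-time pedestrian detection and tracking give the driver timely warnings and assistance.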

286 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 10, NO. 2, JUNE 2009

large intraclass variability in the pedestrian class. To deal with this problem, we propose a tree-structured two-stage detector based on Haar-like and HOG features to distinguish the objects from nonpedestrian candidates. Gentle AdaBoost is used to select the critical features and learn the classifiers from the training images. The classifier based on Haar-like features is used for rough classification, focusing on rejecting the nonpedestrian candidates and selecting the well-bounded candidates. As the size of the pedestrians varies in a wide range, three HOG-based classifiers are trained on three separate sets containing images of different size ranges to give precise classification. This way, the classification complexity is reduced, and it helps to improve system performance.
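
The two-stage routing described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the size ranges in `SIZE_BINS`, the thresholds, and the function names are hypothetical, and the classifiers are passed in as plain callables that return a confidence score.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical size ranges (candidate height in pixels). The actual ranges
# used in the paper are not given in this excerpt.
SIZE_BINS: Sequence[Tuple[int, int]] = ((0, 40), (40, 80), (80, 10_000))

Classifier = Callable[[object], float]  # maps a candidate to a confidence score


def pick_bin(height: int) -> int:
    """Return the index of the size bin that contains `height`."""
    for i, (lo, hi) in enumerate(SIZE_BINS):
        if lo <= height < hi:
            return i
    raise ValueError(f"height {height} is outside all size bins")


def classify(candidate: object, height: int,
             haar_stage: Classifier, hog_stages: Sequence[Classifier],
             haar_thr: float = 0.0, hog_thr: float = 0.0) -> bool:
    """Stage 1 rejects obvious non-pedestrians cheaply; stage 2 applies the
    classifier trained for the candidate's size range."""
    if haar_stage(candidate) < haar_thr:
        return False  # rejected by the rough classifier
    return hog_stages[pick_bin(height)](candidate) >= hog_thr
```

Because each second-stage classifier only ever sees candidates from its own size range, each one faces a narrower problem than a single classifier covering all sizes would.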

Although the object classification can achieve an FA rate as low as 1%, there still exist some FAs flashing due to the huge number of candidates in real-time processing. To suppress the spurious detections and fill the detection gap between frames, pedestrian tracking based on the Kalman filter and template matching is adopted to filter and optimize the detection results. The tracking algorithm relies on the Kalman filter to provide a spatial estimation of the detected pedestrians, and the detection confidence in each frame is accumulated to determine the detection certainty over time. For data association, the nearest overlapped neighbor following the combined distance criterion is selected as the observation. If the nearest-neighbor method fails, template matching based on the appearance is used to search for the possible observation. The tracking process is divided into two stages: pretracking and tracking. Newly detected objects enter the pretracking stage. Only after passing the multiframe validation in the pretracking stage do they start to be tracked as pedestrians by the system and be shown as output alarms.
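
As a rough illustration of the two ideas in this paragraph, the sketch below pairs a constant-velocity Kalman filter for one image coordinate with a track object that accumulates per-frame confidence and is promoted from pretracking to tracking. The state model, the noise values `q` and `r`, and the promotion threshold `promote_at` are assumptions made for the example; the excerpt does not specify the paper's state vector or validation rule.

```python
from dataclasses import dataclass


class Kalman1D:
    """Constant-velocity Kalman filter for a single coordinate (assumed model)."""

    def __init__(self, pos: float, q: float = 1.0, r: float = 4.0):
        self.p, self.v = float(pos), 0.0      # position and velocity estimates
        self.P = [[r, 0.0], [0.0, 100.0]]     # state covariance (velocity unknown)
        self.q, self.r = q, r                 # process / measurement noise

    def predict(self, dt: float = 1.0) -> float:
        """Propagate the state one frame ahead and return the predicted position."""
        self.p += self.v * dt
        (a, b), (c, d) = self.P
        # P <- F P F^T + Q with F = [[1, dt], [0, 1]] and Q = q * I
        self.P = [[a + dt * (b + c) + dt * dt * d + self.q, b + dt * d],
                  [c + dt * d, d + self.q]]
        return self.p

    def update(self, z: float) -> None:
        """Correct the state with a measured position z (H = [1, 0])."""
        (a, b), (c, d) = self.P
        s = a + self.r                        # innovation variance
        k0, k1 = a / s, c / s                 # Kalman gain
        y = z - self.p                        # innovation
        self.p += k0 * y
        self.v += k1 * y
        # P <- (I - K H) P
        self.P = [[(1 - k0) * a, (1 - k0) * b],
                  [c - k1 * a, d - k1 * b]]


@dataclass
class Track:
    """A candidate that starts in the pretracking stage."""
    confidence: float = 0.0   # detection confidence accumulated over frames
    confirmed: bool = False   # True once promoted to the tracking stage

    def update(self, frame_confidence: float, promote_at: float = 2.0) -> None:
        """Accumulate this frame's confidence; promote once it is high enough."""
        self.confidence += frame_confidence
        if self.confidence >= promote_at:
            self.confirmed = True
```

With this kind of rule, a false alarm that flashes in a single frame never accumulates enough confidence to be reported, while a pedestrian detected consistently over a few frames is promoted and shown.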

IV. ROI GENERATION

The ROI generation module, which tries to get regions that potentially contain pedestrians, can be regarded as a rough classifier operated on the entire original image. However, most learning-based approaches are time consuming and, thus, unsuitable, even though we adopt the most efficient AdaBoost classifier based on simple Haar-like features. The hard real-time constraint means that the rule-based methods are the only choices.

Different rule-based methods can be applied to different types of images. In the gray images captured at night by an NIR or normal camera, the fact that pedestrians always appear brighter than the surrounding background is usually utilized to extract the ROIs through thresholding.

A. Image Segmentation

Thresholding is the common and simple way to divide a gray image into foreground and background. Under uneven lighting conditions, the popular solution is adaptive thresholding, where different thresholds are used for different pixels or subregions in the image [25].
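
A minimal sketch of generic adaptive thresholding, for illustration only: each pixel is compared against the mean of its square neighborhood plus an offset. The window size `half` and the `offset` are hypothetical parameters, and this is not the specific method of [25].

```python
from typing import List

Image = List[List[int]]  # grayscale image as rows of intensity values


def adaptive_threshold(img: Image, half: int = 1, offset: float = 10) -> Image:
    """Mark a pixel as foreground (1) if it exceeds the mean of its
    (2*half+1) x (2*half+1) neighborhood by more than `offset`."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Clip the window at the image border.
            ys = range(max(0, y - half), min(h, y + half + 1))
            xs = range(max(0, x - half), min(w, x + half + 1))
            vals = [img[j][i] for j in ys for i in xs]
            local_mean = sum(vals) / len(vals)
            out[y][x] = 1 if img[y][x] > local_mean + offset else 0
    return out
```

Because the threshold follows the local mean, a bright object can still be separated from the background when overall brightness differs across the image, which a single global threshold cannot do.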

Generally, the adaptive threshold for each pixel is individually calculated based on its local neighborhood [13], [21]. However, in cluttered scenes, the segmented object regions may connect with the bright background and be split by the nonuniform brightness of pedestrians. The false segmentation often makes the classification fail and decreases the DR.

Fig. 3. Analysis of a typical pedestrian area. (a) Original image. (b) Topographic surface of (a). (c) and (d) The intensity values of the scan lines marked with arrows.

To take advantage of the low computation of the thresholding method while cutting down the faults in segmentation, we propose an adaptive dual-threshold segmentation algorithm to efficiently segment the foreground. Unlike Tian et al. [13], who calculate the thresholds on a square neighborhood, we locally determine the two thresholds in horizontal scan lines and optimize the parameters by experiments.

If the pedestrians appear brighter than the surrounding background, the situation will keep the same from the view of the horizontal scan lines, even when the pedestrians have nonuniform brightness. Fig. 3 presents an example of pedestrians with dark upper body and bright lower body. The two scan lines show that the pixels from the pedestrian area are brighter than the nearby background pixel on both sides of the person. Obviously, this condition is easier to be satisfied than that of common adaptive thresholding algorithms based on local regions, where the pixels that belong to objects must be brighter than the background in a large square neighborhood.
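
The scan-line condition can be illustrated with a single-threshold sketch in which each pixel is compared only with pixels in the same row. The paper's actual threshold formula, referred to as (1), is not part of this excerpt, so the row-window mean, the window half-width `half`, and the `offset` below are assumptions chosen for the example.

```python
from typing import List

Image = List[List[int]]  # grayscale image as rows of intensity values


def scanline_threshold(img: Image, half: int = 2, offset: float = 10) -> Image:
    """Mark a pixel as foreground (1) if it exceeds the mean of the pixels
    within `half` columns of it on the same row by more than `offset`."""
    out = []
    for row in img:
        w = len(row)
        mask = []
        for x in range(w):
            window = row[max(0, x - half):min(w, x + half + 1)]
            row_mean = sum(window) / len(window)
            mask.append(1 if row[x] > row_mean + offset else 0)
        out.append(mask)
    return out


# A pedestrian column with a dark upper body (60) and a bright lower body (200)
# against a background of 20: each row is judged only against its own neighbors.
pedestrian = [
    [20, 20, 60, 20, 20],
    [20, 20, 200, 20, 20],
]
mask = scanline_threshold(pedestrian)
```

In this toy image both rows yield `[0, 0, 1, 0, 0]`: the dark upper-body pixel is still segmented because it only has to be brighter than the background to its left and right, not brighter than a neighborhood that also contains the bright lower body.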

Meanwhile, calculating the thresholds from the scan lines has another advantage that the algorithm is inclined to segment the vertical bright regions of proper width, which not only helps to break the connection to the background but also can prevent segmenting the bright region of large horizontal size. Fig. 4(b) and (c) gives a comparison of the results from [13] and (1), where the thresholds calculated from the square neighborhood produce a large bright region on the road that does not exist in the result of the thresholds calculated from the scan lines.

However, because the brightness of the background and that of the pedestrians vary in a wide range, the 1-D signals from the scan lines are always contaminated by noises, and employing a single threshold for each pixel may easily cause a failure, as shown in Fig. 4(c). Thus, two thresholds are adopted to suppress
