没有合适的资源?快使用搜索试试~ 我知道了~
首页基于Object-C用于目标跟踪的全卷积暹罗网络
本文主要探讨了一种创新的基于Object-C的目标跟踪方法,即全卷积暹罗网络(Fully-Convolutional Siamese Networks, FC-Siamese)。传统的对象跟踪问题通常依赖于在线学习,只使用视频本身作为训练数据,这限制了模型的复杂性和适应性。作者们注意到,虽然深度卷积网络具有强大的表达能力,但当需要追踪的目标未知时,为了实时调整网络权重,采用随机梯度下降会导致系统速度大幅下降。 FC-Siamese网络作为一种新颖的解决方案,旨在克服这一局限。它是一种端到端训练的架构,借鉴了Siamese网络的思想,即两个相同的神经网络同时处理输入的两帧图像,从而捕捉目标的相似性或变化。这种设计允许网络在整个视频序列中学习,无需在每次新帧出现时都需要重新训练,大大提高了效率。 在ILSVRC15数据集上进行训练,FC-Siamese网络特别用于视频中的对象检测任务,它不仅提供了高效的实时性能,而且即使在极其简单的设置下,也能在多个基准测试中展现出先进的跟踪效果。这种技术的优势在于它能够处理复杂的场景和动态变化,而无需预先知道目标,这对于实时应用场景如自动驾驶、视频监控和运动分析等具有显著的价值。 本文的贡献在于提出了一种结合了深度学习和卷积神经网络的高效目标跟踪策略,它通过全卷积架构实现了在线学习的灵活性和速度的提升,使得目标跟踪在实际应用中更加可靠和实时。这种技术的发展对于推动计算机视觉领域,特别是目标跟踪技术的发展具有重要意义。
资源详情
资源推荐
![](https://csdnimg.cn/release/download_crawler_static/88981526/bg4.jpg)
4 L. Bertinetto, J. Valmadre, J. F. Henriques, A. Vedaldi, P. H. S. Torr
where b
1
denotes a signal which takes value b ∈ R in every location. The output
of this network is not a single score but rather a score map defined on a finite
grid D ⊂ Z
2
as illustrated in Figure 1. Note that the output of the embedding
function is a feature map with spatial support as opposed to a plain vector. The
same technique has been applied in contemporary work on stereo matching [23].
During tracking, we use a search image centred at the previous position of
the target. The position of the maximum score relative to the centre of the
score map, multiplied by the stride of the network, gives the displacement of the
target from frame to frame. Multiple scales are searched in a single forward-pass
by assembling a mini-batch of scaled images.
Combining feature maps using cross-correlation and evaluating the network
once on the larger search image is mathematically equivalent to combining fea-
ture maps using the inner product and evaluating the network on each translated
sub-window independently. However, the cross-correlation layer provides an in-
credibly simple method to implement this operation efficiently within the frame-
work of existing conv-net libraries. While this is clearly useful during testing, it
can also be exploited during training.
2.2 Training with large search images
We employ a discriminative approach, training the network on positive and
negative pairs and adopting the logistic loss
`(y, v) = log(1 + exp(−yv)) (3)
where v is the real-valued score of a single exemplar-candidate pair and y ∈
{+1, −1} is its ground-truth label. We exploit the fully-convolutional nature of
our network during training by using pairs that comprise an exemplar image and
a larger search image. This will produce a map of scores v : D → R, effectively
generating many examples per pair. We define the loss of a score map to be the
mean of the individual losses
L(y, v) =
1
|D|
X
u∈D
`(y[u], v[u]) , (4)
requiring a true label y[u] ∈ {+1, −1} for each position u ∈ D in the score map.
The parameters of the conv-net θ are obtained by applying Stochastic Gradient
Descent (SGD) to the problem
arg min
θ
E
(z,x,y)
L(y, f(z, x; θ)) . (5)
Pairs are obtained from a dataset of annotated videos by extracting exemplar
and search images that are centred on the target, as shown in Figure 2. The
images are extracted from two frames of a video that both contain the object
and are at most T frames apart. The class of the object is ignored during training.
The scale of the object within each image is normalized without corrupting the
剩余15页未读,继续阅读
![rar](https://img-home.csdnimg.cn/images/20210720083606.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![](https://profile-avatar.csdnimg.cn/de94cf74eb0e4fd0b6c74b36e6ee2ebd_sinat_19383265.jpg!1)
初心不忘产学研
- 粉丝: 5390
- 资源: 180
上传资源 快速赚钱
我的内容管理 收起
我的资源 快来上传第一个资源
我的收益
登录查看自己的收益我的积分 登录查看自己的积分
我的C币 登录后查看C币余额
我的收藏
我的下载
下载帮助
![](https://csdnimg.cn/release/wenkucmsfe/public/img/voice.245cc511.png)
会员权益专享
最新资源
- BSC关键绩效财务与客户指标详解
- 绘制企业战略地图:从财务到客户价值的六步法
- BSC关键绩效指标详解:财务与运营效率评估
- 手持移动数据终端:常见问题与WIFI设置指南
- 平衡计分卡(BSC):绩效管理与战略实施工具
- ESP8266智能家居控制系统设计与实现
- ESP8266在智能家居中的应用——网络家电控制系统
- BSC:平衡计分卡在绩效管理与信息技术中的应用
- 手持移动数据终端:常见问题与解决办法
- BSC模板:四大领域关键绩效指标详解(财务、客户、运营与成长)
- BSC:从绩效考核到计算机网络的关键概念
- BSC模板:四大维度关键绩效指标详解与预算达成分析
- 平衡计分卡(BSC):绩效考核与战略实施工具
- K-means聚类算法详解及其优缺点
- 平衡计分卡(BSC):从绩效考核到战略实施
- BSC:平衡计分卡与计算机网络中的应用
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
![](https://img-home.csdnimg.cn/images/20220527035711.png)
![](https://img-home.csdnimg.cn/images/20220527035711.png)
![](https://img-home.csdnimg.cn/images/20220527035111.png)
安全验证
文档复制为VIP权益,开通VIP直接复制
![](https://csdnimg.cn/release/wenkucmsfe/public/img/green-success.6a4acb44.png)