outputs.gather

outputs.gather是一个用于多GPU并行计算中的方法。在多GPU训练中，模型通常会在每个GPU上并行计算不同的输入数据，得到各自的输出。outputs.gather的作用就是将不同GPU上的输出进行收集和合并，以得到最终的输出结果。具体来说，outputs.gather接收一个outputs的列表作为输入，这个列表中的每个元素是一个tensor，代表一个GPU上的输出。函数会将这些tensor按照指定的方式进行合并，得到一个合并后的tensor作为输出。合并方式可以是按维度拼接（concatenate）或按维度堆叠（stack）。举个例子，假设我们有两个GPU，在每个GPU上进行了相同的计算，得到了两个不同的输出tensor：output1和output2。如果我们希望将这两个tensor按照第0维进行拼接，就可以使用outputs.gather([output1, output2], dim=0)这样的方法。这样，函数会将两个tensor在第0维上拼接，并返回一个合并后的tensor作为最终输出。 outputs.gather的使用可以使得多GPU计算变得更加方便和高效。通过合并不同GPU上的输出，我们可以得到整个模型在整个训练集上的输出结果，以进行下一步的操作，比如计算损失函数或进行精度评估。总结起来，outputs.gather是一个用于多GPU并行计算中的方法，用于将不同GPU上的输出进行收集和合并，得到最终的输出结果。使用这个方法，我们可以方便地将多个GPU上的计算结果整合在一起，以便进行后续的操作。

分析这个代码class OhemCrossEntropy(nn.Module): def init(self, ignore_label=-1, thres=0.7, min_kept=100000, weight=None): super(OhemCrossEntropy, self).init() self.thresh = thres self.min_kept = max(1, min_kept) self.ignore_label = ignore_label self.criterion = nn.CrossEntropyLoss( weight=weight, ignore_index=ignore_label, reduction='none' ) def _ce_forward(self, score, target): ph, pw = score.size(2), score.size(3) h, w = target.size(1), target.size(2) if ph != h or pw != w: score = F.interpolate(input=score, size=( h, w), mode='bilinear', align_corners=config.MODEL.ALIGN_CORNERS) loss = self.criterion(score, target) return loss def _ohem_forward(self, score, target, **kwargs): ph, pw = score.size(2), score.size(3) h, w = target.size(1), target.size(2) if ph != h or pw != w: score = F.interpolate(input=score, size=( h, w), mode='bilinear', align_corners=config.MODEL.ALIGN_CORNERS) pred = F.softmax(score, dim=1) pixel_losses = self.criterion(score, target).contiguous().view(-1) mask = target.contiguous().view(-1) != self.ignore_label tmp_target = target.clone() tmp_target[tmp_target == self.ignore_label] = 0 pred = pred.gather(1, tmp_target.unsqueeze(1)) pred, ind = pred.contiguous().view(-1,)[mask].contiguous().sort() min_value = pred[min(self.min_kept, pred.numel() - 1)] threshold = max(min_value, self.thresh) pixel_losses = pixel_losses[mask][ind] pixel_losses = pixel_losses[pred < threshold] return pixel_losses.mean() def forward(self, score, target): if config.MODEL.NUM_OUTPUTS == 1: score = [score] weights = config.LOSS.BALANCE_WEIGHTS assert len(weights) == len(score) functions = [self._ce_forward] * \ (len(weights) - 1) + [self._ohem_forward] return sum([ w * func(x, target) for (w, x, func) in zip(weights, score, functions) ])

这是一个实现了OHEM(Online Hard Example Mining)的交叉熵损失函数，用于解决深度学习中存在难样本或噪声样本导致训练效果不好的问题。其中，thresh表示像素的softmax预测概率阈值，小于该阈值的像素被认为是难样本；min_kept表示每张图中保留的难样本最小数量；ignore_label表示忽略的标签；weight表示权重。_ce_forward函数实现了普通的交叉熵损失的计算；_ohem_forward函数实现了OHEM的计算。在forward函数中，对于有多个输出的模型，采用了权重平衡的方式进行计算。

阅读全文

相关推荐

Linux平台Logstash 7.15.2版本发布

Zelio 3逻辑输入控制三输出功能详解

Arduino音序器项目开发的深入探讨

信息系统项目管理 inputs 和 outputs 综合指南

TensorFlow动态循环神经网络(tf.nn.dynamic_rnn)返回值解析

复古怀旧教室桌椅素材同学聚会毕业纪念册模板.pptx

数据结构与算法中的回文串：马拉车算法与中心扩展法的Python实现及其应用

前端分析-2023071100789

国货彩妆品牌Q1社媒营销投放分析报告.pdf

内螺旋覆盖算法（ISC）matlab代码

21电平MMC整流站、MMC逆变站、两端柔性互联的MATLAB仿真模型，4端柔性互联、MMC桥臂平均值模型、MMC聚合模型（四端21电平一分钟即能完成2s的工况仿真） 1-全部能正常运行，图四和图五为

永磁同步电机电流滞环控制Matlab simulink仿真模型，参数已设置好，可直接运行 属于PMSM转速电流双闭环矢量控制系统模型 电流内环采用电流滞环控制（pang-pang控制），转速外环为

基于java的项目申报管理系统设计与实现.docx

大家在看

MTK_Camera_HAL3架构.doc

plink的GWAS数据处理作业流程.docx

清华virtuoso简明教程

如何使用matlab中的ode45函数进行仿真，详细讲解

西安石油大学2019-2023 计算机考研808数据结构真题卷

最新推荐

微软内部资料-SQL性能优化3

复古怀旧教室桌椅素材同学聚会毕业纪念册模板.pptx

HTML挑战：30天技术学习之旅

【CodeBlocks精通指南】：一步到位安装wxWidgets库（新手必备）

andorid studio 配置ERROR: Cause: unable to find valid certification path to requested target

VC++实现文件顺序读写操作的技巧与实践

【大数据时代必备：Hadoop框架深度解析】：掌握核心组件，开启数据科学之旅

opencv的demo程序

NeuronTransportIGA: 使用IGA进行神经元材料传输模拟

【Linux多系统管理大揭秘】：专家级技巧助你轻松驾驭

永磁同步电机电流滞环控制Matlab simulink仿真模型，参数已设置好，可直接运行属于PMSM转速电流双闭环矢量控制系统模型电流内环采用电流滞环控制（pang-pang控制），转速外环为