vanishing gradient problem

Vanishing Gradient Problem（梯度消失问题）是指在深度神经网络中，由于反向传播算法的特性，随着反向传播的深入，梯度会逐渐变得非常小，甚至趋近于零，导致神经网络无法继续学习或学习非常缓慢。这个问题通常出现在有很多层的神经网络中，而深度神经网络的优势往往正是在于其层数的增加，因此解决Vanishing Gradient Problem是深度学习中需要关注的重要问题之一。

Compared with homogeneous network-based methods, het- erogeneous network-based treatment is closer to reality, due to the different kinds of entities with various kinds of relations [22– 24]. In recent years, knowledge graph (KG) has been utilized for data integration and federation [11, 17]. It allows the knowledge graph embedding (KGE) model to excel in the link prediction tasks [18, 19]. For example, Dai et al. provided a method using Wasser- stein adversarial autoencoder-based KGE, which can solve the problem of vanishing gradient on the discrete representation and exploit autoencoder to generate high-quality negative samples [20]. The SumGNN model proposed by Yu et al. succeeds in inte- grating external information of KG by combining high-quality fea- tures and multi-channel knowledge of the sub-graph [21]. Lin et al. proposed KGNN to predict DDI only based on triple facts of KG [66]. Although these methods have used KG information, only focusing on the triple facts or simple data fusion can limit performance and inductive capability [69]. Su et al. successively proposed two DDIs prediction methods [55, 56]. The first one is an end-to-end model called KG2ECapsule based on the biomedical knowledge graph (BKG), which can generate high-quality negative samples and make predictions through feature recursively propagating. Another one learns both drug attributes and triple facts based on attention to extract global representation and obtains good performance. However, these methods also have limited ability or ignore the merging of information from multiple perspectives. Apart from the above, the single perspective has many limitations, such as the need to ensure the integrity of related descriptions, just as network-based methods cannot process new nodes [65]. So, the methods only based on network are not inductive, causing limited generalization [69]. However, it can be alleviated by fully using the intrinsic property of the drug seen as local information, such as chemical structure (CS) [40]. And a handful of existing frameworks can effectively integrate multi-information without losing induction [69]. Thus, there is a necessity for us to propose an effective model to fully learn and fuse the local and global infor- mation for improving performance of DDI identification through multiple information complementing.是什么意思

这段文字讨论了异质网络方法相对于同质网络方法更接近现实的原因，并介绍了知识图谱和知识图谱嵌入模型在链接预测任务中的应用。作者提出了一些方法来解决多信息融合和归纳能力限制等问题，包括使用药物的局部信息如化学结构，以及利用多种信息来提高药物相互作用识别的性能。这段文字主要是关于药物相互作用预测领域的研究。

阅读全文

vanishing gradient problem

相关推荐

深度学习基础及卷积神经网络应用详解

三维立体消隐技术-立方体旋转方法探究

LSTM理论详解：解决长短期记忆问题与实验验证

Power Tools for Training Deep Networks, Solving Gradient Vanishing, Enhancing Model Performance

阶段3愤怒的小鸟：Vanishing Pigs游戏解析

深度学习困境：梯度消失与爆炸现象详解——以Kaggle房价预测为例

utlog.sqlite

钢结构原理课程设计：露顶式平面钢闸门设计任务及指南

springboot-vue-数计学院学生综合素质评价系统的设计与实现-源码工程-29页从零开始全套图文详解-28页设计论文-21页答辩ppt-全套开发环境工具、文档模板、电子教程、视频教学资源分享

四相交错并联同步整流Buck变器 MATLAB仿真 低压大电流 输入：12VDC 输出：1V 100A 单相电流25A 关键参数设计，磁元件设计 理想仿真，实现均流输出

Python机器人-这是机器人算法的 Python 代码集合

mellitz_3dj_01_2411.zip

Shell编程范例完整版PDF最新版本

MMC-HVDC仿真模型，pscad柔性直流输电仿真mmc仿真模型，双端mmc模型，MMC为21电平NLM和均压控制，还有多端如张北直流电网以及基本mmc逆变器，自己为biye网上收集的一些觉得有用的

MATLAB金属表面缺陷分析（面板）.rar

python项目实战之旅游推荐数据分析可视化(源码+部署说明).zip

西门子1200与ABB机器人TCP 通信案例.zip

森林草原防灭火指挥系统解决方案（和对讲）PPT(44页).pptx

Java源码ssm框架社区生活超市进销存购物商城管理-毕业设计论文-期末大作业.rar

大家在看

Compax 3 调试步骤.pdf

Code-Generation-ARM-Compiler-V5.05update

Morpho3.2操作手册

seadas海洋遥感软件使用说明

Fundamentals of Wireless Communication-David Tse -课后习题答案

最新推荐

utlog.sqlite

钢结构原理课程设计：露顶式平面钢闸门设计任务及指南

springboot-vue-数计学院学生综合素质评价系统的设计与实现-源码工程-29页从零开始全套图文详解-28页设计论文-21页答辩ppt-全套开发环境工具、文档模板、电子教程、视频教学资源分享

四相交错并联同步整流Buck变器 MATLAB仿真 低压大电流 输入：12VDC 输出：1V 100A 单相电流25A 关键参数设计，磁元件设计 理想仿真，实现均流输出

Python机器人-这是机器人算法的 Python 代码集合

易语言例程：用易核心支持库打造功能丰富的IE浏览框

管理建模和仿真的文件

STM32F407ZG引脚功能深度剖析：掌握引脚分布与配置的秘密（全面解读）

给出文档中问题的答案代码

Docker构建与运行Next.js应用的指南

四相交错并联同步整流Buck变器 MATLAB仿真低压大电流输入：12VDC 输出：1V 100A 单相电流25A 关键参数设计，磁元件设计理想仿真，实现均流输出

四相交错并联同步整流Buck变器 MATLAB仿真低压大电流输入：12VDC 输出：1V 100A 单相电流25A 关键参数设计，磁元件设计理想仿真，实现均流输出