信息融合：统计机器学习与深度学习的竞合分析

需积分: 9 113 浏览量更新于2024-09-09 收藏 683KB PDF 举报

“统计机器学习与深度学习在信息融合中的竞合关系探讨” 在信息融合领域，统计机器学习（SML）和深度学习（DL）是两种重要的技术手段，它们各自有着独特的优势，并在某些情况下相互竞争，而在其他情况下则可能协同工作。本文由Ling Guan、Lei Gao、Nour El Din Elmadany（来自加拿大多伦多的瑞尔森大学）以及Chengwu Liang（来自中国郑州的郑州大学）共同撰写，深入研究了这两种方法在信息融合中的应用。统计机器学习通过引入先验知识、熵度量、相关分析以及输入数据的内在统计结构，发展了许多创新的信息融合方法。这种方法强调对不同模态数据之间内在关系的智能挖掘，以提取更有效或区分性强的信息，特别适用于多媒体处理和生物识别等领域。另一方面，深度学习近年来取得了显著的进步，吸引了大量机器学习研究者的关注。深度神经网络（DNN）和卷积神经网络（CNN）等技术能够自动学习多层次的特征表示，尤其在图像识别、语音识别和自然语言处理等方面表现出强大的性能。深度学习在处理大量数据时的优势在于其能自动从原始输入中学习复杂模式，无需人为设计复杂的特征工程。在信息融合中，统计机器学习和深度学习可能存在竞争关系。例如，在有限数据集上，统计机器学习可能由于其对数据分布的理解和建模能力而表现更好；而深度学习在大数据场景下，凭借其强大的泛化能力和适应性可能更胜一筹。然而，两者的协作也日益受到重视。结合SML的理论基础和DL的模型表达能力，可以构建更加鲁棒和高效的信息融合系统。例如，将统计学习的先验知识融入深度网络的训练过程，或者利用深度学习的特征提取能力改进统计模型，都有可能提升整体系统的性能。总结来说，统计机器学习和深度学习在信息融合中既存在竞争，也有合作。理解并充分利用它们的互补性，对于推动信息融合技术的进步至关重要。未来的研究趋势可能集中在如何更好地集成这两种方法，以实现更高效、更智能的信息融合解决方案。

Statistical Machine Learning vs Deep Learning in Information Fusion:

Competition or Collaboration?

Ling Guan, Lei Gao, Nour El Din Elmadany

Ryerson University, Toronto, Canada

lguan@ryerson.ee.ca; iegaolei@gmail.com; nourelmadany@gmail.com

Chengwu Liang

Zhengzhou University, Zhengzhou, China

cliang@ee.ryerson.ca; or liangchengwu0615@126.com

Abstract

Information fusion is the process of coherently and intel-

ligently combining knowledge extracted from different sen-

sors/modalities, in order to obtain more useful or discrimi-

nant information for the purpose of multimedia processing

and biometrics, among others. The key to successful infor-

mation fusion is to intelligently exploit the intrinsic relation-

s between the data of different modalities. Statistical ma-

chine learning (SML) has played a major role in develop-

ing new information fusion methods, by incorporating prior

knowledge and entropy metric, correlation analysis, inher-

ent statistical structures of input data, and nonlinear rela-

tions. On the other hand, the recent development of deep

learning (DL) draws enormous attention from the machine

learning community. DL algorithms possess deep struc-

tures, requiring a large amount of data to train the huge

number of parameters, an ultra-expensive process. Howev-

er, the payoff is enormous; unprecedented success in many

applications. This paper will ﬁrst review recent develop-

ment of both SML and DL in the context of information fu-

sion, then analyze their pros and cons, and compare their

performance in a number of application domains. Based

on preliminary results, some thoughts will be presented on

how SML and DL can work together to bring the study in

machine learning to the next level, better serving human

needs.

1 Introduction

Information is obtained through different types of ac-

quisition techniques and multiple sources. The availabili-

ty of such multimodal information has been growing with

extremely fast pace[1]. Therefore, information fusion of

multiple sources is becoming an increasingly important re-

search topic for multimedia analysis, pattern recognition,

computer vision and biometrics[2].

In general, natural integration of multiple media, their

associated features, and the intermediate representation or

decisions are referred to as information fusion[3]. Com-

monly, there are three levels of information fusion: data

level, feature/representation level and decision level.

Among the three levels of information fusion, da-

ta/feature level fusion has drawn signiﬁcant attention from

the research communities of multimedia and biometrics due

to its capacity of information preservation and impressive

progress has been made[4]. Statistical machine learning

(SML) based approaches have stood out[5][6]. Among

them, Bayesian networks (BNs)[7], correlation based meth-

ods, and discriminative analysis have been the mainstreams,

since they are able to handle uncertainties by modeling the

dependencies, and describe the domain knowledge mathe-

matically in a graphical structure.

For the recognition tasks, correlation based methods,

e.g., Canonical Correlation analysis (CCA)[8] and its dis-

criminative version Discriminative CCA (DCCA)[9], and

Multi-set CCA (MCCA)[10] were proposed to identify the

correlation and discriminative information of different fea-

ture streams for visual recognition. After that, discrimina-

tive MCCA (DMCCA) is presented[11, 12] for performance

enhancement.

One of the prominent recent studies of SML is in the

area of biometric applications and video based human ac-

tion recognition [13][14]. With the release of cost-effective

sensors such as Kinect RGB-Depth camera, there has been

an increasing interest in developing new models and meth-

ods for recognizing actions with multimodal information,

such as sub-action segmentation and feature coding by the

discriminative locality-constrained afﬁne subspace coding

method[15].

On the other hand, deep learning (DL) based method-

s, such as Convolutional neural networks (CNN), Recur-

rent neural networks (RNN) and Long Short Term Memo-

ry networks (LSTM) have dramatically improved the state-

of-the-art in visual object recognition, object detection, ac-

tion recognition and other applications[16, 17, 18]. Among

these methods, CNN is one of the most notable approaches.

It has been found highly effective and is also the most com-

monly used in diverse computer vision applications. Chan

et al.[17] proposed a new deep learning architecture named

principal components analysis network (PCANet) whose

251

2018 IEEE Conference on Multimedia Information Processing and Retrieval

DOI 10.1109/MIPR.2018.00059

下载后可阅读完整内容，剩余5页未读，立即下载

sgphoto

粉丝: 0
资源: 3

信息融合：统计机器学习与深度学习的竞合分析

在ML，DL，AI下标记的中型文章数据集.zip

AI-ML-DL.eddx

ML-DL-implementation:仅使用NumPy和Matplotlib在python中从头开始实现ML和DL算法

ML-DL

ML_DL

ML_DL_study:MLDL研究领域

AI_ML_DL:AI ML Dl基础教程

Ineuron_Assignments：ML和DL

ml_academic：#ml #dl

Medium Articles tagged under ML/DL/AI 在 ML/DL/AI 下标记的中等文章-数据集

最新资源