input feature space.
The final type of deep learning architecture utilized in the RMDL model is the Recurrent Neural Network (RNN), in which outputs from the neurons are fed back into the network as inputs for the next step. Recent extensions of this architecture use Gated Recurrent Units (GRUs) [5] or Long Short-Term Memory (LSTM) units [24]; these units help mitigate the instability problems of the original network architecture. RNNs have been used successfully for natural language processing [25]. Recently, Z. Yang et al. in 2016 [26] developed hierarchical attention networks for document classification. These networks have two important characteristics: a hierarchical structure and an attention mechanism applied at both the word and sentence levels.
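To make the recurrent building block concrete, the following is a minimal sketch of an LSTM-based text classifier in Keras; the vocabulary size, sequence length, and layer widths are illustrative assumptions, not the configuration used in RMDL. Replacing the LSTM layer with a GRU layer yields the GRU variant.

```python
from tensorflow.keras import Input, Sequential
from tensorflow.keras.layers import Dense, Embedding, LSTM

VOCAB_SIZE = 20000   # assumed vocabulary size
MAX_LEN = 500        # assumed maximum document length in tokens
NUM_CLASSES = 10     # assumed number of target classes

model = Sequential([
    Input(shape=(MAX_LEN,)),                  # sequences of word indices
    Embedding(VOCAB_SIZE, 100),               # map word indices to dense vectors
    LSTM(128),                                # hidden state feeds back into the network each step
    Dense(NUM_CLASSES, activation="softmax"), # probability distribution over classes
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```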
Recent work has combined these three basic deep learning architectures into novel techniques with enhanced accuracy and robustness. M. Turan et al. in 2017 [7] and M. Liang et al. in 2015 [27] implemented innovative combinations of CNNs and RNNs called Recurrent Convolutional Neural Networks (RCNNs). K. Kowsari et al. in 2017 [1] introduced Hierarchical Deep Learning for Text classification (HDLTex), which combines these deep learning techniques in a hierarchical structure for document classification and improves accuracy over traditional methods. The work in this paper builds on these ideas, specifically on [1], to provide a more general approach to supervised learning for classification.
III. BASELINES
In this paper, we use both contemporary and traditional document and image classification techniques as our baselines. The baselines for image and text classification differ in their feature extraction and model structure; thus, the text and image classification baselines are described separately as follows:
A. Text Classification Baselines
The text classification techniques used as baselines to evaluate our model are as follows: standard deep models such as Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), and Deep Neural Networks (DNNs); two different techniques based on Support Vector Machines (SVMs); Naive Bayes Classification (NBC); and finally Hierarchical Deep Learning for Text Classification (HDLTex) [1].
1) Deep learning
The deep learning baseline used in this paper is deep learning without a hierarchical level; one of our baselines for text classification is the hierarchical attention network of [26]. In Section V (Methods), we describe the basic deep learning models (DNN, CNN, and RNN) that are used as components of the RMDL model.
2) Support Vector Machine (SVM)
The original version of the SVM was introduced by V. N. Vapnik and A. Ya. Chervonenkis [28] in 1963. In the early 1990s, a nonlinear version was introduced in [29].
Multi-class SVM:
The basic SVM performs binary classification, so for the multi-class case we need to build a multi-class SVM (MSVM). One-vs-One is one such technique; it requires building $N(N-1)/2$ binary classifiers, one for each pair of classes. The resulting optimization problem is:
\[
\min_{w,\,\zeta}\ \frac{1}{2}\sum_{k=1}^{K} w_k^{T} w_k \;+\; C \sum_{(x_i,\, y_i)\in D}\ \sum_{k \neq y_i} \zeta_i^{k} \tag{5}
\]
such that:
\[
w_{y_i}^{T} x_i - w_k^{T} x_i \;\geq\; 2 - \zeta_i^{k}, \qquad \zeta_i^{k} \geq 0, \qquad \forall\, k \neq y_i
\]
where $(x_i, y_i)$ is a training data point such that $(x_i, y_i) \in D$, $C$ is the penalty parameter, $\zeta$ is the slack parameter, $k$ stands for the classes, and $w$ denotes the learned parameters.
Another technique for multi-class classification using SVMs is One-against-All. Many feature extraction methods for SVMs have been addressed [32], but we use two techniques: word-sequence feature extraction [33] and term frequency-inverse document frequency (TF-IDF).
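As an illustration of this baseline, the sketch below builds a TF-IDF plus multi-class SVM pipeline with scikit-learn; the toy corpus and hyperparameters are assumptions for demonstration only. Note that scikit-learn's SVC implements the One-vs-One scheme internally.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

# Placeholder corpus and labels for demonstration.
train_docs = ["first training document", "second training document",
              "third training document"]
train_labels = [0, 1, 2]

# TF-IDF features feed a multi-class SVM; SVC trains the
# N(N-1)/2 pairwise binary classifiers of One-vs-One internally.
clf = make_pipeline(TfidfVectorizer(), SVC(kernel="linear", C=1.0))
clf.fit(train_docs, train_labels)
print(clf.predict(["an unseen test document"]))
```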
Stacking Support Vector Machines (SVMs): We use stacking SVMs as another baseline for comparison with RMDL on datasets that have hierarchical labels. The stacking SVM provides an ensemble of individual SVM classifiers and generally produces more accurate results than single-SVM models [34], [35].
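A minimal sketch of a stacking-SVM ensemble using scikit-learn's StackingClassifier follows; the base learners, meta-learner, and synthetic data are illustrative assumptions, not the exact configuration of [34], [35].

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.svm import SVC

# Synthetic multi-class data standing in for document features.
X, y = make_classification(n_samples=200, n_features=20, n_informative=8,
                           n_classes=3, random_state=0)

# Individual SVM base learners with different kernels.
base_learners = [
    ("svm_linear", SVC(kernel="linear", probability=True)),
    ("svm_rbf", SVC(kernel="rbf", probability=True)),
]
# A higher-level SVM combines the base learners' outputs.
stacked_svm = StackingClassifier(estimators=base_learners,
                                 final_estimator=SVC(kernel="linear"))
stacked_svm.fit(X, y)
print(stacked_svm.score(X, y))
```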
3) Naive Bayes Classification (NBC)
This technique has been used in industry and academia for a long time; it is the most traditional method of text categorization and is widely used in Information Retrieval [36]. Given $n$ documents to fit into $k$ categories, the predicted class as output is $c \in C$. Naive Bayes is a simple algorithm that uses Bayes' rule, described as follows:
\[
P(c \mid d) = \frac{P(d \mid c)\, P(c)}{P(d)} \tag{6}
\]
where $d$ is the document and $c$ indicates a class. The most probable class is then:
\[
c_{MAP} = \operatorname*{arg\,max}_{c \in C}\ P(d \mid c)\, P(c) \tag{7}
\]
The baseline of this paper is the word-level NBC [37]. Let $\hat{\theta}_j = P(w_j \mid c)$ be the parameter for word $j$; then
\[
P(c \mid d) \;\propto\; P(c) \prod_{j=1}^{|V|} \hat{\theta}_j^{\, x_j} \tag{8}
\]
where $x_j$ is the count of word $j$ in document $d$ and $|V|$ is the vocabulary size.
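For illustration, a word-level NBC baseline of this kind can be reproduced with scikit-learn's MultinomialNB, which estimates the class priors $P(c)$ and the per-word parameters of Eq. (8) from word counts; the toy corpus below is an assumption for demonstration only.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Placeholder corpus and labels for demonstration.
train_docs = ["good movie", "bad movie", "great film", "terrible film"]
train_labels = [1, 0, 1, 0]

# CountVectorizer produces the per-word counts x_j; MultinomialNB
# estimates P(c) and the per-word parameters theta_j of Eq. (8).
nbc = make_pipeline(CountVectorizer(), MultinomialNB())
nbc.fit(train_docs, train_labels)
print(nbc.predict(["good film"]))
```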