一维卷积神经网络在信用评分数据集特征选择中的应用

下载需积分: 5 | PDF格式 | 3.52MB | 更新于2024-07-14 | 109 浏览量 | 举报

"Yoichi Hayashi和Naoki Takano在2020年发表的IEEE论文‘One-Dimensional Convolutional Neural Networks with Feature Selection for Highly Concise Rule Extraction from Credit Scoring Datasets with Heterogeneous Attributes’探讨了如何处理具有异质属性的信用评分数据集的分类问题。" 本文主要关注的是在金融和银行业广泛应用的、包含异质属性的信用评分数据集的分类问题。传统的卷积神经网络（CNN）虽然在许多领域表现出高效性，但并不适用于所有类型的数据集，尤其是那些拥有多种不同类型特征的数据。这类数据集的分类难度较大，现有的高精度分类器和规则提取方法往往无法达到足够高的分类准确率或生成简洁的分类规则。为了应对这一挑战，作者提出了一种新的方法，即采用一维（1D）全连接层先行的CNN结构，结合特征选择策略。这种方法的创新之处在于，通过1D CNN对数据进行初步处理，捕获局部特征，然后通过全连接层进行特征融合和信息提取。特征选择在此过程中起着关键作用，它有助于减少模型的复杂性，提高模型的可解释性，同时保持高分类准确性。文章可能详细讨论了以下几点： 1. **1D CNN的结构与工作原理**：阐述了1D CNN如何应用于一维序列数据，如时间序列或者具有线性结构的特征。 2. **特征选择的策略**：可能介绍了如何在CNN中实施特征选择，以降低维度，提高模型效率，并增强模型的透明度。 3. **实验设计与结果**：可能包含了在不同信用评分数据集上的实验，比较了新方法与其他传统方法（如决策树、随机森林等）在准确率、规则简洁性和运行时间等方面的表现。 4. **模型的解释性**：由于金融领域的监管要求，模型需要具备一定的解释性，作者可能会讨论如何通过该方法提取出易于理解的分类规则。 5. **应用与未来研究方向**：可能探讨了该方法在实际信用评估中的应用前景，以及未来可能的研究方向，如改进特征选择算法、优化CNN架构等。 Hayashi和Takano的这篇IEEE论文为解决异质属性数据集的分类问题提供了一个新颖且具有潜力的解决方案，强调了模型的透明度和简洁性，这对于金融行业的信用评估尤其重要。

Electronics 2020, 9, 1318 3 of 15

multi-layer perceptron (DIMLP) ensembles, which was a pioneering work on rule extraction for NN

ensembles. Setiono et al. [

] ﬁrst proposed a unique algorithm for concise rule extraction using the

concept of recursive-rule extraction. As a promising means to address the “black box” problem, a rule

extraction technology that is well-balanced between accuracy and interpretability was proposed for

shallow NNs [

]. Recently, Hayashi and Oisi [

] proposed a high-accuracy priority rule extraction

algorithm to enhance both the accuracy and interpretability of extracted rules; this is realized by

reconciling both of these criteria.

However, recently, a “new black box” problem caused by highly complex deep neural networks

(DNNs) generated by DL has arisen. To resolve this “new black box” problem, transparency and

interpretability are needed in DNNs. Symbolic rules were initially generated from deep belief

networks (DBNs) by Tran and Garcez d’Avila [

], who trained a DBN using the MNIST dataset.

The present author previously carried out a survey on the right direction needed to develop “white

box” deep learning for medical images [

] and also provided new uniﬁed insights on deep learning

for radiological and pathological images [26].

1.6. Recursive-Rule Extraction (Re-RX) and Related Algorithms

The Re-RX algorithm developed by Setiono et al. [

] repeats a backpropagation NN (BPNN),

NN pruning [

], and a C4.5 decision tree (DT) [

] in a recursive manner. A major advantage of

the Re-RX algorithm, which was designed as a rule extraction tool, is that it provides a hierarchical,

recursive consideration of discrete variables prior to the analysis of continuous data. Additionally,

it can generate classiﬁcation rules from NNs that have been trained based on discrete and continuous

attributes. We previously proposed Re-RX with J48graft [

] for improving the interpretability of

extracted rules, Continuous Re-RX [

] for improving the accuracy of rule extraction, and Continuous

Re-RX with J48graft [18] for high accuracy-priority rule extraction.

2. Motivation for This Work

Motivation for Research

Recently, DL has been applied in many ﬁelds because of its theoretical appeal and remarkable

performance in terms of predictive accuracy. Despite comparisons with standard data mining

algorithms that highlight the superiority of such tools, its application to credit scoring for datasets

with heterogeneous attributes remains limited. Thus, it has become increasingly important to interpret

“black boxes” in machine learning, particularly in regard to convolutional neural networks (CNNs),

because of their lack of transparency. However, previous rule extraction methods are inappropriate for

CNNs, largely because they cannot generate concise and interpretable rules [25].

Explanations are particularly relevant in the banking sector, so “black box” models are approached

with caution. Actually, banking managers are typically unwilling to use DL for credit scoring when

credit is denied to a customer.

As shown in Figure 1, the best trade-oﬀ is when accuracy and interpretability can be enhanced

simultaneously. The black line indicates the trade-oﬀ curve (Pareto optimal), which balances accuracy

and interpretability. The red arrow indicates a shift from the trade-oﬀ curve to the ideal point

(high-accuracy and high-interpretability; most concise). We previously proposed a method to achieve

high accuracy-priority rule extraction [

]. “Black box” classiﬁers can be plotted as black dots placed

vertically on the axis for the test dataset accuracy (TS ACC). These accuracies are often higher than

those obtained using high accuracy-priority rule extraction for credit scoring datasets, which indicates

that the latest high-performance classiﬁer for the Australian dataset does not completely overcome the

accuracy–interpretability dilemma [

]. In this section, as Re-RX with J48graft is the most important

component of our proposed method, we depict it using mathematical notations in Figure 2.

剩余14页未读，继续阅读

身份认证购VIP最低享 7 折!

30元优惠券

weixin_45363356

粉丝: 0

一维卷积神经网络在信用评分数据集特征选择中的应用

Econometrics(Fumio Hayashi)

2024年俄罗斯汽车软内饰材料市场机会及渠道调研报告Sample.pdf

Disteniinae Thomson (Insecta, Coleoptera)_ a protected name.pdf

基于PLC的植物工厂环境监控系统设计.pdf

论文研究-一类最优ZCZ序列集的构造 .pdf

基于改进的数量化理论和RBF神经网络组合方法的地下水水质预测.pdf

matlab集成c代码-bl-network-template-matlab:使用Matlab在大脑生活中进行网络分析的示例模板

unblocker:使受区域限制的网站可访问

2019高考英语一轮基础步练Unit16Stories含解析北师大版必修6

【大数据课设】p105出租车数据可视化分析-大数据-实训大作业.zip

最新资源