没有合适的资源?快使用搜索试试~ 我知道了~
首页经济学中的数据科学:机器学习与深度学习方法
资源详情
资源评论
资源推荐
mathematics
Review
Data Science in Economics: Comprehensive Review
of Advanced Machine Learning and Deep
Learning Methods
Saeed Nosratabadi
1
, Amirhosein Mosavi
2,3,
* , Puhong Duan
4
, Pedram Ghamisi
5
,
Ferdinand Filip
6
, Shahab S. Band
7,8,
* , Uwe Reuter
9
, Joao Gama
10
and Amir H. Gandomi
11
1
Doctoral School of Management and Business Administration, Szent Istvan University,
2100 Godollo, Hungary; saeed.nosratabadi@phd.uni-szie.hu
2
Environmental Quality, Atmospheric Science and Climate Change Research Group,
Ton Duc Thang University, Ho Chi Minh, Vietnam
3
Faculty of Environment and Labour Safety, Ton Duc Thang University, Ho Chi Minh, Vietnam
4
College of Electrical and Information Engineering, Hunan University, Changsha 410082, China;
puhong_duan@hnu.edu.cn
5
Helmholtz-Zentrum Dresden-Rossendorf, Helmholtz Institute Freiberg for Resource Technology,
D-09599 Freiberg, Germany; p.ghamisi@hzdr.de
6
Department of Mathematics, J. Selye University, 94501 Komarno, Slovakia; filipf@ujs.sk
7
Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam
8
Future Technology Research Center, College of Future, National Yunlin University of Science and
Technology 123 University Road, Section 3, Douliou, Yunlin 64002, Taiwan
9
Faculty of Civil Engineering, Technische Universität Dresden, 01069 Dresden, Germany;
uwe.reuter@tu-dresden.de
10
Faculty Laboratory of Artificial Intelligence and Decision Support (LIAAD)-INESC TEC,
Campus da FEUP, Rua Roberto Frias, 4200-465 Porto, Portugal; jgama@fep.up.pt
11
Faculty of Engineering and Information Technology, University of Technology Sydney,
Sydney, NSW 2007, Australia; gandomi@uts.edu.au
* Correspondence: amirhosein.mosavi@tdtu.edu.vn (A.M.);
shamshirbandshahaboddin@duytan.edu.vn (S.S.B.)
Received: 10 September 2020; Accepted: 15 October 2020; Published: 16 October 2020
Abstract:
This paper provides a comprehensive state-of-the-art investigation of the recent advances
in data science in emerging economic applications. The analysis is performed on the novel data
science methods in four individual classes of deep learning models, hybrid deep learning models,
hybrid machine learning, and ensemble models. Application domains include a broad and diverse
range of economics research from the stock market, marketing, and e-commerce to corporate banking
and cryptocurrency. Prisma method, a systematic literature review methodology, is used to ensure the
quality of the survey. The findings reveal that the trends follow the advancement of hybrid models,
which outperform other learning algorithms. It is further expected that the trends will converge
toward the evolution of sophisticated hybrid deep learning models.
Keywords:
data science; deep learning; economic model; ensemble; economics; cryptocurrency;
machine learning; deep reinforcement learning; big data; bitcoin; time series; network science;
prediction; survey; artificial intelligence; literature review
1. Introduction
Due to the rapid advancement of databases and information technologies, and the remarkable
progress in data analysis methods, the use of data science (DS) in various disciplines, including
Mathematics 2020, 8, 1799; doi:10.3390/math8101799 www.mdpi.com/journal/mathematics
Mathematics 2020, 8, 1799 2 of 25
economics, has been increasing exponentially [
1
]. Advancements in data science technologies for
economics applications have been progressive with promising results [
2
,
3
]. Several studies suggest that
data science applications in economics can be categorized and studied in various popular technologies,
such as deep learning, hybrid learning models, and ensemble algorithms [
4
]. Machine learning
(ML) algorithms provide the ability to learn from data and deliver in-depth insight into problems [5].
Researchers use machine learning models to solve various problems associated with economics. Notable
applications of data science in economics are presented in Table 1. Deep learning (DL), as an emerging
field of machine learning, is currently applied in many aspects of today’s society, from self-driving cars to
image recognition, hazard prediction, health informatics, and bioinformatics [
5
,
6
]. Several comparative
studies have evaluated the performance of DL models with standard ML models, e.g., support vector
machine (SVM), K-nearest neighbors (KNN), and generalized regression neural networks (GRNN) in
economic applications. The evolution of DS methods has progressed at a fast pace, and every day,
many new sectors and disciplines are added to the number of users and beneficiaries of DS algorithms.
On the other hand, hybrid machine learning models consist of two or more single algorithms and are
used to increase the accuracy of the other models [
7
]. Hybrid models can be formed by combining
two predictive machine learning algorithms or a machine learning algorithm and an optimization
method to maximize the prediction function [
8
]. It has been demonstrated that the hybrid machine
learning models outperform the single algorithms, and such an approach has improved the prediction
accuracy [
9
]. Ensemble machine learning algorithms are one of the supervised learning algorithms that
use multiple learning algorithms to improve learning processes and increase predictive accuracy [
10
].
Ensemble models apply different training algorithms to enhance training and learning from data [
11
].
Table 1. Examples of notable classic machine learning methods applied in economics-related fields.
Sources Machine Learning Models Objectives
Lee et al. [12] Support Vector Regression (SVR) Anomaly Detection
Husejinovi´c [13]
Naive Bayesian And C4.5 Decision Tree
Classifiers
Credit Card Fraud Detection
Zhang [14] Improved BP Neural Network
Aquatic Product Export
Volume Prediction
Sundar and Satyanarayana [15]
Multilayer Feed Forward Neural Network
Stock Price Prediction
Hew et al. [16] Artificial Neural Network (ANN) Mobile Social Commerce
Abdillah and Suharjito [17]
Adaptive Neuro-Fuzzy Inference
System (ANFIS)
E-Banking Failure
Sabaityt
˙
e et al. [18] Decision Tree (DT) Customer Behavior
Zatevakhina, Dedyukhina, and
Klioutchnikov [19]
Deep Neural Network (ANN) Recommender Systems
Benlahbib and Nfaoui [20]
Naïve Bayes and Linear Support Vector
Machine (LSVM)
Sentiment Analysis
Various works exist on the state-of-the-art of DS methods in different disciplines, such as image
recognition [
21
], animal behavior [
22
], renewable energy forecasting [
23
]. Hybrid methods have also
been investigated in various fields, including financial time series [
24
], solar radiation forecasting [
25
],
and FOREX rate prediction [
26
], while ensemble methods have been mostly in the fields, ranging from
breast cancer [
27
], image categorization [
28
], electric vehicle user behavior prediction [
29
], and solar
power generation forecasting [
30
]. Exploring the scientific databases such as Thomson Reuters
Web-of-Science (WoS) shows an exponential rise in using both DL and ML in economics. The results of
an inquiry of essential ML and DL in the emerging applications to economics over the past decade is
illustrated in Figure 1. Even though many researchers have applied DS methods to address different
problems in the field of economics, these studies are scattered. At the same time, no single study
provides a comprehensive overview of the contributions of DS in economic-related fields. Therefore,
the current study is conducted to bridge this literature gap. In other words, the main objective of this
study is to investigate the advancement of DS in three parts: deep learning methods, hybrid deep
learning methods, and ensemble machine learning techniques in economics-related fields. The present
Mathematics 2020, 8, 1799 3 of 25
work aims to answer the following research questions. (1) what are the emerging economics domains
with the involvement of data science technologies? (2) what are the popular data science models and
applications in these domains?
Mathematics 2020, 8, x FOR PEER REVIEW 3 of 27
objective of this study is to investigate the advancement of DS in three parts: deep learning methods,
hybrid deep learning methods, and ensemble machine learning techniques in economics-related
fields. The present work aims to answer the following research questions. (1) what are the emerging
economics domains with the involvement of data science technologies? (2) what are the popular data
science models and applications in these domains?
Figure 1. Rapid rise in the applications of data science in economics.
The rest of the paper is organized as follows. The method by which the database of this article
was formed is described in Section 2. Section 3 presents the findings and discussion, including articles
categorized by sector and area of study and models. A taxonomy of the application of data science in
economics is presented as well. Section 4 discusses the analytical results and the conclusions.
2. Materials and Methods
The current study utilized Prisma, a systematic literature review approach, to find the most
published articles that have applied data science methods for addressing an issue in a field related to
economics. Systematic literature review based on the Prisma method includes four steps: (1)
identification, (2) screening, (3) eligibility, and (4) inclusion [31]. In the identification stage, the
documents are identified through an initial search among the mentioned databases. In this study, the
review was limited to the original peer-review research articles published in Thomson Reuters Web-
of-Science (WoS) and Elsevier Scopus. Today, the Scopus database includes almost entire
authenticated scientific repositories, including WoS. Therefore, to avoid overlapping of the inquiries,
the Scopus has been used as the principal database for search, and the results had been verifies using
the WoS repository. This review was limited to articles written in English. This step resulted in 217
articles.
Our search included essential machine learning and deep learning, which were used as the
search keywords. The search for articles’ sources included both economics and computer science
journals. The screening step includes two stages. First, duplicate items were eliminated, resulting in
135 unique articles that were moved to the next stage, where the relevance of the articles was
examined based on their titles and abstracts. The second stage resulted in 84 articles for further
consideration. The third step of the Prisma model is eligibility, in which the full text of articles was
read by the authors, among which 57 were considered eligible for final review. In fact, at this stage,
after reading the abstract and the full text of the articles, the articles that did not develop a model for
one of the fields related to economics using the machine learning method were removed. The Prisma
model's last step is the creation of the database and is used for qualitative and quantitative analysis.
Figure 1. Rapid rise in the applications of data science in economics.
The rest of the paper is organized as follows. The method by which the database of this article
was formed is described in Section 2. Section 3 presents the findings and discussion, including articles
categorized by sector and area of study and models. A taxonomy of the application of data science in
economics is presented as well. Section 4 discusses the analytical results and the conclusions.
2. Materials and Methods
The current study utilized Prisma, a systematic literature review approach, to find the most published
articles that have applied data science methods for addressing an issue in a field related to economics.
Systematic literature review based on the Prisma method includes four steps: (1) identification,
(2) screening, (3) eligibility, and (4) inclusion [
31
]. In the identification stage, the documents are
identified through an initial search among the mentioned databases. In this study, the review was
limited to the original peer-review research articles published in Thomson Reuters Web-of-Science
(WoS) and Elsevier Scopus. Today, the Scopus database includes almost entire authenticated scientific
repositories, including WoS. Therefore, to avoid overlapping of the inquiries, the Scopus has been
used as the principal database for search, and the results had been verifies using the WoS repository.
This review was limited to articles written in English. This step resulted in 217 articles.
Our search included essential machine learning and deep learning, which were used as the search
keywords. The search for articles’ sources included both economics and computer science journals.
The screening step includes two stages. First, duplicate items were eliminated, resulting in 135 unique
articles that were moved to the next stage, where the relevance of the articles was examined based on
their titles and abstracts. The second stage resulted in 84 articles for further consideration. The third
step of the Prisma model is eligibility, in which the full text of articles was read by the authors, among
which 57 were considered eligible for final review. In fact, at this stage, after reading the abstract
and the full text of the articles, the articles that did not develop a model for one of the fields related
to economics using the machine learning method were removed. The Prisma model’s last step is
the creation of the database and is used for qualitative and quantitative analysis. The current study
database comprises 57 articles, which were all analyzed in this study. Figure 2 illustrates the steps of
creating the database of the present study based on the Prisma method.
Mathematics 2020, 8, 1799 4 of 25
Mathematics 2020, 8, x FOR PEER REVIEW 4 of 27
The current study database comprises 57 articles, which were all analyzed in this study. Figure 2
illustrates the steps of creating the database of the present study based on the Prisma method.
217 records identified through
database searching
10 records identified through
cross-referencing
135 records after duplications
removed
135 screened for relevance
84 full-text articles assessed for
eligibility
57 articles included in
qualitative analysis
51 records excluded after
reading title and abstract
27 records excluded
Identification
Inclusion
Eligibility
Screening
Figure 2. Diagram of the systematic selection, evaluation, and quality control of the database using
the Prisma model.
3. Findings and Discussion
Figure 2 shows that this study's database consists of 57 articles that were analyzed and
categorized according to two criteria: (1) research/application area, and (2) the method type. Based
on the review of articles by application, it was found that these articles were designed to address the
issues of five different applications, namely the Stock Market (37 articles), Marketing (6 articles), E-
commerce (8 articles), Corporate Bankruptcy (3 articles), and Cryptocurrency (3 articles) (Tables 2–
7). In addition, the articles were analyzed by the type of method, revealing that 42 unique algorithms
were employed among the 57 reviewed articles (see Figure 3). It was further found that 9 articles used
9 single DL models (Table 8), 18 hybrid deep learning (HDL) models (Table 9), 7 hybrid machine
learning models (Table 10), and 8 ensemble models (Table 11). In the following section, the identified
applications and each of these methods are described in detail.
Figure 2.
Diagram of the systematic selection, evaluation, and quality control of the database using the
Prisma model.
3. Findings and Discussion
Figure 2 shows that this study’s database consists of 57 articles that were analyzed and categorized
according to two criteria: (1) research/application area, and (2) the method type. Based on the review
of articles by application, it was found that these articles were designed to address the issues of
five different applications, namely the Stock Market (37 articles), Marketing (6 articles), E-commerce
(8 articles), Corporate Bankruptcy (3 articles), and Cryptocurrency (3 articles) (Tables 2–7). In addition,
the articles were analyzed by the type of method, revealing that 42 unique algorithms were employed
among the 57 reviewed articles (see Figure 3). It was further found that 9 articles used 9 single DL
models (Table 8), 18 hybrid deep learning (HDL) models (Table 9), 7 hybrid machine learning models
(Table 10), and 8 ensemble models (Table 11). In the following section, the identified applications and
each of these methods are described in detail.
Mathematics 2020, 8, 1799 5 of 25
Mathematics 2020, 8, x FOR PEER REVIEW 5 of 27
Figure 3. Notable methods of deep learning and hybrid deep learning models applied in economics-
related fields; the size of the rectangle is proportional to the number of publications (source: WoS).
3.1. Applications of Data Science in Economics
3.1.1. Stock Market
Applying deep learning in the stock market has become more common than in other economics
areas, considering that most of the research articles reviewed in the present study are classified in
this category (37 out of 57). Table 2 summarizes the articles that employed predictive models in stock
market studies, including research objectives, data sources, and applied models of each article.
Investment in the stock market is profitable, while the higher the profit, the higher the risk. Therefore,
investors always try to determine and estimate the stock value before any action. The stock value is
often influenced by uncontrollable economical and political factors that make it notoriously difficult
to identify the future stock market trends. Not only is the nature of the stock market so volatile and
complex, but the financial time series data are also noisy and nonstationary. Thus, the traditional
forecasting models are not reliable enough to predict the stock value. Researchers are continuously
seeking new methodologies based on DS algorithms to enhance the accuracy of such predictions.
Forecasting stock price was found to be the objective of 29 out of 37 articles. Other studies aimed at
applying DS in sentiment analysis, or the analysis of the context of texts to extracts subjective
information, to identify future trends in the stock market. In addition, portfolio management,
algorithmic trading (i.e., using a pre-programmed automated system for trading), automated stock
trading, socially responsible investment portfolios, the S&P 500 index trend prediction, and
exchange-trade-fund (EFT) options prices prediction were the objectives of other articles that
projected to employ DS methods. Financial time series served as the data source of all these studies,
except for the studies aimed at sentiment analysis, which used different data sources, such as social
media and financial news.
LSTM
Long short-term memory (LSTM) networks are a special kind of recurrent neural network
(RNN) that can overcome the main issue of RNN, i.e., vanishing gradients using the gates to retain
relevant information and discard unrelated details selectively. The structure of an LSTM neural
network is shown in Figure 4, which is composed of a memory unit 𝐶
, a hidden state ℎ
and three
types of gates, where 𝑡 indexes the time step. Specifically, for each step 𝑡, LSTM receives an input
𝑥
and the previous hidden state ℎ
then calculates the activation of the gates. Finally, the memory
unit and the hidden state are updated. The computations involved are described below:
Figure 3.
Notable methods of deep learning and hybrid deep learning models applied in economics-related
fields; the size of the rectangle is proportional to the number of publications (source: WoS).
3.1. Applications of Data Science in Economics
3.1.1. Stock Market
Applying deep learning in the stock market has become more common than in other economics
areas, considering that most of the research articles reviewed in the present study are classified in this
category (37 out of 57). Table 2 summarizes the articles that employed predictive models in stock market
studies, including research objectives, data sources, and applied models of each article. Investment in
the stock market is profitable, while the higher the profit, the higher the risk. Therefore, investors always
try to determine and estimate the stock value before any action. The stock value is often influenced by
uncontrollable economical and political factors that make it notoriously difficult to identify the future
stock market trends. Not only is the nature of the stock market so volatile and complex, but the financial
time series data are also noisy and nonstationary. Thus, the traditional forecasting models are not
reliable enough to predict the stock value. Researchers are continuously seeking new methodologies
based on DS algorithms to enhance the accuracy of such predictions. Forecasting stock price was found
to be the objective of 29 out of 37 articles. Other studies aimed at applying DS in sentiment analysis,
or the analysis of the context of texts to extracts subjective information, to identify future trends in the
stock market. In addition, portfolio management, algorithmic trading (i.e., using a pre-programmed
automated system for trading), automated stock trading, socially responsible investment portfolios,
the S&P 500 index trend prediction, and exchange-trade-fund (EFT) options prices prediction were
the objectives of other articles that projected to employ DS methods. Financial time series served as
the data source of all these studies, except for the studies aimed at sentiment analysis, which used
different data sources, such as social media and financial news.
LSTM
Long short-term memory (LSTM) networks are a special kind of recurrent neural network (RNN)
that can overcome the main issue of RNN, i.e., vanishing gradients using the gates to retain relevant
information and discard unrelated details selectively. The structure of an LSTM neural network is
shown in Figure 4, which is composed of a memory unit
C
t
, a hidden state
h
t
and three types of gates,
where
t
indexes the time step. Specifically, for each step
t
, LSTM receives an input
x
t
and the previous
hidden state
h
t−1
then calculates the activation of the gates. Finally, the memory unit and the hidden
state are updated. The computations involved are described below:
剩余24页未读,继续阅读
syp_net
- 粉丝: 158
- 资源: 1196
上传资源 快速赚钱
- 我的内容管理 收起
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
会员权益专享
最新资源
- zigbee-cluster-library-specification
- JSBSim Reference Manual
- c++校园超市商品信息管理系统课程设计说明书(含源代码) (2).pdf
- 建筑供配电系统相关课件.pptx
- 企业管理规章制度及管理模式.doc
- vb打开摄像头.doc
- 云计算-可信计算中认证协议改进方案.pdf
- [详细完整版]单片机编程4.ppt
- c语言常用算法.pdf
- c++经典程序代码大全.pdf
- 单片机数字时钟资料.doc
- 11项目管理前沿1.0.pptx
- 基于ssm的“魅力”繁峙宣传网站的设计与实现论文.doc
- 智慧交通综合解决方案.pptx
- 建筑防潮设计-PowerPointPresentati.pptx
- SPC统计过程控制程序.pptx
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功
评论0