没有合适的资源?快使用搜索试试~ 我知道了~
首页使用深度学习进行股票选择和预测-研究论文
使用深度学习进行股票选择和预测-研究论文
需积分: 0 462 浏览量
更新于2023-05-29
评论
收藏 753KB PDF 举报
本文对股票的选择过程进行了系统分析,并使用深度学习对该股票进行了预测。 无需任何先验知识即可学习庞大的原始数据集并从中提取重要数据的能力使其具有吸引力。 深度学习模型的性能高度依赖于优化器,损失函数,网络结构,激活函数和其他参数。 本文旨在研究自动编码器神经网络在选股过程中的性能,并使用长短期记忆(LSTM)模型对这些股票进行未来预测。 该研究涉及标准普尔500指数中每家公司2013年至2018年的5年静态每日数据。该研究的假设是,用户愿意将固定数量的股票投资并已经拥有股票。 自动编码器已应用于主要根据用户要投资的数量和所有股票的最后一天收盘价进行过滤以考虑承受能力的股票。 原始数据和重新创建的数据之间的RMSE得分将用于选择股票。 考虑到用户的行为是中立的,因此选择了RMSE得分最低的前50只股票和RMSE得分最高的倒数50只股票。 这些选定股票的潜在特征将被检查。 在这100种股票中,将通过检查它们与用户已经拥有的股票之间的相关性来选择不同行业的20种股票。 LSTM模型将使用adam优化器预测这20只股票的第二天未来价格,经验结果表明,RMSE得分最低的前50只股票将带来低回报和低风险。 这意味着自动编码器结果中排名前50位的股票为蓝筹股。 同样,具有高RMSE得分的排名前50位的股票也会带来高风险的高回报。 这意味着自动编码器结果中的前50名股票给出了小盘股。 自编码器神经网络后,相关估计得到了显着改善。 通过使用adam优化程序,使用LSTM进行的未来一天库存预测可以提供均方根误差为2.22的高精度。 我们的研究提供了一些见识和有用的路线图,可用于进一步研究使用深度学习网络进行的股票市场分析和预测。
资源详情
资源评论
资源推荐

Stock Selection and Prediction using Deep Learning
Manasi Naik
Student of MTech Data Science (NMIMS university)
Mumbai, India
manasinaik.nmims@gmail.com
Nilambar Mishra
Team Manager (L & T Financial Services)
Mumbai, India
nmmishra@gmail.com
Abstract— This paper offers systematic analysis on
selection process of stocks and prediction of same using Deep
Learning. The ability to learn huge raw dataset and extract
important data from it, without any prior knowledge makes it
attractive. The performance of Deep Learning Model is highly
dependent on optimizer, loss function, network structure,
activation function, and other parameters. This paper is an
attempt to study the performance of Autoencoder Neural
Network in stock selection process, and future prediction of
these stocks using Long Short-Term Memory (LSTM) model.
The study involves static 5 years daily data from 2013 to 2018
of each company in S & P 500. Assumption for this study is
that user is willing to invest a fix amount in stocks and owns a
stock already. Autoencoder has been applied on stocks which
are primarily filtered according to the amount user wants to
invest and the last day closing price of all the stocks to consider
affordability. RMSE score, between original data and
recreated data, will be used to select stocks. Considering
behaviour of user is neutral, top 50 stocks with least RMSE
score and bottom 50 stocks with most RMSE score have been
selected. Latent Features of these selected stocks will be
examined. Among these 100 stocks, 20 stocks from different
sectors will be selected by checking the correlation of them
with the stock user already owns. LSTM model will predict the
next day future price of these 20 stocks with high accuracy
using adam optimiser.
Empirical results suggest that top 50 stocks with least
RMSE score will give low return with low risk. This implies
that top 50 stocks from the result of autoencoder gives blue
chip stocks. Similarly bottom 50 stocks with high RMSE score
will give high return with high risk. This implies that top 50
stocks from the result of autoencoder gives small cap stocks.
Correlation estimation is noticeably improved after
Autoencoder Neural Network. Future one day stock prediction
using LSTM gives high accuracy with Root Mean Square
Error 2.22. Our study offers handful insights and useful
roadmap for further investigation of stock market analysis and
prediction using Deep Learning Networks.
Keywords— Autoencoder Neural Network, Correlation
Matrix, Future Prediction, LSTM, RMSE, Stocks Selection,
Stocks Prediction.
I. INTRODUCTION
Autoencoder Neural Network is a data compression
model. Autoencoder is a tricky three layered neural network,
known as auto-association before, constructs the "building
block" of deep learning, which has been demonstrated to
achieve good performance in various domains [1]. It has
been used for image reconstruction and face recognition [2].
In modern finance the major challenge we face is
selection of stocks for investment, since there are many
stocks in the market. The compression model of
Autoencoder will help to compress huge data of companies
and recreate it. The Root Mean Square Error of original and
recreated data will give the stocks that explains the market.
Latent Features will have the most important data. It will
help us to understand better correlation between stocks. This
will help to select appropriate stocks from the market. As
correlation between stock and return will provide important
strategic information. It will help us in dynamic allocation
[3].
Risk Return analysis is an important factor in financial
studies, as it plays a key role in most individual decision
making process [4]. The risk return analysis will show the
complexity involved in stocks. Beta is important in CAPM
model. It measures the volatility of the expected return of the
asset in comparison to the market. In this study we will
check the top 50 companies having least RMSE score after
processing data through Autoencoder. After plotting risk-
return plot of top 50 companies, it will show that these
companies have low risk low return. This indicates these
companies might be large cap companies. Similarly risk
return plot of bottom 50 companies with higher RMSE score,
will tend to have high risk and high return. It indicates these
companies might be small cap companies. we have used
CAPM methodology to calculate risk return array.
Prediction of the stocks is another major problem faced
by investors. It has been observed that there are regular and
repetitive changes in stock market in some particular time
interval [5]. This seasonal change in the market is due to
production and sale of seasonal utilities. For example
woollen cloth sale will increase in winter [6]. To overcome
this, we require model which will handle long term
dependency while predicting price in stock market. Long
Short-Term Memory networks (LSTM) are special type of
Recurrent Neural Network (RNN). LSTM can learn long
term dependency. Due to this it learns long term seasonal
effect and gives high accuracy in prediction of next day
price. LSTM model used in this paper is giving around
0.0031 loss while predicting next day price.
II. LITERATURE REVIEW
The research paper has used CORN-K algorithm,
RACORN-K algorithm for portfolio selection. In this paper
DJIA, MSCI, SP500(N), HSI datasets have been used. It
shows promising performance on several stock markets. But
the risk is largely ignored when optimizing portfolios, which
may lead to unreliable profits, particularly in volatile
markets. [7].
The research paper has used Naive Bayes Algorithm.
Author has automated stock purchase model algorithm. It is
useful to create online web portal for trading.[8]
The research paper has used Linear Regression and
Neural Network. Authors have proved that ANN techniques
give more accurate result than Linear Regression. The
drawback is, it is applicable to only single company [9].
The research paper has used GARCH (1,1), EGARCH
(1,1), TGARCH (1,1) algorithms to predict the future 19
Electronic copy available at: https://ssrn.com/abstract=3753033

















安全验证
文档复制为VIP权益,开通VIP直接复制

评论0