Fig. 1. Construction of input sequences for LSTM networks (both feature vector and sequences are shown transposed).
Fig. 2. Structure of LSTM memory cell following Graves (2013) and Olah (2015).
consisting of so-called memory cells. Each of the memory cells has three gates maintaining and adjusting its cell state $s_t$: a forget gate ($f_t$), an input gate ($i_t$), and an output gate ($o_t$). The structure of a memory cell is illustrated in Fig. 2.
At every timestep $t$, each of the three gates is presented with the input $x_t$ (one element of the input sequence) as well as the output $h_{t-1}$ of the memory cells at the previous timestep $t-1$. The gates act as filters, each fulfilling a different purpose:
• The forget gate defines which information is removed from the cell state.
• The input gate specifies which information is added to the cell state.
• The output gate specifies which information from the cell state is used as output.
The equations below are vectorized and describe the update of the memory cells in the LSTM layer at every timestep $t$. The following notation is used (a short sketch after the list makes the shapes of these quantities concrete):
• $x_t$ is the input vector at timestep $t$.
• $W_{f,x}$, $W_{f,h}$, $W_{\tilde{s},x}$, $W_{\tilde{s},h}$, $W_{i,x}$, $W_{i,h}$, $W_{o,x}$, and $W_{o,h}$ are weight matrices.
• $b_f$, $b_{\tilde{s}}$, $b_i$, and $b_o$ are bias vectors.
• $f_t$, $i_t$, and $o_t$ are vectors for the activation values of the respective gates.
• $s_t$ and $\tilde{s}_t$ are vectors for the cell states and candidate values.
• $h_t$ is a vector for the output of the LSTM layer.
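To make the notation concrete, the following NumPy sketch sets up weight matrices and bias vectors with matching shapes. The dimensions, the random initialization, and the dictionary keys are assumptions for illustration only, not the configuration used in this study.

```python
import numpy as np

# Illustrative dimensions (assumptions, not the values used in this study):
input_dim = 1     # one standardized return per timestep
hidden_dim = 25   # number of memory cells in the LSTM layer

rng = np.random.default_rng(0)

def init_params(input_dim, hidden_dim):
    """Weight matrices W_{.,x}, W_{.,h} and bias vectors b_. for the four parts of the cell:
    forget gate (f), candidate values (s, standing for s-tilde), input gate (i), output gate (o)."""
    params = {}
    for gate in ("f", "s", "i", "o"):
        params[f"W_{gate}x"] = rng.normal(scale=0.1, size=(hidden_dim, input_dim))   # applied to x_t
        params[f"W_{gate}h"] = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))  # applied to h_{t-1}
        params[f"b_{gate}"] = np.zeros(hidden_dim)                                   # bias vector
    return params

params = init_params(input_dim, hidden_dim)
```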
During a forward pass, the cell states $s_t$ and outputs $h_t$ of the LSTM layer at timestep $t$ are calculated as follows:

In the first step, the LSTM layer determines which information should be removed from its previous cell states $s_{t-1}$. Therefore, the activation values $f_t$ of the forget gates at timestep $t$ are computed based on the current input $x_t$, the outputs $h_{t-1}$ of the memory cells at the previous timestep ($t-1$), and the bias terms $b_f$ of the forget gates. The sigmoid function finally scales all activation values into the range between 0 (completely forget) and 1 (completely remember):
$$ f_t = \operatorname{sigmoid}\left(W_{f,x} x_t + W_{f,h} h_{t-1} + b_f\right). \qquad (3) $$
In the second step, the LSTM layer determines which information should be added to the network's cell states ($s_t$). This procedure comprises two operations: first, candidate values $\tilde{s}_t$, which could potentially be added to the cell states, are computed. Second, the activation values $i_t$ of the input gates are calculated:

$$ \tilde{s}_t = \tanh\left(W_{\tilde{s},x} x_t + W_{\tilde{s},h} h_{t-1} + b_{\tilde{s}}\right), \qquad (4) $$

$$ i_t = \operatorname{sigmoid}\left(W_{i,x} x_t + W_{i,h} h_{t-1} + b_i\right). \qquad (5) $$
In the third step, the new cell states $s_t$ are calculated based on the results of the previous two steps, with $\circ$ denoting the Hadamard (elementwise) product:

$$ s_t = f_t \circ s_{t-1} + i_t \circ \tilde{s}_t. \qquad (6) $$
In the last step, the output $h_t$ of the memory cells is derived as denoted in the following two equations:

$$ o_t = \operatorname{sigmoid}\left(W_{o,x} x_t + W_{o,h} h_{t-1} + b_o\right), \qquad (7) $$

$$ h_t = o_t \circ \tanh\left(s_t\right). \qquad (8) $$
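Read together, Eqs. (3)–(8) amount to one forward step of the memory cells. The following NumPy sketch spells this step out; it assumes the hypothetical parameter dictionary from the sketch above and is meant as an illustration of the equations, not as the implementation used in this study.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, s_prev, p):
    """One forward step of the memory cells, following Eqs. (3)-(8).

    x_t    : input vector at timestep t
    h_prev : output h_{t-1} of the memory cells at the previous timestep
    s_prev : cell states s_{t-1} at the previous timestep
    p      : parameter dictionary as produced by init_params above (hypothetical layout)
    """
    f_t     = sigmoid(p["W_fx"] @ x_t + p["W_fh"] @ h_prev + p["b_f"])    # Eq. (3): forget gate
    s_tilde = np.tanh(p["W_sx"] @ x_t + p["W_sh"] @ h_prev + p["b_s"])    # Eq. (4): candidate values
    i_t     = sigmoid(p["W_ix"] @ x_t + p["W_ih"] @ h_prev + p["b_i"])    # Eq. (5): input gate
    s_t     = f_t * s_prev + i_t * s_tilde                                # Eq. (6): Hadamard products
    o_t     = sigmoid(p["W_ox"] @ x_t + p["W_oh"] @ h_prev + p["b_o"])    # Eq. (7): output gate
    h_t     = o_t * np.tanh(s_t)                                          # Eq. (8): output of the cells
    return h_t, s_t
```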
When processing an input sequence, its features are presented timestep by timestep to the LSTM network. The input at each timestep $t$ (in our case, a single standardized return) is processed by the network as denoted in the equations above. Once the last element of the sequence has been processed, the final output for the whole sequence is returned.
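A short continuation of the NumPy sketches above illustrates this timestep-by-timestep processing; the sequence length of 240 and the layer size are assumptions chosen for illustration only.

```python
import numpy as np

def lstm_forward(sequence, params, hidden_dim):
    """Present one input sequence to the LSTM layer timestep by timestep (Eqs. (3)-(8))
    and return the final output for the whole sequence.
    Reuses lstm_step and the hypothetical parameter layout from the sketches above."""
    h_t = np.zeros(hidden_dim)   # no previous output at the first timestep
    s_t = np.zeros(hidden_dim)   # empty initial cell state
    for x_t in sequence:                                          # one standardized return per timestep
        h_t, s_t = lstm_step(np.atleast_1d(x_t), h_t, s_t, params)
    return h_t                                                    # returned once the last element is processed

# Example: a sequence of 240 standardized returns (length chosen for illustration).
sequence = np.random.default_rng(1).standard_normal(240)
final_output = lstm_forward(sequence, init_params(1, 25), hidden_dim=25)
```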
During training, as with traditional feed-forward networks, the weights and bias terms are adjusted such that they minimize the loss of the specified objective function across