
Independently Recurrent Neural Networks

Simple TensorFlow implementation of Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN by Shuai Li et al.

Summary

In an IndRNN, the neurons within a recurrent layer are independent of each other. A basic RNN computes the hidden state as h = act(W * input + U * state + b), where the recurrent matrix U mixes the states of all neurons. An IndRNN replaces the matrix product U * state with an element-wise multiplication u * state, so each neuron has a single recurrent weight connecting it to its own last hidden state.
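
To make the difference concrete, here is a minimal NumPy sketch of a single time step (the sizes and names such as n_in and n_units are illustrative, not taken from ind_rnn_cell.py):

import numpy as np

n_in, n_units = 2, 128
x = np.random.rand(n_in)             # input at time t
h = np.zeros(n_units)                # previous hidden state
W = np.random.randn(n_units, n_in)   # input weights
b = np.zeros(n_units)

# Basic RNN: a full recurrent matrix U mixes all neurons' states
U = np.random.randn(n_units, n_units)
h_rnn = np.maximum(0, W @ x + U @ h + b)

# IndRNN: one recurrent weight per neuron, applied element-wise
u = np.random.uniform(-1, 1, n_units)
h_ind = np.maximum(0, W @ x + u * h + b)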

The IndRNN

  • can be used efficiently with ReLU activation functions, making it easier to stack multiple recurrent layers without saturating gradients
  • allows for better interpretability, as neurons in the same layer are independent of each other
  • prevents vanishing and exploding gradients by regulating each neuron's recurrent weight (see the quick arithmetic check after this list)
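
The recurrent_max value used in the Usage section below follows from this regulation: if each neuron's recurrent weight is clipped to at most pow(2, 1 / TIME_STEPS) in absolute value, its state can grow by at most a factor of 2 over the whole sequence. A quick check in Python:

T = 100
u_max = 2 ** (1 / T)  # per-step bound on the recurrent weight
print(u_max)          # ~1.00696
print(u_max ** T)     # 2.0 -- maximum total amplification over T steps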

Usage

Copy ind_rnn_cell.py into your project.

import tensorflow as tf
from ind_rnn_cell import IndRNNCell

TIME_STEPS = 100  # length of the input sequences

# Regulate each neuron's recurrent weight as recommended in the paper
recurrent_max = pow(2, 1 / TIME_STEPS)

cell = tf.nn.rnn_cell.MultiRNNCell(
    [IndRNNCell(128, recurrent_max_abs=recurrent_max),
     IndRNNCell(128, recurrent_max_abs=recurrent_max)])
output, state = tf.nn.dynamic_rnn(cell, input_data, dtype=tf.float32)
...
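
For context, a hedged end-to-end sketch of how input_data could be defined and the graph executed (TF 1.x API; the batch size of 32 and the feature size of 2 are illustrative placeholders):

import numpy as np
import tensorflow as tf
from ind_rnn_cell import IndRNNCell

TIME_STEPS = 100
recurrent_max = pow(2, 1 / TIME_STEPS)

# [batch, time, features] -- feature size 2 matches the adding problem below
input_data = tf.placeholder(tf.float32, [None, TIME_STEPS, 2])

cell = tf.nn.rnn_cell.MultiRNNCell(
    [IndRNNCell(128, recurrent_max_abs=recurrent_max),
     IndRNNCell(128, recurrent_max_abs=recurrent_max)])
output, state = tf.nn.dynamic_rnn(cell, input_data, dtype=tf.float32)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    batch = np.random.rand(32, TIME_STEPS, 2).astype(np.float32)
    print(sess.run(output, feed_dict={input_data: batch}).shape)  # (32, 100, 128)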

Experiments in the paper

See examples/addition_rnn.py for a script reproducing the "Adding Problem" from the paper. Below are the results reproduced with the addition_rnn.py code.

Results plot (all sequence lengths): https://github.com/batzner/indrnn/raw/master/img/addition/TAll.png

For 100, 500 and 1000 time steps, the behavior is similar to Figure 2 in the paper. For 5000 time steps, the IndRNN does not converge. My best guess is that this is due to different initialization values for the recurrent, input and softmax weights; these are not specified in the paper and have turned out to make a big difference.
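
If you want to experiment with the initialization yourself, a hypothetical sketch is below. It assumes the cell exposes keyword arguments for the recurrent and input kernel initializers; check the constructor signature in ind_rnn_cell.py for the exact names, as the ones used here are assumptions:

import tensorflow as tf
from ind_rnn_cell import IndRNNCell

TIME_STEPS = 100
recurrent_max = pow(2, 1 / TIME_STEPS)

# Assumed argument names -- verify against ind_rnn_cell.py
cell = IndRNNCell(
    128,
    recurrent_max_abs=recurrent_max,
    recurrent_kernel_initializer=tf.random_uniform_initializer(0.0, recurrent_max),
    input_kernel_initializer=tf.random_normal_initializer(0.0, 0.001))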

More experiments, such as Sequential MNIST, will follow in the coming days.

Requirements

  • Python 3.4+
  • TensorFlow 1.5+
