Tutorial on Adaptive Computation Time for Recurrent Neural Networks

This jupyter notebook is a tutorial and demonstration implementation of Alex Graves's paper Adaptive Computation Time for Recurrent Neural Networks. Code is based on previous PyTorch implementation by Jason Phang

The notebook connects the formulas used in the paper to the code that implements those formulas by implementing a training pipeline on a small but meaningful dataset

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Tutorial on Adaptive Computation Time for Recurrent Neural Networks

Files

README.md

Latest commit

History

README.md

File metadata and controls

Tutorial on Adaptive Computation Time for Recurrent Neural Networks