(if answer is ready: stop wasting computation) On tasks where the complexity needed to answer...
Top-Down explanation of Graves’ 2016 paper “Adaptive Computation Time for Recurrent Neural Networ...
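
The "stop once the answer is ready" idea can be illustrated with the halting rule from the ACT paper: at each pondering step the network emits a halting probability, and it stops at the first step where the running sum reaches 1 - eps. The sketch below is a minimal pure-Python illustration, not the author's implementation. The function name `act_halting_weights`, the default `eps=0.01`, and the choice to force a halt on the last supplied step are assumptions made for this example.

```python
def act_halting_weights(halting_probs, eps=0.01):
    """Illustrative ACT halting rule (hypothetical helper, not from the source).

    halting_probs: per-step halting probabilities in [0, 1], in step order.
    Returns (weights, n_steps): the weight given to each executed step and
    the number of steps actually executed.
    """
    if not halting_probs:
        raise ValueError("halting_probs must be non-empty")

    weights = []
    cumulative = 0.0
    last_index = len(halting_probs) - 1
    for n, h in enumerate(halting_probs):
        # Halt when the running sum reaches 1 - eps. Assumption for this
        # sketch: the last supplied step acts as a hard cap on pondering.
        if cumulative + h >= 1.0 - eps or n == last_index:
            # The final step receives the remainder so the weights sum to 1.
            weights.append(1.0 - cumulative)
            return weights, n + 1
        weights.append(h)
        cumulative += h


# An "easy" input halts after one step, a "harder" one ponders longer.
easy_weights, easy_steps = act_halting_weights([0.995, 0.5, 0.5])
hard_weights, hard_steps = act_halting_weights([0.1, 0.3, 0.8, 0.9])
print(easy_steps, hard_steps)
```

The returned weights are what would be used to average the intermediate states and outputs, so steps after the halting point cost no computation.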