PLAY PODCASTS
[MINI] The Vanishing Gradient

[MINI] The Vanishing Gradient

Data Skeptic · Kyle Polich

June 30, 201715m 16s

Audio is streamed directly from the publisher (pscrb.fm) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

This episode discusses the vanishing gradient - a problem that arises when training deep neural networks in which nearly all the gradients are very close to zero by the time back-propagation has reached the first hidden layer. This makes learning virtually impossible without some clever trick or improved methodology to help earlier layers begin to learn.