Training more effective learned optimizers, and using them to train themselves (Paper Explained)
Video Statistics and Information
Channel: Yannic Kilcher
Views: 16,815
Rating: undefined out of 5
Keywords: deep learning, machine learning, arxiv, explained, neural networks, ai, artificial intelligence, paper, optimization, lstm, taskset, google, google research, compute, outer optimization, adam, adamw, sgd, momentum, learning rate, gradient, learned optimizer, second moment, cnn, rnn, paper explained, neural network, gradient descent, hyper parameters, grid search, mnist, cifar10, imagenet
Id: 3baFTP0uYOc
Channel Id: undefined
Length: 53min 35sec (3215 seconds)
Published: Sat Oct 03 2020
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.