Grokking: Generalization beyond Overfitting on small algorithmic datasets (Paper Explained)
Video Statistics and Information
Channel: Yannic Kilcher
Views: 9,820
Rating: 4.9935274 out of 5
Keywords: deep learning, machine learning, arxiv, explained, neural networks, ai, artificial intelligence, paper, grokking, openai, double descent, belkin, overfitting, bias variance, steps, training, binary tables, binary operations, binary operation, multiplication table, algorithmic datasets, groups, s5 group, deep learning algorithmic, deep learning generalization, generalization research, why do neural networks generalize
Id: dND-7llwrpw
Length: 29 min 47 sec (1787 seconds)
Published: Wed Oct 06 2021