George Hotz | Programming | Decision Transformer Reinforcement Learning (RL) | LunarLander | Part 1
Video Statistics and Information
Channel: george hotz archive
Views: 84,836
Rating: undefined out of 5
Keywords: programming, livecoding, georgehotz, george, hotz, geohot, twitch, github, yt:cc=on, lunarlander, tinygrad, decision, transformer, paper, balancing, temperature, logits, bugs, reinforcement learning, impossible, decision transformer, gym environment, press the light up button, game, embedding, action, reward, broadcast, issue, probability, layer, AGI, progress, scientific, notation, suppress, learning, loss, equity, inclusion, CartPole, pressthelightupbutton, game_length, advice, learn, gradient, game_lenght=32, rl, model, strategy
Id: 8U8kK3SpLTU
Channel Id: undefined
Length: 477min 31sec (28651 seconds)
Published: Tue Jan 09 2024
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.