Understanding STaR and how it powers Claude and Gemini/Gemma 2 (and maybe OpenAI Q* or Strawberry)
Video Statistics and Information
Channel: Chris Hay
Views: 2,619
Keywords: chris hay, chrishayuk, STaR, Self Taught Reasoner, claude 3.5 sonnet, google gemma 2b, NVidia Nemotron reward, OpenAI Strawberry, Q*, OpenAI Q*, synthetic data, ai
Id: SMCswGP4lA4
Length: 22min 48sec (1368 seconds)
Published: Tue Jul 16 2024