Rabbit R1: The First Personal AI AGENT Device NO ONE Saw Coming (Look Out, Apple)

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
a company called rabbit just launched what I believe is the future of computing it is an incredible device that is a dedicated AI assistant and with no rumors or press prior to yesterday the rabbit R1 has set the internet of Blaze take a look at this incredible demo because you are going to be blown [Music] away hi everyone my name is Jesse and I'm the founder and CEO of rabbit I'm so excited to be here today to present you two things we've been working on a revolutionary New Foundation model and a groundbreaking consumer mobile device powered by it our mission is to create the simplest computer something so intuitive that you don't need to learn how to use it the best way to achieve this is to break away from app-based operating system currently used by smartphones instead that we envision a natural language centered approach the computer we're building which we call a companion should be able to talk to understand and more importantly get things done for you the future of human machine interfaces should be more intuitive now before we get started let's take a look at the existing mobile devices that we use daily the one device that's in your pocket the smartphones like iPhone and Android phones these guys been here for years and we've grown tired of them the problem with these devices however is not the hardware phone factor it's what's inside the app based operating system want to get a right to the office there's the app for that want to buy groceries there's another app for that each time you want to do something you fumble through multiple pages and folders to find the app you want to use and there are always endless buttons that you need to click add to the cards go to the next page check the boxes and jumping back and forth and so on the smartphone was supposed to be intuitive but with hundreds of apps on your phone today that don't work together it no longer is if you look at the top ranking apps on App Stores today you'll find that most of them focus on entertainment our smartphones has become the best device to kill time instead of saving them it's just harder for them to do things many people before us have tried to build a simpler and more intuitive computers with AI a decade ago companies like apple Microsoft and Amazon made Siri contana and Alexa with these smart speakers often they either don't know what you're talking about or fail to accomplish the task we asked for recent achievements in large language models however or llms a type of AI technology have made it much easier for machines to understand you the popularity of L chatbox over the past years has shown that the natural language based experience is the PA forward however where this assistance struggle is still getting things done for example if you go to the chbt and use your Expedia plugin to book a ticket it can suggest options but ultimately cannot assist you in completing the booking process from start to finish things like Chad gbt are extremely good at understanding your intentions but could be better at trigging actions another Hot Topic is a field of research around what they call agents it has caught the eye of many open- Source projects and productivity software companies what remains to be solved is for these agents to perform tasks end to end accurately and speedily the problem is forcing a model to perform a task it is not designed for whether for a language model to reason about web page using super prompts or screenshots we have yet to produce an agent as good as users simply clicking the buttons to fulfill our vision of a delightful intuitive companion we must go beyond a piece of complex software we want it to be in the hands of everyone so we first set up to fundamentally understand how computer apps are structured and more importantly how humans interact with them we wanted to find a way for our AI to trigger auctions on behalf of users across all environments we want it to be Universal not just a chrome Plug-In or a limited set of apps but everything iOS Android and desktop these applications share something in common the interface they all have a user interface so at a philosophical level if we can make an AI trigger actions on any kind of interface just like a human would we will solve the problem this Insight led us to create the large action model or lamb as we call it it is new foundational model that understands and executes human intentions on computers driven by our research in neuros symbolic systems with a large action model we fundamentally find a solution to the challenges that apps apis or or agent phas we solve it with interfaces lamb can learn any interfaces from any software regardless of which platform they're running on in short the large language model understands what you say but a large action model get things done we use lamb to bring AI from words to action finally we can build a computer that in addition to understanding what you're trying to saying can actually help you do things on your behalf we pack the large action model into a advanced rabbit OS operating system it is built for realtime interactions between you and the rabbit powered by lamp large action models concept and test results are so powerful that we decided to make a one of kind mobile device introducing R1 your pocket companion [Music] [Music] [Music] this is the rabbit R1 designed in collaboration with teenage engineering the R1 is a fully Standalone device primarily driven by natural language the accumulation of countless hours of work from our research design and software teams it is simple to use R1 is equipped with everything you need to interact with you and perceive your surroundings it comes with a touchcreen a push to talk button and an analog scroll wheel a microphone and speakers and a computer vision enabled 360 degree rotational camera we call the Rabid eye it is Bluetooth and Wi-Fi capable and I has the same car slots that supports Global 4G LTE network and now let's see how it works I can ask rabbit for anything just like if I'm using chbt the difference though is a speed with rabid OS I get a response 10 times faster than most of the voice AI project rabbit answers my questions within 500 milliseconds with a push to talk button you don't need to say anything to wake it up just press and hold the button and talk like a walkie-talkie now let's give it a shot what's the nature of the reality the nature of reality is a topic that has captivated the minds of philosophers and iners throughout history according to Bertrand Russell reality is composed of hard data derived from sensory experiences and logical analysis it is the amalgamation of our perceptions thoughts and the external world I can speak or type with R1 if you shake R1 a keyboard will pop up you will also know that the response speat is much faster than using some of the L assistant apps with web browser such as ching what's the stock price of Coca-Cola searching for the stock price of Coca-Cola the stock price of Coca-Cola KO is $59.7 who played the role Oppenheimer in Christoper Nolan's latest movies in Christopher Nolan's most recent film Oppenheimer the role of Jay Robert Oppenheimer is played by Killian Murphy the film is a biographical Thriller that explores the story of American scientist Jay Robert himer and his involvement in the development of the atomic bomb now let's see how the large action model works with lamb rabbit R1 can interact with all kinds of applications introducing the rabbit hole wipe portal I can log into different services and unlock functionalities for R1 it's just like iCloud let's say I want to listen to some music with a rabbit hole web portal I have complete control over which Services I want to activate on my R1 and which provider I prefer for music I use Spotify so let's go ahead and connect my Spotify to the rabbit hole now that this is Spotify login page before I start the login process I want to emphasize our commitment to privacy the way we design the infrastructure for app that we interact with we don't create fake users or spam users we don't hack their infrastructures and of course we do not store any of your thirdparty credentials instead we redirect you to their login page we never save your username and password and we don't track your data the authentication happens on the respective apps now I'm logging with Spotify rabbit OS recognize that Spotify has being authenticated on my device before this presentation I pre-authenticated some other apps like uber and door Dash just choose your preferred services and loging through them to get authenticated on your device it's that easy now let's play some music play craftwork pocket calculator playing pocket calculator by craftwork for you play another song from the album sure playing computer love for you that's all the tracks from the Same album who wrote the lyrics for this song [Music] Ralph hutter and Emil schul wrote the lyrics for the song Computer Love by craftwork which band sampled this song it seems that Coldplay talk is the only known example of a song that sampled Computer Love what do you think about this song Computer Love by craftwork is widely regarded as a groundbreaking and influential song in the realm of electronic music its Fusion of catchy Melodies robotic vocals and inovative use of synthesizers has left a lasting impact on the genre I can also use R1 to call a ride get me ride from my office to home now of course I will book an Uber ride for you from your office to your home please confirm the ride I have six people with three blockages find me Uber that can fit all of us for six people and three pieces of luggage I recommend booking an Uber XL as it provides ample space for all passengers and luggage please confirm the ride the ride shows up I just hit confirm over's on my way I can also use R1 to order food get me a 12 in pizza from Pizza Hut deliver to here the most ordered option on the app is fine ordering a 12-in pizza from Pizza Hut since you mentioned that the most ordered option is fine I will select that for you I just created an order for a 12-in pizza it's going to be hand tossed with a classic marinara sauce and topped with regular cheese please confirm your order that sounds really good I just confirmed order here now I'm waiting for my pizza I can also make R1 to a set of more complex actions that may take multiple apps to finish on a smartphone for example if I want to plan a trip to London I can just describe what I wish in full to R1 I want to take my family to London it's going to be two of us and a child of age 12 we're thinking of January 30th to February 5th can you plan the entire trip for me we like cheap non-stop flights grouped seats a cool SUV and a nice hotel that has Wi-Fi exploring ticketing options to make your trip a reality for your trip I found various flight options a range of hotels to choose from and car rentals available please confirm each option individually for further details and booking so it's all been planned out I just confirm confirm confirm and that's it could you come up with a schedule for fun things to do while I'm over there on it I have prepared a detailed travel plan for your trip in summary you will be exploring London's iconic landmarks visiting museums enjoying delicious meals at local restaurants and experiencing the vibrant atmosphere of different neighborhoods navigation details are also prepared and I can help you book tickets for any events you'd like to attend R1 just planned the entire trip for me that's awesome but it seems like this is a little bit too intense can you plan a easy schedule for us sure I'm working on it please take a look at it and let me know what you think it gave me a more relaxed schedule every day with all the details sounds really good and I just confirmed that I can foresee a wonderful trip Isn't that cool huh R1 help me reach others just like a phone but smarter it come with a build-in real-time translator and a notaker removing the final barriers to communication R1 automatically detects the language spoken in the environment and provide the bidirectional translation R1 has an i an onboard camera designed for advanced computer vision it can analyze surroundings and take actions in real time to activate the eye just double tap the button oh funny seeing you here Rick let me take a look fridge can you make me a nice dish that's low in calories let me see how about a green garden omelette it's a delicious and low calorie dish that combines the freshness of broccoli and cabbage with the creaminess of eggs here's a simple recipe for you nice it recognize all the stuff and gave me the actual recipes rabbit eye can also help you with your documents here's the table I've been working on for job questionnaire can you create an additional column that matches candidates who mentioned rabbit then their are questions about how they found us sure let me take a look at the table and add the matching column for you I've processed the table and sent you an email with the results okay now let's check our email I can continue to interact with rabid OS even Beyond R1 let's reply this email directory can you add another column that matches candidates who have included rabbit in their question and are La based I just reply the email h s and I got a refined version from rabid o through my email let's say I have a unique routine or task I cannot do on my phone R1 can do that too we are experimenting with what we call the teach mode just like how I can teach my friend how to skateboard I can show R1 how to do it and it will learn from me this means that any user regardless of technical background can teach R1 to learn new skills so you go to teach Mode start a new session today I will show you how to generate an image of puppy using me Journey from prompt using Discord first I will go to the servers page and click one my own servers since this is only a general image generation I'll go to Mid Journey text Channel then I will use the image command along with the prompt here I'm putting a cute baby wild dog with big eyes animated cartoon on rail 8K okay let's wait for a minute for the engine to start generating the images once it's done let's click on the image to get a link I will then explain to Rabbit OS how to use this rabbit and annotate it so that I can generate anything not just poppies so let's go back to our web portal submit request it takes seconds for the web portal to finish processing and that's it it's that simple now once we finish the training I can go back to my R1 now let's use mid Journey as I told you to generate a picture of a bunny in pixel art style certainly Jesse I will use mid journey to generate a picture of a bunny in pixel art style for you please give me a moment to create the image now here you go you got a image generated on M journey through teach mode watch learn and repeat that's teach mode it's that simple that's all the demos for today with land fast evolving my R1 will eventually help me to do things that can never be achieved on an app based phone speaking of the current APP based phones the first question we ask about ourselves is why would I need a new device if I already have a, iPhone my iPhone can't do any of this at all we do not build rabbit R1 to replace your phone it's just a different generation of devices the app based system were introduced more than 15 years ago and a new generation of native AI power devices are just getting started here's a quick recap R1 is our companion that hosts the large action model with natural language I can use it for a wide range of tasks ask anything direct actions complex actions AI enhanced video calls notaker translator with a rabid eye computer vision and experimental teach mode on the hardware perspective we got a 360 rotational camera a global 4G LTE SIM card a push the talk button and an analog scroll wheel one last thing what about the price now before we reveal our price I want to do a quick comparison here are some of the best phones on market right now you got iPhone you got latest version of Android phones we're looking at somewhere around 700 to $1,000 for a top phone with an app based system I bought my new iPhone 15 Pro Max last year and it's the same experience as my previous ones here are not so smart smart speakers they're asking roughly around $200 but they're all outdated and finally here are a couple of the new things with only large language models you got AI paying asking for $699 plus monthly subscriptions for their base models you got tab asking for $600 and you got meta reband glasses asking for roughly $300 remember these are the things with only large language model we still think these were too expensive we priced the rabbit R1 at $199 no subscription no hidden fees you can order the R1 now at rabbit. and we are shipping Easter 2024 I can't wait for you to experience the R1 for yourself thank [Music] you now how cool is that I've already ordered one it's $199 with no sub subscription and that's an incredible price point as compared to the Humane AI pin which is I think $7 or $800 and you have to have a subscription for it not only that and I know this might be controversial I think it's gorgeous I love the color I love the design and it's by a design firm called teenage engineering which is known for their absolutely exceptional design now what's truly unique about this device is it's completely AI powered and I already ordered one and they're saying that it'll be deliver Ed at the end of March so that's really soon and this is fully AI powered and not only that it can actually learn actions so not only are you operating a computer using natural language but if you come across things that it doesn't know how to do you can teach it to do those tasks and I talked a lot about the end of programming and you're probably thinking what does that have to do with this well this is a device that really gives you a preview of what the future of computing looks like it is natural language directly to a large language model directly computing the end result or whatever the task you want to complete is it's going to do that for you no apps maybe not even an interface eventually just voice and execution now they don't call their AI an llm they call it an L which is a large action model now I don't know if that's just a marketing term for a large language model and they're just calling it something else but what it says here is lamb is a new type of foundation model that understands human intentions on computers and that sounds exactly like a large language model except maybe it's really good at function calling but again that's just a large language model and so they actually put together a bunch of their research it's not quite a white paper but they do talk about a lot of what they found and how they created this new device now I want to show you this real quick these are videos obviously a lot of them of people going through apps and everything from Google Maps to ordering food to Airbnb and this is how they trained the model to actually get things done and the model that they chose in the device itself is supposed to be broadly applicable it can accomplish pretty much any task so it was trained on a bunch of usage of different apps and then it abstracts that away with a large language model and then simplifies the interface to just be voice and you can see here kind of like self-operating computer it's actually mapping all the different buttons within all of the different apps it records it and then it tries to learn how to actually get to let's say if you want to play an Ed Sheeran song how to do that from within the Spotify app and again bringing it back to the end of programming this is kind of the intermediate step where the AI is learning how to operate apps itself without you actually needing to go and navigate through these apps but the next step is probably to remove the apps altogether then you have the large language model Computing directly and when that happens there are no more apps and then who is going to be building them and trust me I don't want this to happen but I can't foresee any other future now I'm sure a lot of you are saying hey couldn't this just be an app on the iPhone or an Android device and the answer is probably yes but they would have had to navigate at the many restrictions of being just an app in an app ecosystem on the iPhone on an Android device and then you're probably also saying hey isn't Apple or Android also going to launch something like this and again the answer is probably yes but they're going to be launching it as part of the iPhone or as part of an Android device and so they're shoving this new type of Technology this new interface into an existing product and I'm never as big of a believer in that as I am of taking a completely new approach to something and even even if rabbit R1 itself does not end up taking off the form factor and how they're thinking about things is going to change how major consumer electronics companies think about building their own Hardware in the future and I always applaud Founders and companies that are willing to take huge risk and think about things in completely new ways because every once in a while we get something like the iPhone or we get something like the PC so this might be that if you enjoyed this video please consider giving a like And subscribe and I'll see you in the next one one
Info
Channel: Matthew Berman
Views: 1,169,561
Rating: undefined out of 5
Keywords: rabbit r1, rabbit tech, ai agents, llm, ai, artificial intelligence
Id: DlnJlG1SOZo
Channel Id: undefined
Length: 29min 58sec (1798 seconds)
Published: Wed Jan 10 2024
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.