Rabbit R1: A pocket Companion That Moves AI from Words to Action.

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
an AI product which is worth buying in 2024 this is absolutely amazing and it took the industry by an surprise no one was expecting something like this so let me introduce you to this cool take named rabbit R1 an AI based pocket companion who plans to take over smartphones here is the video of it hi everyone my name is Jesse and I'm the founder and CEO of rabbit I'm so excited to be here today to present you two things we've been working on a revolutionary New Foundation model and a groundbreaking consumer mobile device powered by it our mission is to create the simplest computer something so intuitive that you don't need to learn how to use it the best way to achieve this is to break away from app-based operating system currently used by smartphones instead we envision a natural language centered approach the computer we're building which we call a companion should be able to talk to understand and more importantly get things done for you the future of human machine interfaces should be more intuitive now before we get started let's take a look at the existing mobile devices that we use daily the one device that's in your pocket the smartphones like iPhone and Android phones these guys been here for years and we've grown tired of them the problem with these devices however is not the hardware phone factor it's what's inside the app based operating system want to get a right to the office there's the app for that want to buy groceries there's another app for that each time you want to do something you fumble through multiple pages pages and folders to find the app you want to use and there are always endless buttons that you need to click add to the card go to the next page check the boxes and jumping back and forth and so on the smartphone was supposed to be intuitive but with handers of apps on your phone today that don't work together it no longer is if you look at the top of ranking apps on App Stores today you'll find that most of them focus on entertainment our smartphones has become the best device to kill time instead of saving them it's just harder for them to do things many people before us have tried to build a simpler and more intuitive computers with AI a decade ago companies like apple Microsoft and Amazon made Siri contana and Alexa with these smart speakers often they either don't know what you're talking about or fail to accomplish the tasks we asked for recent achievements in large language models however or llms a type of AI technology have made it much easier for machines to understand you the popularity of L chatbox over the past years has shown that the natural language based experience is the PA forward however where this assistance struggle is still getting things done for example if you go to the chbt and use your Expedia plugin to book a ticket it can suggest options but ultimately cannot assist you in completing the booking process from start to finish things like Chad gbt are extremely good at understanding your intentions but could be better at trigging actions another Hot Topic is a field of research around what they call agents it has caught the eye of many open-source projects and productivity software companies what remains to be solved is for these agents to perform tasks end to endend accurately and speedily the problem is forcing a model to perform a task it is not designed for whether for a language model to reason about web page using super promps or screenshots we have yet to produce an agent as good as users simply clicking the buttons to fulfill our vision of a delightful intuitive companion we must go beyond a piece of complex software we want it to be in the hands of everyone so we first set up to fundamentally understand how computer apps are structured and more importantly how humans interact with them we wanted to find a way for our AI I to trigger auctions on behalf of users across all environments we want it to be Universal not just a chrome Plug-In or limited set of apps but everything iOS Android and desktop these applications share something in common the interface they all have a user interface so at a philosophical level if we can make an AI trigger actions on any kind of interface just like a human would we will solve the problem this Insight led us to create the large action model or lamb as we call it it is new FAL model that understands and executes human intentions on computers driven by our research in neuros symbolic systems with a large action model we fundamentally find a solution to the challenges that apps AP I or agents pH we solve it with interfaces lamb can learn any interfaces from any software regardless of which platform they're running on in short the large language model understands what you say but the large action model get things done we use lamb to bring AI from words to action finally we can build a computer that in addition to understanding what you're trying to say can actually help you do things on your behalf we pack the large action model into advanced rapid OS operating system it is built for realtime interactions between you and the rabbit powered by lamp large action models concept and test results are so powerful that we decided to make a one other kind mobile device introducing R1 your pocket companion [Music] you [Music] this is the rabbit R1 design in collaboration with teenage engineering the R1 is a fully Standalone device primarily driven by natural language the accumulation of countless hours of work from our research design and software teams it is simple to use R1 is equipped with everything you need to interact with you and perceive your surroundings it comes with a touchcreen a push to talk button and an analog scroll wheel a microphone and speakers and a computer vision enabled 360 degree rotational camera we call the Rabid eye it is Bluetooth and Wi-Fi capable and I has the same car slots that supports Global 4G LTE network and now let's see how it works I can ask rabbit for anything just like if I'm using chbt the difference though is a speed with rabid OS I get a response 10 times faster than most of the voice AI project rabbit answers my questions within 500 milliseconds with the push to talk button you don't need to say anything to wake it up just press and hold the button and talk like a walkie-talkie now let's give it a shot what's the nature of the reality the nature of reality is a topic that has captivated the minds of Phil philosophers and thinkers throughout history according to Bertrand Russell reality is composed of hard data derived from sensory experiences and logical analysis it is the amalgamation of our perceptions thoughts and the external world I can speak or type with R1 if you shake R1 a keyboard will pop up you will also know that the response speed is much faster than using some of the L assistant apps with web browsing such as trt what's the stock price of Coca-Cola searching for the stock price of Coca-Cola the stock price of Coca-Cola KO is $59.7 who played the role Oppenheimer in Christopher Nolan's latest movies in Christopher Nolan's most recent film Oppenheimer the role of Jay Robert Oppenheimer is played by Killian Murphy the film is a biographical Thriller that explores the story of American scientist J Robert Oppenheimer and his involvement in the development of the atomic bomb now let's see how the large action model works with lamb rabbit R1 can interact with all kinds of applications introducing the rabit hole web portal I can log into different services and unlock functionalities for R1 it's just like iCloud let's say I want to listen to some music with a rabbit hole web portal I have complete control over which Services I want to activate on my R1 and which provider I prefer for music I use Spotify so let's go ahead and connect my Spotify to the rabbit hole now that this is Spotify login page before I start the longing process I want to emphasize our commitment to privacy the way we design the infrastructure for apps that we interact with we don't create fake users or spam users we don't hack their infrastructures and of course we do not store any of your third- party credentials instead we redirect you to their login page we never save your username and password and we don't track your data the authentication happens on the respective apps now I'm logging with Spotify rabid OS recognize that Spotify has been authenticated on my device before this presentation I pre-authenticated some other apps like uber and door Dash just choose your preferred services and logging through them to get authenticated on your device it's that easy now let's let play some music play craftworks pocket calculator playing pocket calculator by craftwork for you play another song from the album sure playing computer love for you that's all the tracks from the Same album who wrote the lyrics for this [Music] song Ralph hutter and Emil schul wrote the lyrics for the song Computer Love by craftwork which band sampled this song it seems that Coldplay talk is the only known example of a song that sampled Computer Love what do you think about this song Computer Love by craftwork is widely regarded as a groundbreaking and influential song in the realm of electronic music its Fusion of catchy Melodies robotic VOC and Innovative use of synthesizers has left a lasting impact on the genre I can also use R1 to callor right get me ride from my office home now of course I will book an Uber ride for you from your office to your home please confirm the ride I have six people with three blockages find me Uber that can fit all of us for six people and three pieces of luggage I recommend booking in Uber XL as it provides ample space for all passengers and luggage please confirm the ride the ride shows up I just hit confirm u's on my way I can also use R1 to order food get me a 12in pizza from Pizza Hut deliver to here the most ordered option on the app is fine ordering a 12in pizza from Pizza Hut since you mentioned that the most ordered option is fine I will select that for you I just created an order for a 12-in pizza it's going to be hand tossed with a classic marinara sauce and topped with regular cheese please confirm your order that sounds really good I just confirmed order here now I'm waiting for my pizza I can also make R1 to a set of more complex actions that may take multiple apps to finish on a smartphone for example if I want to plan a trip to London I can just describe what I wish in full to R1 I want to take my family to London it's going to be two of us and a child of age 12 we're thinking of January 30th to February 5th can you plan the entire trip for me we like cheap non-stop flights grouped seats a cool SUV and a nice hotel that has Wi-Fi exploring ticketing options to make your trip a reality for your trip I found various flight options arrange of hotels to choose from and car rentals available please confirm each option individually for further details and booking so it's all been planned out I just confirm confirm confirm and that's it could you come up with a schedule for fun things to do while I'm over there on it I have prepared a detailed travel plan for your trip in summary you will be exploring London's iconic landmarks visiting museums enjoying delicious meals at local restaurants and experiencing the vibrant atmosphere of different neighborhoods navigation details are also prepared and I can help you book tickets for any events you'd like to attend R1 just planned the entire trip for me that's awesome but it seems like this is a little bit too intense can you plan a easy schedule for us sure I'm working on it please take a look at it and let me know what you think it gave me a more relaxed schedule every day with all the details sounds really good and I just confirm that I can foresee a wonderful trip Isn't that cool huh R1 help me reach each others just like a phone but smarter it come with a build-in real-time translator and a notaker removing the final barriers to communication R1 automatically detects the language spoken in the environment and provide the B directional translation R1 has an ie an onboard camera designed for advanced computer vision it can analyze surroundings and take actions in real time to activate the eye just double tap the button oh funny seeing you here Rick let me take a look Never Going To Give You Up playing now injoy what am I getting Rick Roll in my own keynot let's move on to the next [Music] one this is what I got in the fridge can you make me a nice dish that's low in calories let me see how about a green garden omelette it's a delicious and low calorie dish that combines the freshness of broccol and cabbage with the creaminess of eggs here's a simple recipe for you nice it recognize all the stuff and gave me the actual recipes rabbit eye can also help you with your documents here's the table I've been working on for job questionnaire can you create an additional column that matches candidates who mentioned rabbit in their questions about how they found us sure let me take a look at the table and add the matching column for you I've processed the table and sent you an email with the results okay now let's check our email I can continue to interact with rabid OS even Beyond R1 let's reply this email directory can you add another column that matches candidates who have included rabbit in their question and are La based I just reply the email h s and I got a refined version from rabid o through my email let's say I have a unique routine or task I cannot do on my phone R1 can do that too we are experimenting with what we call the teach mode just like how I can teach my friend how to skateboard I can show R1 how to do it and it will learn from me this means that any user regardless of technical background can teach R1 to learn new skills so you go to teach Mode start a new session today I will show you how to generate an image of puppy using me Journey from prompt using Discord rep first I will go to the servers page and click one my own servers since this is only a general image generation I'll go to Mid Journey text Channel then I will use the image comand along with the prompt here I'm putting a cute baby wild dog with big eyes animated cartoon on rail 8K let's wait for a minute for the engine to start generating the images once it's done let's click on the image to get a link I will then explain to Rabbit OS how to use this rabbit and annotate it so that I can generate anything not just poppies so let's go back to our web portal submit the request it takes seconds for the web portal to finish processing and that's it it's that simple now once we finish the training I can go back to my R1 now let's use mid Journey as I told you to generate a picture of a bunny in pixel art style certainly Jesse I will use mid journey to generate a picture of a bunny in pixel art style for you please give me a moment to create the image now here you go you got a image generated on Mid journey through teach mode watch learn and repeat that's teach mode it's that simple that's all the demos for today with land fast evolving my R1 will eventually help me to do things that can never be achieved on an app based phone speaking of the current APP based phones the first question we ask about ourselves is why would I need a new device if I already have a thousand do iPhone my iPhone can't do any of this at all we do not build rabbit R1 to replace your phone it's just a different generation of devices the app based system were introduced more than 15 years ago and a new generation of native AI power devices are just getting started here's a quick recap R1 is our companion that hosts the large action model with natural language I can use it for a wide range of tasks ask anything direct actions complex actions AI enhanced video calls notaker translator with a rabid eye computer vision and experimental teach mode on the hardware perspective we got a 360 rotational camera a global 4G LTE SIM card a push to talk button and an analog scroll wheel one last thing what about the price now before we reveal our price I want to do a quick comparison here are some of the best phones on the market right now you got iPhone you got latest version of Android phones we're looking at somewhere around 700 to $1,000 for a top phone with an app based system I bought my new iPhone 15 Pro Max last year and it's the same experience as my previous ones here are not so smart smart speakers they're asking roughly around $200 but they're all outdated and finally here are a couple of the new things with only large language models you got AI paying asking for $699 plus monthly subscriptions for their base models you got tab asking for $600 and you got meta reband glasses asking for roughly $300 remember these are the things with only large language model we still think these were too expensive we priced the rabbit R1 at $199 no subscription no hidden fees you can order the R1 now at rabbit. and we are shipping Eastern 2024 I can't wait for you to experience the R1 for yourself thank [Music] you don't you think this product from rabbit take is downright amazing and capable of making its presence into take world making a new domain of AI Companions and what makes this device really special as compared to apple and other is that its ability to responds to cues very quickly and then there is the ability to learn things just by observing did you saw that part where it learned to generate images just by observing how M Journey does it which is kind of insane in itself it makes me wonder if if it can learn other things like voice mimicking and deix and then it can be us it just by voice command it would be amazing right and it is just their first generation product second gen would be more exciting and amazing I also went through their website and they have mentioned that their future plan is to beat smartphones with AI companions not now but it is their future goal so so what are your thoughts are you going to buy one or planning to wait till these features came out in your smartphones sh your thoughts like share and subscribe see you in the next video
Info
Channel: The Ai Verge
Views: 7,922
Rating: undefined out of 5
Keywords: AI, Artificial Intelligence, Technology
Id: fkNzbmn1ciQ
Channel Id: undefined
Length: 27min 11sec (1631 seconds)
Published: Wed Jan 10 2024
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.