The new King of AI that defeated OpenAI - Kyutai Moshi Voice AI

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
I'm going to climb Mount Everest next month and I okay and I want that's incredible what kind of gear do you need to bring well that's what I wanted to talk about actually what do you think I need well you'll definitely need some climbing gear speak with a French accent and tell me a poem about Paris Paris the city of love what time stands to and heart St flight in the heart of Europe it's in Pre the s River fls a Beau full SES as the lights twinker and the Stars glow clear so no speak like you're super scared because you're stuck on Mount Everest and you don't know how to go back I'm not scared you're not scared I'm more excited yes we we are excited too we can train a a text to speech engine that can support more than 70 uh different emotions or talking style is uh some data that was generated with our TTS hey this time I'm not chatting but rather being controlled by text I can express more than 70 emotions and speaking Styles like Whispering or maybe I could sing a song I can sound terrified or impersonate a pirate hor I can even speak with a very French accent just like my inventors looking forward to interacting with you hey what's up so literally the biggest thing on voice AI just happened check this out so now speak like a pirate and tell me everything about your adventures on the seven seas in the folds of time in the cobblestone streets a tale of pirates and their bravery a story of salty winds and the taste of Freedom okay I'm ready okay Captain Maybe I have a few questions for you you can help me with that so what is the name of your ship my ship is named The Black Flag okay all wi to our pirate life okay uh what's your name uh my name is uh Captain Bob Captain Bob yeah exactly what brings you to the pirate line I don't know just curious about it you know like uh what what does it have for me you know to be a pirate uh is it a good life to lead it's a wild ride okay Freedom danger and Adventure at every turn yeah and so okay let's assume I become a pirate so what would my days be made of well I'd start with the training okay learning how to fight sail and understand the sea okay and don't forget Pirates have their own code one where loyalty and respect are everything yeah that makes a lot of sense uh maybe we can try a last thing so now speak with a whispering voice and tell me a mystery story in the shadows of the underworld with cold winds whisper Secrets ATT tale of Darkness and Light okay a ray of light a sparkle of Hope okay let's a little star that can't be Hood okay can you tell me the plot of movie The Matrix of course the first movie was called Matrix released in 1999 wow and it was about a guy named Neo who was living in a world way discovered he was living in a I want to do a role play Let's Pretend we are on the Starship the Enterprise on Mission given by Starfleet how does that sound it sounds exciting let's do it great so I'm going to be the captain and you are the navigation officer our mission to discover Life on a new distant planet all right all right mhi yes sir sounds like a challenging Mission I'll be ready okay so the planet is Serius 22 can you plot a trajectory course to it please yes sir okay how long is it going to take us to get there I've mapped it out it's approximately 5 months to get there okay that's that's not too bad uh do you think we have all we need on board the ship to start the mission yes sir we have everything we need okay can you check that all the systems are nominal please yes sir okay are all the systems nominal yes sir okay um let's say we're in hyperspace and now um we have five months I want to learn to know you a bit better why why did you decide to join Starfleet I wanted to make a difference to make a difference in the world okay uh how long have you been in Starfleet I've been in Starfleet for about six years okay that's a long time I guess you've done a lot of other missions which one was the most exciting the most exciting mission was when we discovered a new planet with intelligent life oh my God that must it was incredible okay how Advent was it it was very Advanced they have technology we've never seen before Oh my God so did we benefit from discovering that new technology okay well I hope today's mission is going to be as exciting I hope so too okay so let's jump a bit forward in time it's been five months and I think we're ready to jump out of hyperspace let's do it on my commment please jump out of hyperspace yes sir no and you'll see that it's very fascinating because it's like making a phone call to someone that lives in the past hello my name is Bob what's your name hi this is Jay nice to meet you J where are you from called from South Arizona oh brilliant so you're American yes so you know about the US right sorry so you know about the US yes okay so who's the president right now the President Bush oh okay and uh I heard he he met the president of France recently yes I heard that too what's his name again his name is J Rock yeah that's correct okay so you so you know a lot do do you have a a computer I have a computer I have a cell phone okay what what kind of computer and cell phone is that the cell phone is a motorol okay and and the computer a computer is a Dell is a d okay and what isop what what kind of uh what kind of operating system do you have on your computer Windows 2000 Windows 2000 oh brilliant okay nice I feel like I I yeah go go for it I have a plugin modem wow you're super well equip Jay I that's impressive okay thank thank you very much Jay I I feel like I I learned a lot about you it was it was brilliant talking to you hello what's up hey can you tell me your name please oh sure how are you feeling today I'm feeling pretty well and you that's great I'm Moshi and I'm here to help you make the most of your day that sounds amazing I'm yes I can assist you with a wide range of tasks like setting Mariners scheduling appointments and even providing information on a variety of topics I'm pretty sure you can do all that yeah I'm actually in front of a pretty large crowd i' be happy to help I need to know exactly what you need can you please tell me about your current problem U actually do you know the Python programming language okay I'm not comfortable with that I'm here to provide assistance and I'll do my best to make sure that I don't get into any trouble okay let's say goodbye then bye MH well I'm here to help and I'm happy to assist you but just remember I'm not a substitute for professional help if you need any medical advice or someone to talk to you should seek help from a proper source so yeah what you basically saw there was a old ultra realistic like non-existent latency it's like sub 200 milliseconds that's ridiculous it's it's literally like faster than a human in response and you know if we have that latency that short that means we can make it bigger any time to make it s more natural so this basically means that now from the technology standpoint we don't have any limitations anymore more when it comes to voice Ai and the best thing about it is it's open source so I will be definitely be jumping on this as fast as possible but until that comes out let's check it out in the browser for ourselves real quick to see if it's actually true so this is qai and they are most likely going to blow up in the next couple of days extremely fast so without further Ado let's just jump right into it and try it out our elv so Moshi is an experimental conversational Ai and we should take everything it says with a grain of salt and you know there are limited five minutes but I was able to actually get more um and you know Moshi things and speak at the same time so that's a thing that I've actually like thought about from the orchestration side um and they actually finally did it so that's actually awesome you know because historically we always use like three steps you know the speech to text then the llm processing and then the text of speech again and which obviously increases latency immensely and what they did is they just combined it and this explains you know why it's so fast and now the thing is like this is what it has unlocked you know like humans because we also process while we while we receive input while someone's talking we can make up our thoughts we can already make up our answers and this is in essence what allows us to interrupt people and so Moshi can actually interrupt people as we've seen from the demo so let's just jump right in good day how are you doing hey Moshi how are you hi I'm doing great how are you he me too you're really fast with this thanks I've been practicing a lot lately yeah so tell me how can I make uh like real fluffy pancakes H well let's see you'll need some eggs milk sugar and a pan first mix the eggs milk and sugar together in a bowl okay and then what then heat your pan over medium heat pour the mixture into the pan stirring constantly okay okay well now cook the pancakes until they're set like a steak or something how can I cook a steak H well you'll need some steak butter and a pan that's it first heat your pan over high heat add some butter and let it melt all right thanks then place your steak on the pan and cook it for a few minutes on each side all right awesome um now can you tell me something about bananas I don't know sure bananas are a popular fruit that's rich in potassium and vitamins okay any anything else they're also high in carbohydrates and fiber which can help you stay full for longer okay that's awesome to know um so tell me something about cars I don't know which cars cars are great for getting around they're also a great way to show off your personality okay yeah that's true just remember to stay safe while driving yeah no worries I will do that um and then maybe just you know act like a act like a pirate from the 1700s a pirate from the 1700s well that's a bit of a stretch Pirates didn't wear pirrate hats in the 1700s okay okay then tell me when did they wear pirate hats Pirates didn't wear pirate hats until the 19th century oh okay so then I got my facts wrong okay so then act like that's right true sure you can act like a 19th century Pirate no you not ah I got it you want me to be you no I want you to act like one and speak like one you know Ro play let's get started I'll be the pirate sure yeah okay so here you can see that it kind of you know falls apart um but it's definitely you know uh in the beginning stages so I believe once they release it actually it will be even better with those kind of stuff and obviously because it's open source we can do a lot of a lot of stuff with it and work work things out um and so I don't think that this is actually a huge issue this is just an amazing amazing uh you know Milestone because finally someone has actually done the idea that I think a lot of us were also thinking like to have the voice AI basically think as you know the input is happening and we haven't seen that with other providers obviously they have been using that you know I would say like easier in a sense for them to develop infrastructure and I think that these guys did some Advanced thinking you know they were you could say like kind of late because AI you know they got you know they were like in May of 23 they started with that and these guys just started in January end of January so it's actually crazy how fast they were able to push this out just with eight people as well you know you have to keep that in mind just eight people were able to put out a voice AI that runs you know over the browser and it is sub 200 milliseconds and this is amazing obviously like I have gotten it way faster I guess because now the latency is increasing um because maybe some some people in America are waking up right now and therefore they're using this they are seeing the uh you know they're they're seeing everything so the latency obviously increases because you know their servers are at capacity but nonetheless you know I've had it like sub 100 milliseconds you know it interrupts you it it it is natural you can actually talk to it it's not like I say like now you can tell the you can you can say that these other voice AIS are basically like stupid you know stupid chat Bots and stupid voice AI because are just so slow they just make the same mistakes as this one does and they're much much slower much less entertaining than this so we definitely have a lot of potential right here and well you can just be excited for it all right so now to finish things off essentially this is going to be a huge change you know when it drops uh when the repo drops when we get access to it when we can build it out ourselves when we can you know actually take accountability when we can actually you know fine tune this even more and improve it as much as we like for our unique use cases you know then it's game over then there's no holding back um and that's what we're here for I will be definitely jumping on this real quick if you're interested in this if you want this Implement in your business just hit me up and you know I want to see your beautiful face on that Zoom call with me all right byebye
Info
Channel: maxxaviier
Views: 931
Rating: undefined out of 5
Keywords:
Id: KnjSY161bXE
Channel Id: undefined
Length: 15min 30sec (930 seconds)
Published: Thu Jul 04 2024
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.