New GPT-4o Voices & More AI Use Cases

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
ladies and gentlemen this week in AI news you can use we have voice filters and a bunch of interesting releases including a new GPT 40 use case as released by open AI give me like a nice evil maniacal laugh an app that creates music for you once them at a time a bunch of updates and some interesting Niche apps and weit lists this is going to be a good one with a wide variety of tools so let's Dive Right into this week's AI news you can use all right so first up I want to address something from last week's video because I had a line in there where I said hey The Voice Assistant from opening ey is coming next week the new voice assistant that will come over the next week but I was actually saying next weeks I just rushed it a little bit sorry for any confusion there although it might not be that inaccurate if you're not aware on Monday June 10th we're going to have the WWDC keynote from Apple and apparently they're going to announce a partnership with open a here's some of the rumored announcements coming up if you're curious about that but we'll covering this in a separate video Once the keynote happens so that would be a fantastic week to release the voice assistant but just to be clear we don't know what we do know is that they released a new use case demo here and as so many applications that come out these days seem to be mimicking what open ey is doing with this we absolutely need to look at this be the lion I really want you to like embody with them I feel like who goes there so what happened is they uploaded a singular video on their YouTube channel named character voices with GPT 40 voice basically showing how adaptable the voice inside of the new voice assistant feature again that is not out yet is now this is definitely a response to the whole Scarlet Johansson drama but we're not here to cover that we're here to talk about these capabilities now I encourage you check out the whole thing this is one of the most amazing demos now let's think about what the villain might be I don't know what animal would work best but let's start with like some kind of laughter give me like a nice evil maniacal laugh I wonder why they kept it in their pocket for so long maybe a well fought out marketing stunt with the scar Joo thing I don't know so as you can see the voice is super adaptable and we actually have some other releases linking to this so we'll cover this all together okay we have 11 Labs coming out with a sound effects generator until now we only had meta's audio box which we covered a few months back when that came out and that is really good at generating sound effects okay so now we have 11 laps for sound effects 11 labs for voices and chat GPD coming out with this model that kind of includes it all right you will be able to generate voices and sound effects with the open AI Voice Assistant it's all packaged into one it's just not accessible today but I want to share one more thing about the voice assistant with you so it seems like they started rolling out new features related to The Voice Assistant as Alex friend of the channel tweets here on X some Dev got a brand new menu feature inside of cat GPT now it's rolling out gradually I don't have this on my phone yet but what does it do well it's pretty simple it gives your chat GPT app the access to your microphone all across your phone independently of what you're doing so if you leave the app chat CHT is still going to be on recognizing your speech and this is really what enables this voice assistant concept right you want to be using it with other applications not just while you're in chat GPT so again another hint that The Voice is coming soon and I just want to note here that open AI is not the only company attempting this AI assistant idea many others are going after it one great example of this would be Nvidia with something they called Project G assist and this is a personal assistant targeted of Gamers you can ask it questions you often find yourself looking up online like what's the best early game weapon the best early game weapon is the spear unfortunately this is also not accessible today I just wanted to briefly show you that this is definitely happening and basically it's a little chat interface that can assist you in games and you can ask things like do you see any problem with display settings and then it looks at all your computer settings all your graphic card settings and gives you tips accordingly even charting something like latency for you as it changes the settings my points AI assistants are absolutely happening and they're about to happen now not in 2025 but now let's move on to something you can actually use today which is 11 lab sound effects and this is sort of a subset of The Voice Assistant functionality because as you just saw in the new demo the new voice assistant can change the voice depending on what you ask it for and that means it can essentially generate various sounds from scratch they're just showing it off on human voices but a subset of that it's scating sound effects and 11 laps released their new sound effects generator now this is not a first in the space we already have meta's audio box that is really good at generating sound effects and look today if you need sound effects you're probably a video creator that wants to include them with the edits but very soon this is going to be an essential modality when you combine it with something like open a eyes Sora or similar so let's give this a spin let's create a was to base it this is a sound effect I used to use all the time in my edits back in the day it should sound something like so essentially it's this airw with a base it in the end let's see how well 11 Labs did with [Music] this okay that one okay maybe add one more word see how this [Music] goes you know what I got to say it doesn't really do what I wanted to do here and look we actually went ahead and did some further testing on this running dozens of different sound effects effect in here and the overall conclusion is this if you keep the prompt short it actually generates usable sound effect but as soon as you start getting a little more specific and you venture Beyond four or five words in your description the results are often not as good as you would want them to be and even with something simple like a car crashing into a tree there's just always the same screeching tire and then not even a proper crash [Applause] there's one but it gets cut off in the end so look I'm a fan of 11 Labs products but this one is not too good I would still recommend the audio box demos where you can create sound effects just like this the only problem is it's not always accessible it's a research preview so at times it's just not available like right now they ask you how you are you just have to say that you're fine when you're not really fine okay so as you might already know having skills to go along with some of these tools is invaluable and I wanted to share this fantastic resource with you today that helps you do exactly that I'm talking about brilliant because in this fast-paced world we all live in continuous learning is key and finding the right resources to learn from makes all the difference and the reason I think brilliant is great is because they're at Interactive Learning platform so it's not just pre-recorded content but there's quizzes and little exercises in between they offer thousands of lessons in math data science programming and of course artificial intelligence but as I mentioned what really makes them Stand Out is their Hands-On approach to learning instead of monotonous lectures you really get to engage with the material and build your problem solving skills in the process and I actually have a concrete recommendation for you I've been more and more a fan of learning the basics of python even if you're not planning to program or do anything with it it teaches you how logic works and that helps prompt engineering it helps with a lot of these AI tools and they have a great course called practice applied Python A lot of times getting into coding can be a big roadblock for people and I think this is a great way to overcome that why is it more important well some of the more Advanced Techniques and workflows do require at least some knowledge of python as I mentioned just understanding the basics like variables and data structures goes a long way because it transitions into no codes tools it transitions into prompt engineering and more and this course is fantastic for getting you up to speed with the skills that you need to thrive in this digital era so go check out brilliant to quickly learn about Python llms and all kind of other Math and Science type topics to try brilliant for free for a full 30 days head on over to brilliant.org advantage or click the link in the description plus if you decide to stick with it you'll get 20% of an annual subscription a big thank you to brilliant for sponsoring this video and now back to more AI news you can use okay next up we have something that has gathered quite a bit of attention across the internet this one is called tomb crafter and what it does is that that if you give it two images of a cartoon it will generate all the frames in between so here you have one frame with the candle not burning here you have the second one and look at this flame coming up this is not just decent this is very very usable and that's why this has got a lot of attention this is by far the best tool to do this that has come out yet and you can use it today so we tried it out on a few examples of our own to see if this actually works here's image one and here's image two here's what tomb crafter produced okay that's not terrible this is one of the trickiest things you can throw at it let's maybe look at something easier yet crucial to a story this image of a character image one image two where she's smiling and here's what Tom crafter created look at that the eye movement the smile this one is really good one more example image one and then image two where the hair is in a different position and here's the to crafter result it works as expected you even have the leaves flying through the air so could the resolution be better could we have more frames sure but this is the very first animation tool where I'm like wow you can actually tell full-fledged stories with this if you take your time of creating the first frame and a second frame and then you can combine it with tools like the ones we looked at you can create your music with AI you can create the sound effects with AI you can create the voices of the characters W AI this is one piece to the puzzle that has not been solved yet and it looks like Tob crafter is really on that edge of just barely being good enough to be used in the real world fantastic stuff so our next tool here is actually quite impressive what it does is creates songs one stem at a time meaning it separates out the different instruments for you and then you can extend them and alter them based on your text prompt let's just have a practical look at what the interface looks like here so this is Frederick Ai and basically you get this chat interface on the right side and then it will generates the stems for you it's a very very beginner friendly interface so if you ever played with Garage Band and had some fun in there chances are you would enjoy this too by the way Garage Band used to be the very first creative software I ever tried on a computer and it got me hooked on the idea of wow I can just create stuff and learn this software and become better at creating stuff I thought that was absolutely incredible and that really was the first Spark that done learning Photoshop Premiere Pro After Effects python the basics of unity and so much more often a little inspiration goes a long way and this might just do it because look if you want to do a rock track I can just add this to the project like so and I can go to various stems and just click and then I have these Loops that I can easily extend and look I never learned too much about garage band but I do know that if I shift these up the pitch should change right then I can add effects like so and then a new project just using this rock preset by the way you get 10 Snippets like this a month for free so you get to play around with it for a few minutes what I could do then is simply Loop like so and let's see what we got from this preset what is this and yeah the free version is limited to these preset Snippets so if you want to generate your own you do need to subscribe for currently $10 a month but I feel like this might be a great way to get started with music production or see how these AI tools will integrate into music production which might be very relevant if you're really interested in something like openi Sora because essential piece to creating with that will be the underlying music and the sound effects so the tools that we just looked at here because video is really only 50% of the story so if you better Rec creating Sora better get a feeling for the audio part also this might just be a good way to do that oops head to run to the hairdresser bit of a changed look here so let's get back to the next piece of news you can use which is this new llm leaderboard by scale Ai and this one is different from a lot of the ones that have come out recently the problem of ranking different llms has been something many people try to tackle there's bench marks like MML there's chatbot Arena where users rate the output and then a ELO rating system ranks them but there has been a discussion around how reliable these rankings are some of them can be manipulated by including some of the questions or the user preferences into the training data so Scalia is attempting a new approach they have new methods to rank these where they don't disclose what exactly is being asked for and it results in these various leaderboards for coding math instruction following and language understanding you can look into all the details of what they disclose in here but most important you can check out their leaderboards that are supposed to be independent again this just popped up but I got to say these rankings look very reasonable to me the one that I personally care about the most is instruction following and yeah for that gbt 40 is fantastic also remember when Lama fre came out a lot of people were excited about the fact how strictly it follows the prompts that you give it this is something that doesn't work as well with something like Gemini and look I got to say when it comes to instruction following my personal anecdotal experience is lined up with what this leaderboard says here so I'll bookmark this reviewed over time but as of now this is a great leaderboard to look at as opposed to some of the benchmarks as they're published by the model makers in my opinion those hold less weights than leaderboards like this okay our next piece of news you can use is actually a quick update to udio the change is very simple to explain up until now you hadit a 30 second limit when generating a song now they upd it to 2 minutes as we talked about many times before Udu is absolutely fantastic for generating songs now you can actually create proper songs rather than 30 seconds and then adding 30 seconds at a time and along with that they included wave downloads if you're not familiar this is a high quality audio format which makes this usable for more commercial purposes as opposed to MP3s that are so heavily compressed that if you try to do any editing on top of them they just don't leave you any room to tweak the results whereas wave files are way less compressed so you can do those edits after effect so both udio and sunu are becoming more mature and more usable for real world purposes by the week and one more thing and this is something super unique in the AI space they actually added a feature where you can extend your very own audio so up now you could prompt and generate songs from scratch with their tool but now you can upload your own song your own voice your own sound effects whatever it might be and extend it with their generative tools very interesting and unique you do it through this upload button here on top and then you can take this [Music] clip and add more duration to it which would sound like this [Music] and to round things out I have two more things here one of them is perplexity Pages which perplexity you might be familiar with it it's the AI powered search engine that includes detailed links and references to what it presents to you and they're coming out with a new product which is perplexity pages and this is essentially article writer that is then built into search so you know as of now a lot of SEO experts specialize in using AI to flood the internet with various articles and perplexity being the search engine just figured hey why don't we just generate those articles ourself and then we have more control rather than leaving it up to random people and this is just a very interesting development so what they're doing here is creating something in between WordPress and medium where you can create your own articles that are all AI powered and you can publish them to the open internet but then they live under perplexity just like medium articles live under medium but they're still freely accessible but as opposed to medium you just know there was more AI involved in the creation I personally think this is preferable to random websites that pretend like they're not AI written and people project false expertise on articles and you look up something and at the end of the day you're just looking at a Rewritten gp4 output I appreciate the honesty behind this idea and I can see bigger players adopt this we could expect something like this out of Google at a certain point too because right now a lot of Google Search is just AI generated articles so maybe the solution to that is providing their own set of AI generated content articles but then you have the bias of the corporation I don't know this is a tricky topic but nevertheless a very interesting development of the internet as we know it's that perplexity is going after here and eventually I'm sure we'll see something like this also with video Once those tools get good enough okay and to round out today's episode I want to point you towards something that is not available today but you can sign up for the wait list and you should absolutely do it this is one of the most interesting and inspiring ideas I've come across in AI space recently it's showrunner and basically they created the AI generated South Park episode last year with their entire show generation engine there's a lot of things here okay so you could talk about this for 10 minutes straight because there's so much interesting stuff here but basically this is a prompt to show engine and they're trying to do multiple things with again this is not out yet but the idea is you're going to write a prompt and generate a brand new show of it or you're going to extend one of your favorite shows with a new episode or they have an aspect to this where they have a version of San Francisco with a lot of AI agents just living their life and then from that they generate a show so you can always follow what the agents are up to how they're living their life they're organizing birthday parties visiting each other extremely interesting stuff building up on a lot of research that has been published over the course of the last year related to autonomous agents just living in these little cities and then narratives emerging from that so they're trying to turn this into a media format that you can not just watch but that you can manipulate yourself with prompts super interesting stuff I'm on the wait list as soon as this becomes available I'll be covering this more if you want more details I recommend this Fred on X with various videos that show off these different ideas more in depth like the simulation and also this idea behind creating your own TV shows very interesting stuff lot of content here and then I suppose one day something like this could be built or acquired by Netflix and you'll just have ai features extending your favorite show with separate episodes and stories that you personally care about and that matter to you and with that that has been everything I have for you today I'm looking forward to the Apple announcements during WWDC and don't forget to subscribe to the channel for an episode like this every single Friday all right see you around
Info
Channel: The AI Advantage
Views: 67,464
Rating: undefined out of 5
Keywords: theaiadvantage, aiadvantage, chatgpt, ai, chatbot, advantage, artificial intelligence, gpt-4, openai, ai advantage, igor
Id: IZsobfqiJ6c
Channel Id: undefined
Length: 16min 41sec (1001 seconds)
Published: Fri Jun 07 2024
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.