Massive Week for AI News You Can Actually Use

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
this has been a very interesting week in Ai and I'm back at the home studio to present you with another episode of AI news you can use because we have new features and updates from chat GPT a bunch of new open-source news and apps that you will actually want to show your friends after you watch this episode welcome to AI news you can use so hey let's not waste any more time and dive right into this first things first let's talk about chat GPT updates because for most people watching this including me it is still our primary AI tool that we return to regularly and the new feature here is quite simple but very impactful actually so you might be familiar with the image generation capabilities of chat GPT with the do free model if you have the plus plan in a chat with gp4 you can generate an image of pretty much anything and it will do it but now we have an additional feature we can do so-called inpainting and what this means is that we can actually click this picture of the happy alpaka and you will see a new button up here saying select and this is so powerful because what we get to do now is we get to increase or decrease our brush size and let's just say we want to change the eye colors so I'm just going to inpaint these two eyes and I'm going to say make them blue and it's going to regenerate the image but not the entire image as you might already guess it's going to change the eyes of the albaka to Blue like so and you could do this with any image multiple times so you can go up here and say add a sun whatever it might be this is something that some other tools like merour or Leonardo already have and it is one of the most important features because look you might just want a sun in your picture and there was no good way of doing that up until now without leaving the program not everybody knows Photoshop or wants to spend time with it one thing that I was immediately excited for here is the ability to edit text but after testing it a little bit I found out that it's not really good at that here I tried to remove this text replace the main text with something else doesn't really work at all so you'll still need some external image editing Tools in order to do that okay and second quick update from cat GPT also is that it's now accessible without logging in now I'm in Europe so this has not rolled out to me yet but team members from the US Reports that you can just go to chat. open.com and start using GPT 3.5 for free this is on par with some other models and we have so many open source models now that are officially better than GPT 3.5 that are freely available across the internet but they just kind of had to do this at a certain point nevertheless if you want somebody new to try chat GPT you can go to the site and from here and now you shouldn't have to log in anymore okay moving on to the next one which is stability AI stable audio 2 and they made this completely for free which was not the case before with their previous model and if you're not aware they just recently hit a leadership shift and the direction of the comp is changing they're trying to generate more Revenue so a lot of their stuff is behind their membership but stable audio you can just use straight up what does it do it generates music without lyrics how does it do it it's really good very solid if you need background music this is a fantastic tool I'm sure there's many more use cases that's just the one that comes to mind for me so what are the key points here up to three minutes in audio length you get this interface where you get to do it all you can try it for free just a quick Google login later and I have 20 tracks that I could be generating here few important points here first of all this is commercially usable because it builds up on stable Audio One and Only includes a data set that they actually licens so you can fully use these tracks second of all and most interestingly it's not just text to audio it's also audio to audio so I could record something here and then it reproduces that into a track and that's exactly what I'll do in this demo here okay exclusively for this video I'll dust off my rusty beatboxing skills from 10 years back or something never took it seriously but it might just come in handy here so let's see I'll record and let's try and create some back B tier it's something okay upload this and you know if I can do this you can do this too that's the magic of this it's just going to transform it into proper music so let's have a look do this let's pick something from The Prompt Library maybe a drum solo I like that using the 2.0 model perfect do 11 second long thing and I would have a drum solo for the intro of my YouTube video maybe let's see give me result oo let's have a [Music] listen wow that's a nice little Jazzy drum solo right there there you go it's free it's fun you can just do random noises into the mic and it's going to turn it into song yeah thank you so much stable audio moving on all right so as mentioned before this weekly series is all about AI news that you can use which essentially means that I'll be telling you about technologies that will enhance your skills to achieve your goals but here's the thing you need a good Baseline of skills to be enhanced because a lot of these tools don't do the things from scratch you need some sort of inputs take me for example if I wouldn't have had any coding skills whatsoever I wouldn't be able to create this video I guess my point is that these tools are often just extensions of yourself so that presents the question how do you acquire some of these base skills and one of the best ways I found is today's sponsor brilliant the great thing about brilliant is that they teach with interactive examples and exercises rather than just a boring textbook this Hands-On approach really helps you understand Concepts rather than just memorizing them just to forget them later on they have a pleora of interactive lessons in math data science programming and even AI my personal favorite course brilliant is how llms work large language models AKA chat GPT or Claude you learn the basics but also how to fine-tune them on your specific data to try out this course and everything else the brilliant has to offer for free for a full 30 days visit brilliant.org advantage or click the link in the description if you like the trial you can also get 20% of the annual premium subscription thanks again to brilliant for sponsoring this video and now let's move on to the next use case now let's talk about the open source space I get a lot of comments that hey ego you're not covering open source enough but there's a good reason for that I try to show you stuff that you can use and that you should use and open source is for people who build apps or it's for people who really care about privacy but most people just want the best results possible and that's where you get to the closed Source models like gp4 Cloud free or Gemini but nevertheless I try to cover all the important open source releases is because it is relevant to a lot of people and we got a brand new model this week dbrx from data breaks and this one is the new best-in-class opsource model but wait a minute this one is not fully open source because what they released this under is this data bricks open model license which is almost open source but not quite so similar to llama they have this clause in here that hey if you have over 700 million monthly active users you must request a license from data BRS if you want to look at the details I'll link it below but I'm aware that most of you care about how well does this perform and for that we shall have a look at benchmarks over here but over time only usage will show the true picture but as you can see here on the popular MML U Benchmark it is actually better than both llama mixol and grock one while being way way smaller on programming it's absolutely amazing this is their main claim to fame as it is on math and the big advantage on top of that and probably also the reason why this is the most popular space on hugging phase over the course of the last week by a long shot is that it's two times faster at inference and it costs nearly four times less compute to train so it's not just better than all of these it's also way more efficient to train and it responds twice as fast as some of these other models that's what inference means there is a hugging face space up where you can try this out today I'll just go with the obligatory rightman essay about penguins and look at this inference speed it's right there it's super fast there's a limit on the token outputs but you can test it in here and yeah apparently it's really good for coding tasks but over time I learned to give this a little bit of time get this in users hands and get the community's feedback on how it's actually performing and to round out this little coverage of the open source space there is a brand new mistal 2.8 dolphin model and this one is a fully uncensored model we talked about this in previous episodes I showed you how to use this in a replicate space and in effect you can ask it every single question because if you're not aware these dolphin models are completely uncensored meaning literally every single question that you could think of this thing will answer and this is the newest version of it when we covered it I believe that was dolphin 2.2 but this one has been even further refined and it's supposed to perform better I'll be playing with this but unfortunately I haven't found a super easy way to run this yourself without downloading it and running it locally I'll keep an eye on this there a lot of people really enjoy using these uncensored models if I find a simple way to use this I'll report back next week all right so moving on to the next piece of AI news that you can use this is less of a use case and more of a piece of knowledge that you should absolutely be aware of and namely Ethan mullik here is tweeting about this brand new paper that came out that studies how well AI detectors perform and the findings are sobering to say the least basically tested all these different detectors that claim to be the one that actually detect AI but this table right here is what I want you to pay attention to because it looks at gp4 and the accuracy in predicting if the textt is actually AI generated or not and the one thing I want you to notice is how all over the place this is okay all the way from Bart completely falling apart when you use some of these techniques to GPT becoming more AI like when you use certain techniques and less AI like if you use others in other words it depends what model you're using and there's all these techniques that can fool these detectors pretty reliably on top of that the paper also talks about how people that come from a foreign background where English might not be the first language often that gets identified as a AI written so just imagine you're doing a semester broad at University and you submit your English paper and then the teacher runs it for AI detector and they're like hey you used AI you're just like English what are you reading about like what the heck kind English but it won't matter cuz they feel like their tool is going to deliver an accurate result which is obviously not the case so this just confirms that we already talked about on the channel when it happened and that is that open AI withdrew their AI detection software because it just did not work reliably no matter what they did it was too easy to fool so just using AI detectors is not going to be a solution for this whole problem of how do we identify AI written text this is just the part of society now and you have to find other ways of dealing with it I thought this was really significant because really whatever you using AI for this is kind of a factor in all of it right and now you know that these detectors they're not something anybody should be relying on because adding a few spelling errors is just something everybody could do right okay moving on here's a fun little app and I found a similar one a few weeks ago that I showed you for Mac it essentially takes all the images on your computer and names them with GPT Vision now the Mac one was even more pricey than this and you do have to use your own API key to name each and every one of those it's not like a service where you pay $5 and it renames your entire iCloud photo library but if you pay you could totally do that and this is the windows version of it that is really easy to use now at the time of the recording this cost $25 which I think is actually quite reasonable and then as mentioned you have to use the API to name all the different image files so if you have messy hard drives and a bunch of unnamed pictures and you don't know how to organize them this might just be a great way look you just select them say Ren Ai and all of of a sudden all of them will be named with what is actually inside of the picture for a messy desktop like mine which I will not show you cuz I'm ashamed of it cut it out nope if you take a lot of pictures or screenshots Rene I might just be useful to you and nope this is not sponsored I just found this really interesting and I figured it could be really interesting to many of you okay so next up we have a very very interesting one we talked about AI avatars on the show a lot and a lot of different ones come out but I think everybody that I talked to recently agrees that haen is kind of the leader of the pack there and yet again they're pulling a head by releasing a new feature soon where the virtual Avatar is actually in motion so the person is walking and presenting the words that you give to him okay so what you get to do today you get to go to this link demo. hen.com Avatar in otion Link in the description below and you get to give it a quick phrase here and fill out your email and then after a little bit of waiting it's going to email you the video so what I did is I let Avatar introduce this weekly show and gave him a German name cuz that's a little tricky and I want to see how they handle that let's have a look at how it did Welcome to AI news you can use I am your host for today rudiger Schulz this has been a demo of hen's Avatar in motion what do you think I think this is extremely impressive while you're in a rush scrolling for your Instagram feed you would not pick up on the fact that this is AI generated and we're just getting started this is literally the first version of this that we SE in production there's no other company that does this this well that I'm aware of and as per usual the comment section is there if you know of one please leave a comment below but basically you can get your own little custom demo like so for free and then you could download it and send it it to your friends you can have them say pretty much everything that doesn't go against their privacy policies and with that let's look at our future use case and for this week's future use case it's something we like to do at the end of the show to show you what's coming up I'll be showing you one of the most exotic ways of how you could be evaluating an AI right here in this GitHub repo called llm Coliseum so as we talk about all these open source models and these new releases it's getting increasingly hard to Benchmark them because 6 months ago as a user you just used to ask a new large language model hey create me a game of snake and then you could see if it actually completed it if it got stuck halfway these were very simple but quite effective little benchmarks for real world use cases but over time these model makers adjusted and they started adding these examples into their training data matter of fact from what it seems like is all they're trying to do is just get a model that performs high on all these benchmarks and handles all the test cases that people across YouTube and Twitter throw at it and then it looks really good in theory but in practice look at it and they're like okay so who cares that this is better benchmarks than gp4 it's not nearly as good so inside our AI Advantage Community we had a very interesting discussion on how these things will be benchmarked moving forward and Daniel from the team brought my attention to this llm Coliseum GitHub repo which essentially a brand new way of kind of benchmarking and measuring how good these models are and hear me out the way it does it is it lets the large language model play Street Fighter turn by turn Okay so it uses Vision to analyze the frame and then it decides on the next move and then it does that same to the second large language model and who wins that Street Fighter is the better llm I mean admittedly this is a bit ridiculous but I just found this idea interesting enough to share with you who knows how these models will be valuated in the future if we're going to have personal AI assistants at the end of the day it matters that they actually help our lives and not some arbitrary Benchmark so as you can see it looks at the game and then it's asked what is your next move and then the llm decides what the next move is and it's executed and so on so there you go that was our little preview corner where we looked at what is coming up I think both corporations and the community are going to keep coming up with these exotic benchmarks and then you get to choose which ones you care about but I just wish we had a better measurement and these standardized benchmarks that everybody seems to be fooling these days and letting the AI play a game of Street Fighter time will show okay now that you checked out all the news that you can use you might just be interested in this chat GPD for beginners playlist I have on the channel I created a lot of chatchi tutorials over the years but here I collected all of them and organized them chronologically so if you want to get more out of large language models this is a great and completely free way of doing that okay I'll see you next Friday hope you have a great week
Info
Channel: The AI Advantage
Views: 42,058
Rating: undefined out of 5
Keywords: theaiadvantage, aiadvantage, chatgpt, ai, chatbot, advantage, artificial intelligence, gpt-4, openai, ai advantage, igor
Id: cif0hm5bDAc
Channel Id: undefined
Length: 14min 18sec (858 seconds)
Published: Fri Apr 05 2024
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.