Claude 3 Vs Gemini Vs GPT-4: Who Can Make Amazing Powerpoints?

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
Okay. So yesterday I came across this tweet, which I thought was interesting, Around the idea that LLMs are really going to be a threat to white collar professionals when they can generate slick PowerPoints, better than humans. And while I found this sort of, semi amusing, I also thought it's kind of true, right? That a lot of people, their jobs seem to just be making PowerPoints and perhaps doing email on top of that. So what I thought I would do is try out, just to see, okay, with what's the current state of building a PowerPoint deck with an LLM. Now I've deliberately not gone for doing an agent kind of framework. So maybe in the future, I will do this sort of video again with agent frameworks And look at how well that could be improved upon. But what I do is I just take the sort of top LLMs that we've got out there and I've chosen three of these. so I've chosen ChatGPT Plus, which is GPT 4. I've chosen Claude 3 Opus. And I'm going to use the Google Gemini 1.5 pro model in here to try these out. So where possible I'm going to use the same prompts for each of these models. And I'm going to try and, see if I can try and use the same prompts all the way through. All right. So we're kicking off with a ChatGPT Plus or GPT 4. and I basically just asked it a very sort of generic, simple question. What is the best way to make a PowerPoint slash side deck programmatically? And you can see that what it's coming back with, is it suggesting a number of different packages or frameworks that you could do this. I'm not sure about markdown or LaTex there, but okay, we've got some things going on in the ideas in here. If we give the same thing to Claude. and you'll notice I'm trying to use, like I said, I'm trying to use the exact same prompt each time. I did the same thing to Claude. It's also giving me a list of different kinds of frameworks. But one of the things that's nice about it is that it's actually giving me some sort of code examples of how I can sort of see. this as I go through here. Okay. And then finally, when I give it to Gemini 1.5 Pro, you can see that we're getting back a similar kind of things, talking about different frameworks is stuff. Although some of the frameworks are different in here. and it's almost like they're all sort of trying to clarify something like, what language are you using? that kind of thing. All right. So I'm going to follow up with the same prompt for all of them to clarify this. And what I'm going to do is I'm going to ask it to make a set of slides about the five good emperors of Rome. So if you, come and look at, just doing a Google search. Google generative, AI search here for the five good emperors of Rome. It clearly knows what emperors they are and it's giving me, this is the generators stuff with an LLM before it actually then goes on to give me search results and stuff like that. So we know that an LLM should know what these five emperors are et cetera. so I've clarified that. ahead of time. What I've also done is I've gotten a couple of books about these and converted them into text files so that we can actually try uploading the books and getting the LLM M to use this as a resource, maybe for a second round, going through, et cetera. Okay. So here's the next prompt that I'm going to go with. So I'm going to basically clarify that I'm using Python. okay, let's say I'm using Python. Give me a code example that will generate 10 slides about the five good Roman emperors with, I probably want to make that with two slides for each of them. I want it to be creative and very stylish as if it was designed by an advertising agency, creates ads for Paul Smith. for the info on the slides have the emperor's top five achievements For the content. Let me just put a comment there. For the content approach it as if you are a top four consulting company, making a pitch. Actually we'll say making a pitch deck. All right. So we kick this off in here. What I'm really curious to see is like, where do they strongly go for the same kind of thing? And where do they strongly differ in their outputs? So I've got. GPT 4 going there. Let me just kick Claude off. So I'm really curious to see sort of where are they similar? Where are they different? Not just in the actual sort of specific content. But in the way that they respond, to this. It's interesting that this one got the subtitle, a pitch by top four consulting. So maybe I should have used the actual, consulting names there. Let's give this one to Gemini and kick that one off. And while that one's running, let's come back and see what, GPT 4 has done. Okay. we can see here that, they've all settled on this particular package. which I haven't used before. So eventually I'll try and build these and see what these actually look like. Add emperor slide. So it's got a function there. Okay. It does look like it's got the five emperors, correct here. All right. we've got Nerva, Trajan, Hadrian, Antoninus Pius and Marcus Aurelius. Okay. so it's putting those together. I don't really see anything about styling in there. So this code provides basic structure to truly capture the Paul Smith style you should enhance this with this. Okay. So we'll see if the other ones do the same kind of thing. Okay. So it's basically kicked off. It's got Paul Smith colors. Okay. So it's very interesting. just Googling quickly what the Paul Smith colors are. You can get a sense of, this kind of stripey color there. so it's definitely defined some interesting colors. I wonder how it's actually going to use them in here. And we can see that it's also defined nicely, the top achievements of each of these, which is, I think I said the top five achievements from memory. Yep. I said the top five achievements. So this seems to have paid attention to that and responded quite nicely in there. And it's basically telling us that it's going to make this out. So this is almost like that it's saying this is already done and ready to go. I think this one, we will sort of try building it. Gemini has been going. so Gemini we've got, oh, we got an interesting thing here where it's been done it for two of the emperors, but then it's just sort of sit our ad details for, the others in here. And then it hasn't really got anything about the Paul Smith colors or anything. Although it's got a font name, Paul Smith's handwriting. Again, kind of interesting because that there is like a, some kind of font like that. My guess is that it's probably not easy to get that font. Okay. I'm not seeing a direct link to that font when I actually come and look that up. what else would they have got in here? it's got for this, okay. The first key thing I think is I need to tell it that you need to give me the details for all five emperors. Okay, so I'm going to give this same prompt to the other ones, although I think the Claude one may have already done that, but let's just say. Let's just pass that in for each of these. this one actually It looks like this one did do something along the lines of what we actually wanted in here. And this time, we've certainly gotten some nicer formatting coming out of it for that one. For Claude, okay, suddenly here is the complete code with the details for each emperor and the content for each slide. Okay. So it seems I messed up, copying it across, where for some of them, I had a slide for each of them. for the ChatGPT one I had two slides for each of them. Although it doesn't seem like it's made the extra slides there. it seems to be sticking to one slide for each of them. let's continue generating. All right. So I think now what I'll do is I'll copy the code over, install the packages and we'll start to see, actually how well do these actually work? So I've managed to get the code into a VS code for each of these. You can see I've basically got one for the chat GPT slides, one for, the Claude slides and one for the Gemini's slides. I will say that the Gemini one, did have errors in it and it required me going back to it multiple times, asking it to fix its own errors. Cause I deliberately didn't want to fix any of themselves. It was just basically for getting to import different things in here. And it was only when I basically said to it, come on, like you need to get your act together kind of thing that it was like, I sincerely apologize for repeated errors. You're absolutely right. These mistakes aren't acceptable. And then it actually did fix all of the code in there. So I've gone through and basically just run each of these, and given them, just change the name of the front to be. in it so that we know where they came from for going out here. And we're going to do next is basically upload them. I don't have PowerPoint on this machine. so I'm going to upload them into Google slides and we can basically have a look at what we get up there, knowing that's probably not the perfect way to evaluate these. but at least we can sort of see and get a sense of, what they were doing, what they were thinking. in here. Okay. I guess another way I can basically check them is just as sort of use the preview tool In the Mac to go through them. So this is the ChatGPT ones, they come out. pretty much normal sort of boring slides. the content, I haven't gone through the content thoroughly, but it seems reasonably sort of right. Yeah, focuses on Marcus Aurelius meditations. it's got some different sort of things in there. If we have a look at the Claude one. The Claude one has the colors, or certainly is getting at some of the color ideas from the Paul Smith thing in there, very sort of boring layout of the slides, right? It hasn't really sort of nicely proportioned them in any way. but we do have like, when they actually ruled, you know, the ruling dates for each of these. Although we've got, things overlapping each other, that, which is also not ideal. And then we can see with the Gemini one. We've basically got some sort of repetitions of things. And I'm guessing we're missing something. This could be because we're looking at something that's black on black. Okay. So If we bring these over and bring them in, we can actually see what they will look like in a Google slides. So we can see the ChatGPT one, has gotten stuff, although it looks like it's bleeding across the edge on Google slides, although that is probably a Google slides issue rather than a PowerPoint issue in here. again, we've got, I guess looking at these, we've got the content in these, but we haven't exactly got nice layout in here. okay. When we look at the Gemini one. And it's interesting that the errors that the Gemini one was getting was it was trying to put in images. and it just was getting errors because those images didn't exist. So you could imagine if we had prompted it that, okay, here are the five images we've got, use those, in here or something like that. We could've gotten some better results. this is not good though, where it's basically giving us Trajan's accomplishments. And one of the five accomplishments is he was considered one of the five good emperors. That's not, really holding the context of what it is that we're sort of going for, in here. Again, with Marcus Aurelius the idea of, that one of his achievements was the thing that we're actually making the the deck about really perhaps signifies to me, it didn't really understand what I was going for in here. It does seem though that the main issue is the issue around design. what I want to do is go and have a look at a product, which actually I use quite a bit, Which is all about getting LLMs to create nice designs by using certain components. And while it's not actually doing sort of powerPoint slides. it's doing websites. so let's jump in and have a look at that. Okay. So another service that is not exactly PowerPoint, but is kind of, I think, in the same realm. is that a website's right. And this is a service called v0.dev created by Vercel. And you can see, I basically put it in a prompt in there. and the prompt was create a website about the five good emperors, have sections with pictures and profiles for each emperors showing their name. now I didn't tell her anything about the five imprints. It's basically using the large language model. That's running this. Guessing it's open AI for this. but it's either got a lot of good things in context in here, or it's got a fine tune model. So what basically v0.dev is creates a websites. and you can come through and look at them so we can see here that this one we've got five good emperors. Trajan, Hadrian, Antoninus Pius okay, so it's got one of them wrong. So it's missed out, Nerva the first one and it's added Marcus Aurelius' son who was probably not considered one of the good emperors, He was a bit of a crazy guy. if we look at, some of the other ones, though, here, we can see, again, it's getting some of the content wrong. But definitely stylistically it's much more kind of interesting here. Now having put in the stuff around, the advertising agency and stuff like that, we could certainly come through and try that out. and we can see this one hasn't done a very good job at all on that one. but the whole idea with V0 is that once you've got something, you can then reprompt it. so now I can basically ask it, use the colors of a Paul Smith design. let's see, what we'll basically do is take what we had there and then sort of remake it. so obviously we looked before at the Paul Smith colors are very flamboyant, very colorful. And it's kind of gotten the colorful thing. I'm only seeing two colors in here though, so that's not exactly ideal. but one of the things that I'm kind of thinking is that, you could imagine a service or an agent that allows you then to look at the slide output, and then, sort off reprompt it, and basically improve on it and say, okay, I like how you did this slide, but change this other slide. I need more information about this. I need more information about that in there. And I think that, it's probably not that far away. my guess is that there are probably startups out there already working on this. if there aren't already, products that exist really for doing this. it's kind of an interesting problem to do in that, I think it's definitely a tractable problem that you could do with LLMs. The issue becomes, how do you deal with design features? and putting in colors, et cetera. So I'm just taking the colors from the Claude one. And I'm just gonna say, Remake it with these colors so you can see that even as it's remaking it here. it wants to stick to a limited number of colors on the page. Now that could be something in its prompt about, telling it not to be sort of just too many colors or something that stick to sort of two or three colors. and hone in on those, which it seems to have done with both the yellow version and this other version as well. Anyway, I thought this would be a fun little exercise, just to get a sense of how, different or how similar the ChatGPT models, the Claude models, the Google models are when we actually start putting these together. as you can see from the slides that we got out still got a while to go before we could just do this with the off the shelf, LLM. We're probably going to need an agent we're probably gonna need, some kind of functional prompting and, some system that understands about design a bit more and understands about how to take it. So I think it's interesting that the LLMs have been able to get the facts, that we're sort of after. and you could imagine if you preloaded them with a context for your company, for your project, that kind of thing. it will be able to get that. and especially if you imagine it, that in that context, you've got things like a recording of a meeting that you had about the project. a set of sort of agreements notes that you came about for that project, that kind of thing. it should be able to do those sorts of things. I think the challenge is going to be making the design look really sleek. And also getting the style in a way that, people are going to be happy with it. So I think There are some slide creation tools out there that once you've got your facts in there, you can sort of just click the button and keep changing styles until you get something that you like. And you could imagine that would work really nicely here. Anyway, this is just a fun, little play full thing that I wanted to try out on the weekend. let me know what you think. let me know your ideas on how to make this better. if people are really interested, maybe I look at sort of writing an agent to do this or writing some kind of thing to do this. And have a go at making it better as we go through. Anyway, as always, if you found the video interesting, please click like, and subscribe, and I will talk to you in the next video. Bye for now.
Info
Channel: Sam Witteveen
Views: 29,877
Rating: undefined out of 5
Keywords: claude 3, claude 3 opus, claude 3 api, anthropic claude 3, chatgpt, gpt-4, machine learning, claude vs chatgpt, gpt4 turbo, chatgpt vs claude vs gemini 1.5 pro, open ai, generative ai, gpt4, claude 2, claude ai, claude pro, chatgpt plus, chat gpt, ai writing, gemini pro 1.5, gemini 1.5, 1 million context window, gemini 1.5 pro demo, GPT 4, GPT 4 turbo, google launch gemini 1.5 pro, 10 million tokens, first hand demo, actual demo gemini 1.5 pro, google gemini era 5:51
Id: g3vroajaWwg
Channel Id: undefined
Length: 16min 48sec (1008 seconds)
Published: Sun Mar 10 2024
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.