BabyAGI: Discover the Power of Task-Driven Autonomous Agents!

Video Statistics and Information

Video

Captions Word Cloud

Reddit Comments

Captions

[00:00:00] Okay, in this video we're gonna be looking at a form of autonomous AI or an autonomous AI agent. And this is actually from a paper that was released online called Task-Driven Autonomous Agent using,GPT-4, Pinecone, LangChain for diverse applications. it was actually announced on Twitter. I saw this,last week, by Johei Nakajima. and he had a nice sort of, tweet post announcing it and if we go through it, we can see that, okay. The idea here is that we are using a large language model to generate ideas, do some sort of critique on those ideas and thenideally execute tools So this has elements of throwbacks to Toolformer, some of the other key papers around this, that have been here. one of these things shows, it's got some nice diagrams in here, showing like how this works and some of the key points into it. So the user basically provides an objective and a task and then once that is set into a task queue, [00:01:00] the large language model basically decides how to execute that, and then as it, develops stuff, it saves it to a memory. and that memory then can be accessed through different things. it then, goes back a, around these loops. it's got a prioritization agent to decide what. Task comes next and what you know is priority, et cetera. And it basically goes, through this, over time. So if we look in here, there, there's, they've got another diagram in here, just going through it as well, of basically you've got this task queue. we've got the execution agent. And each of these things,for these parts is actually just the same language. you're just using different prompts. You're using different ways to, manipulate the output. And then you've got the memory, which in this case they're using Pinecone, which is a vector store database, so that they can store things in there and that you can basically do lookups on them and stuff like that as well. So the. The idea here I think is really good. the paper talks a lot about using, GPT-4 Pine Cone [00:02:00] and the LangChain Framework,to basically do this. along with this, he also released, some, code. So the code, has got the nickname, baby AGI. I don't think this is approaching AGI in any way, but it's cute name. And you know what, this is supposedly a very paired down version of the original one. So we don't actually know how good the original one was. As far as I understand, we hasn't been released or we haven't, got, videos of trying it and stuff like that. looking at the baby g i, that's here. this is de this code has definitely been released. We can play around with it. In fact, I've set. A CoLab so that you can also have a play with it. And so we can just have a look at, how it works, some of the ideas behind it, and what you could do with an agent like this, in the future. So you need quite a number of API keys to get this going. you'll need an opening AI key. of course you can basically set it up to use either GPT-4, or in this case I'm using GPT 3.5 turbo. [00:03:00] It's nice enough to have a print statement that does say if you're using GPT-4, that, you know, this could get expensive. we also need Pinecone API key and you need to set up the Pinecone environment. So this will change based on your, where you're setting it up. I just left this in so that you could get an idea of what you should be putting in there. Cause I think it's not always totally clear. you need to put in a table name. you can't use underscores or anything in this table name, here. And then you need to set up an objective and initial task. So here I've basically said, okay, plan a romantic dinner for my wife this Friday night in Central Singapore. and the initial task is make a list of the tasks, and you'll see That while this particular, one is not set up with any tools or anything, it does a nice job at going through, the thinking processes that it would need to do. Now in here it does seem like they're planning on adding tools. to, to this, looking at the, at the code base there are, tools being added. it's also interesting that looks like they're planning to add, or they've added,the [00:04:00] llama, I'm guessing this is the four bit version of llama that runs locally, to do this thing. So that, that would be interesting to. See how well that does. if we go through, we can look and see, okay, after it's got a lot of setup code, for doing this and for Pinecone setup code,the main logic behind these things, is not, overly complex. So we can see that we've got the task creation agent. So this is basically got its own prompt here and we've just got nice sring basically substituting, doing kind of what LangChain does, with its prompts. and it's then able to, basically call that it's got a prioritization, agent again, same concept, but with a different prompt. So you are using a different prompt to do the same thing. you can see here you're a task prioritization, AI tasked with cleaning, the formatting and reprioritizing the following tasks. And the tasks in tasks. Consider the ultimate objective of your team passes in that as well. So this is very similar to some of the agents that we see in LangChain and that we've,looked at as well.[00:05:00] We've then got at the execution agent, again, same concept, different prompt. You are an AI who performs one task based on the following objective. Take into account these previously completed tasks, passing into context, your task, and then the response. so it's interesting here that they're setting the temperature quite high, for. whereas normally in Chen you would actually set the temperature pretty low, like close to zero or zero, for this kind of thing. so that may have also of affected its output, for doing this. alright, so then you basically just go,you've just got this huge leap. It goes through it. Does it, let's look at some of the output that we are getting from. So first off, make a list of tasks. so you can see that it does a pretty nice job of choose a romantic restaurant in central Singapore. Make a reservation for two at the chosen restaurant. select a bouquet of flowers to surprise your wife with. This is definitely not something I asked for, but okay. Maybe it's something that it, decided. you could imagine in the [00:06:00] future though, you would want the agent to actually come back to you with suggestions and then you would say yes or no to these suggestions. choose a romantic gift for your wife. purchased a selected gift from the store in central Singapore. it's really going all out on this dinner. And then, finally confirm the dinner, et cetera. Okay. So, research and choose a romantic activity to compliment the dinner experience. if anything, I would say that the agent is very verbose. and again, this would all be down to the manipulation of the prompt, that you would want for something like this. And you could imagine that this prompt might be really good for one task, but not great for another task. All right. It goes through, it comes up with su suggesting a private Sunset yacht cruise, along the Marina Bay. Now, this, it is, it's very good in that it's getting,locations, right? And it's getting things like that. Again, this is to be expected, because we're using, one of the large, open AI models, for doing this. it's [00:07:00] quite funny how it's, choose a romantic outfit, hire, rent a luxury car,a lot of things that it, it's making suggestions, but they may not be, Ideal sort of suggestions for, a romantic date in Singapore. one of the things I did find out was interesting was, and it's funny how it goes on to say, please note that you may need to provide a valid driver's license and stuff like that. I, you could imagine in the future that these things will have a variety of information on you. And then be able to use that, like if it's got a knowledge base on you of your driver's license, your credit card number, all those sorts of things. I certainly wouldn't give this one my credit card number cuz it seems to want to spend a lot of money. so you can see here it's picked out, a jewelry store. It actually picks out, three real jewelry stores and it get, seems to get their location correct. Uh, pretty impressive. it then also picks out a nice, restaurant. and it's interesting that, the restaurant that it picks is a luxury restaurant. I think it's a Michelin, I'm pretty sure it's a three star Michelin restaurant, in Singapore. and [00:08:00] so again, this is all coming from the open AI model. there, there's nothing, unique about this that's coming from Baby agi. It's just nice manipulation of the open AI APIs, in this, it goes on and on. It takes, a bit of time to run, through these. in the end I just of stopped it. cuz it certainly, can ping the API quite a bit and get a lot of things back. it is interesting to, you know, I tried another one. Planning a party. And that also, did the basic stuff quite well. what we are lacking here, is the ability for it to come back to you and to know what it should come back to you about. and this is gonna be one of the key things, I think for a lot of these things going forward. it claims that it's contacted the restaurant and it's made,a booking. It's very strange that, about their policy on bringing outside candles. Again, this is I would say, dying in the details, o of this kind of thing. Anyway, it's here. You can have a play with it yourself. I ended up stopping it, just cuz it seemed to be going on and on. [00:09:00] Alright. The idea, I think is the key thing here. the idea of developing these, agents that have the ability to run a variety of different tasks and are incorporated with a variety of different tools. That's what we're gonna see a lot of in the future. we're gonna see this with the, chatGPT plugins or the open AI plugins format that's coming along. We're already seeing this with some of the things in LangChain, so this sort of just. Is a nice way of wrapping up some of these ideas and giving you some idea of how they could be in the future. As always, if you've got, any questions, put them in the comments below. if you found this useful, please click, like can subscribe. I will see you in the next video.

Info

Channel: Sam Witteveen

Views: 34,716

Rating: undefined out of 5

Keywords: LangChain, open ai, chat gpt api python, chatgpt api tutorial, chatgpt api key, chatgpt api php, chatgpt api react, chatgpt api postman, chat gpt api integration, chat gpt api project, chatgpt api cost, chatbot api keys, chatgpt api, chat gpt app, chat gpt app development, chatgpt applications, chat gpt app link, chatgpt api javascript, chatgpt api python, chatgpt software development, chatgpt software engineering, open ai api, babyAGI, task agents, autonomous ai

Id: QBcDLSE2ERA

Channel Id: undefined

Length: 9min 42sec (582 seconds)

Published: Mon Apr 10 2023