Should You Use Open Source Large Language Models?

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
There are over 325,000 models on Huggingface and  thousands more are being added. And why might   you choose to use AI models like these? Well,  let's start by getting a few things straight.   The models we're talking about in this video,  they're specifically LLMs, and that's large   language models, which are foundation models  that use artificial intelligence, deep learning   and massive datasets to generate text. We're  talking generative AI. And there are two types of   generative AI models: There's proprietary models,  and there are open source models. Now, proprietary   LLMs, those are owned by a company who can control  its usage. A proprietary LLM may include a license   that restricts how the LLM can be used. On  the other hand, open source LLMs are free and   available for anyone to access, and developers and  researchers are free to use, improve or otherwise   modify the model. Now look, it's not true in every  instance, but generally many proprietary LLMs are   far larger in size than open source models. And  specifically in terms of parameter size. Some of   the leading proprietary LLMs extend to thousands  of billions of parameters. Probably? Actually,   we don't necessarily know because, well, those  LLMs and that parameter counts are proprietary.   But bigger isn't necessarily better. And the  open source model ecosystem is showing promise   in challenging the proprietary LLM business model.  So let's discuss the benefits of open source LLMs.   Let's talk about the types of organizations  that are using them. Let's talk about some of   the leading open source models available today,  and we should talk about the risks associated   with using them. Now, clearly, one of the  benefits of a open source large language model,   that has to be transparency. Open source LLMs  may offer better insight into how they work,   their architecture, and the training data used to  develop them. Another big one is pre-trained open   source LLMs allow a process called fine tuning.  That means you can add features to the LLM that   benefit your specific use case and the LLMs  can be trained on specific data sets. So I can   fine tune an LLM with my own data. And community  contributions are a big plus. Using a proprietary   LLM means you're reliant on a single provider,  whereas open source models benefit from community   contributions and multiple service providers. You  can experiment and use contributions from people   with varying perspectives. And these benefits  have led to all sorts of organizations to use   open source LLMs. In another video, I addressed  how NASA and IBM developed an open source LLM   trained on geospatial data. Some healthcare  organizations use open source LLMs for diagnostic   tools and treatment optimization. There's an open  source LLM called FinGPT [fin / financial]. It was   developed for the financial industry. Which brings  us onto the topic of talking about some specific   open source LLMs that you might find of interest.  Now Huggingface maintains an open LLM leaderboard,   and that tracks , ranks, and evaluates open  source LLMs on various benchmarks like which LLM   is scoring highest on the Truthful AI Benchmark  series, which measures whether a language model   is truthful in generating answers to questions. So  it gives those answers a score. And the top spots   on this leaderboard, they change frequently.  And it's quite fun to watch the progress these   models are making. Many of the models on the  leaderboard are variations on the Llama 2 open   source LLM. That's the one provided by Meta  AI. And Llama 2 encompasses pre-trained and   fine tuned generative text models from 70 billion  all the way down to 7 billion parameters. And it's   licensed for commercial use. Vicuna was created  on top of the Llama model and fine tuned to follow   instructions. And then it's also Bloom by BigScience, which is a multilingual language model   created by more than 1000 AI researchers. Now, one  area that both proprietary and open source LLMs   share is their associated risks. Although LLM  outputs often sounds fluent and authoritative,   they can be confidently wrong. Hallucinations,  they can result from the LLM being trained on   incomplete, contradictory, or inaccurate data  from misunderstanding context. Bias happens   when the source of data is not diverse or not  representative. And security problems can include   leaking PII, and cybercriminals using the LLMs for  malicious tasks like phishing. Especially in these   early days of large language models, we do need to  mitigate risk. But open source LLMs are thriving   in business. Here at IBM, the Watsonx.ai Studio  makes available access to multiple Llama 2 models,   and IBM has released a series of foundation  models of its own called Granite. And this   space is changing rapidly, making open source  LLMs a field well-worth keeping a close eye on.
Info
Channel: IBM Technology
Views: 282,895
Rating: undefined out of 5
Keywords: IBM, IBM Cloud
Id: y9k-U9AuDeM
Channel Id: undefined
Length: 6min 40sec (400 seconds)
Published: Mon Nov 27 2023
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.