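[Full interview] Charles Liang (梁見後) asks: Will AI control humans? Jensen Huang (黃仁勳) "answers like this". Two Taiwanese kids banter on stage! Jensen Huang and Charles Liang give a talk in three languages; he exclaims: "With so many people here, you're haggling with me?"│[Top News]│20240606│三立iNEWS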

Captions
You're charging me $2 million? More than $2 million for [Laughter] [inaudible]. I think when you say green computing, you mean energy-efficient computing, right? Yes. NVIDIA is energy-efficient computing. Yes, we follow you. Will AI control us? Of course not. Don't forget your other baby. ...NVIDIA founder and CEO Jensen, to share his great vision with us. [Applause] [Music] [Applause] [Music] Jensen, thank you. Hi, everybody. Now, AI is changing because of you. What's new today? I have to admit, just now, when I was coming to your keynote, in the car I fell asleep, and so right now I'm a little bit groggy. So if I say nonsense things, please, let me apologize first. No. Well, let's see. Charles, we've gone back a very long way. Yeah. And, um, what are we doing? Oh, I needed some water. I need... Okay, all right, my energy. Yeah. They said I was on this side, and you keep going on my side. This is what happens when we don't practice. You don't need to, and you have no time. You don't need... And so, um, what were we saying? This is a very important time, because we have a new age of computing coming. There are two things that are happening at the same time. The first is accelerated computing. Accelerated computing has arrived at a time... Oh, green computing. Yeah, green computing. Yeah, okay. I think when you say green computing, you mean energy-efficient computing, right? Yes. NVIDIA is energy-efficient computing. Yes, we follow you. All right. [Applause] Look, green computing and green computing. All right. So accelerated computing's time has come, because for a very long time the amount of data processing has been increasing exponentially, and yet CPU scaling has slowed for many, many years. So we now have an enormous amount of wasted energy and wasted cost trapped inside the data centers. So when we accelerate the data centers, the savings are incredible, because the waste has been trapped for so long. And so now we can release the waste and use
that energy for a new purpose. Number one: accelerate every application, accelerate every data center. These amazing servers here, right? So many new products. You have 220 new products? Unbelievable. Did he tell you that already? [inaudible] I came to announce Supermicro products. And so that's the first thing. The second thing is, because the energy efficiency, the performance efficiency, and the cost efficiency are so incredibly great with accelerated computing, a new way of doing computing has emerged, and it's called generative AI. Generative AI is an incredible thing. People say generative AI, inference: it's related, not the same. Inference: recognizing cats, dogs, speech. Generation: text generation, image generation, video generation. That's what we call generative AI. The pressure of generative AI... not the pressure, but the transition to generative AI will affect every single data center in the world. We have a trillion dollars' worth of data centers in the world that's established, $3 trillion probably by 2030. In another 6 years, we have to modernize all of them with these amazing systems. Yeah. That's the reason why the demand is so great: because all of these data centers have to be modernized. And Charles and the Supermicro team are ready to take your order. Jensen, I'm your... I'm your best sales guy. Thank you. I work on commission. No commission; we will buy more chips from you. Don't buy more [Laughter] chips. So Supermicro is now shipping data center liquid cooling, DLC racks, in volume production now, to lower the power consumption so you can manufacture more AI chips. Yeah. Yeah. Thousands of [inaudible] here, you [Applause] [Laughter] see. I have many American colleagues; they don't understand my Chinese. I have many Chinese colleagues; they don't understand my Chinese. Hi. [Laughter] We are shipping up to 1,000 racks per month now. 1,000, right? Like, multiply it by ASP. Yeah, you're going to be a gigantic company. Yeah, thank you. That's why I need more chips.
Did you guys all do the math? Millions times thousands times 52. No, no, no. You're charging me $2 million? More than $2 million for [Laughter] [inaudible]. Are we allowed to do this on TV? Are we on TV? I guess [inaudible]. So we are shipping about 1,000. That's incredible. Now this, 600,000 parts. This is probably more than 600,000 parts. How many pounds? Oh, I don't know. Can I move... three, I think it's 3,000. More than 3,000. Yeah, it's incredible. So, yeah, our goal this year is to ship more than 10,000 racks at least. You know, Charles, this is the thing that's really amazing. People think that we're building GPUs. You know, a GPU is a chip. There are 72 chips in here, and then there are 600,000 other parts. The 72 chips probably weigh one pound; this is 3,... 2,999 other pounds. So the amount of technology that's inside one of these racks is really quite extraordinary. This is a technology marvel, the most complex, most advanced computer the world's ever made. Yeah, exactly. [inaudible] Yeah, absolutely incredible. And the software that it takes to run this is already unbelievable. Yeah, unbelievable, isn't that right? And so I think that people now are starting to realize that when we say GPU server, of course the brain is the GPU. Yeah. But the system is much, much more complex than that. And Supermicro does amazing engineering. Thank you. [Laughter] [Applause] Okay. Then there's some Americans... This year we are going to ship, hopefully make D... When we're together, sometimes we speak Taiwanese, sometimes we speak Mandarin, and then when we disagree, we speak English. We try to take DLC market share from 1% to 15% this year. Wow. Save a lot of power for your [inaudible]. Yeah. Yeah, the energy efficiency is so much better. The cost to the data center is cheaper. Cheaper, that's right. People don't realize this: liquid-cooled systems eliminate an enormous amount of cost in the data center. Yeah. So that you can capture that waste and put it into computing. In the future, in the future,
computing throughput is revenues, because it's token generation, and token generation is dollars per million tokens, just like energy is dollars per kilowatt-hour. We have now invented a new commodity. This is a very important idea for all of you. This is a new commodity. It has value, and the faster you can generate it, the higher the throughput, the greater the utilization, the higher your revenues. It is absolutely true, and it's directly measurable. That's why this is a factory, not a data center. That's why this is a factory, not a file server. It's not for retrieval of files; it's not used for exchanging emails. This is directly generating revenues for factories. That's why we call them AI factories. And so powerful, and only [inaudible] dollars. [Laughter] Okay, so $3 million, and you can generate who knows how much revenue per year, right? Oh, 3 million, 1,000, and every year has how many months? 12. The return on large language model generation, token generation, is going to be very, very good. Yeah, it would be huge. And the reason for that is because the token embeds intelligence. Yeah. And the intelligence could be used in so many different industries. And so the future is very important. It's time to startup. Yeah, time to startup, throughput, yeah, utilization all matter. So reliability has revenue implications, throughput has revenue implications, startup has revenue implications. Yeah. That's why it's so important that we integrate the whole system at rack scale, get all the software working, connected to all the networking. And we build all of our own data centers, we build our own supercomputers, so that we know when you install Supermicro in your factories, the startup time will be extremely fast, your utilization will be extremely high, and your throughput will be extremely high, because your revenues depend on it. Factory output is measured by all of those factors. Very complicated. Yeah. And all of those racks are NVIDIA software licensed, all certified. So the sound of
that parking the cable, and they can run. And it runs, that's right. And all of the NVIDIA NIMs, all of the large language models, it just runs on all these systems. Yeah. [Laughter] [Applause] We are shipping thousands... Still very handsome. Yes, very beautiful. Charles said that everything in here is NVIDIA. For all the American citizens there: from Super to H AI, everything all NVIDIA software, all green computing, all green computing, all the support. That's fantastic. Good. Let's go through some detail. Okay. H1, H2, B1, [inaudible] cooling. Wow. Shipping in... wow. And this one, your p200. Uh-huh. Fully ready. Beautiful. For your chip. Beautiful, beautiful. This will be how many times faster than this? So for Blackwell, Blackwell has air cooled, liquid cooled, x86, Grace, NVLink 8, NVLink 2, NVLink 36, NVLink 72. Yeah, so many different configurations, so that depending on the type of utilization, the type of use case you have, the type of data center that you have, Charles is ready to serve you immediately. Right, immediately. Just need a chip. One hand we get a chip, second hand we ship the... Wow. Thank goodness we only need two hands. In two weeks. In two weeks? Can be very, uh... That's incredible. And all of it software compatible. This is really the amazing thing: literally everything here is software compatible, 100%. Yeah. And software, as we know, is the most complex part of high-performance computers. Yeah. Thank you for those great offerings. They are all ready to serve our customers. There are three very important software stacks that we have in our company that everything is built on top of. The first, of course, is CUDA, which is very famous. The second is for all of the networking, because networking is not just networking. Networking today is a computing fabric, not just for sending email to each other. 400, 800 mega... giga... Megahertz? This is not the 1980s. Megahertz, kilohertz,
gigahertz, gigahertz. Yes, 400 gigabits per second, 800 gigabits per second, and then of course the next generation is coming, 1,600. But the important thing is, all of the software that we have that runs on the networking for distributed computing is on top of two software stacks: one is called DOCA, for the NIC, and NCCL, for the fabric. Yeah. And it enables us to distribute the workload across the network very, very efficiently, because Ethernet was not designed for high-performance computers. You make our job easier, but still very busy, because you have so many great [inaudible]. My job is to help give you job. And because you do such a good job, it gives me job. Oh, don't forget your other baby. Yeah, yeah. Inside here, this is an incredible system. In fact, these chips are all connected together using high-speed interconnect, the world's fastest SerDes. The SerDes is incredibly fast and very energy efficient, and so we can connect this Grace CPU to dual Blackwell GPUs. And that's very important, because in the training stage, the memory system of Grace could be used for checkpoint restart. Checkpointing and restarting is very important for high utilization and high uptime. And so checkpoint restart could be stored in the system memory. That system memory is very low energy, very low power, and the link between Blackwell and Grace is very, very high. Second, during inference time, as you know, there's a concept called prompts, context, in-context training, prompting. That prompt memory, that context memory, is right here. This is the memory, the thinking memory, the working memory of AI. And so this memory needs to be very high performance, very low energy. And so during training we have good use for the Grace CPU; during inference we have excellent use for the Grace CPU. And the interconnect is very, very high speed, very low power, [inaudible]. And so the benefit is, because we compress so many in one system, yeah, if we save 20 watts, 50 watts on the interconnect, yeah, you multiply by the
whole rack, then we can take the energy and use it for computing. Yeah. So energy efficiency translates to higher performance. That's right. Green computing, huh? I am a Supermicro employee. I'm a Supermicro employee. ...control us? Of course not. We have to... the most important thing, of course, at the moment is we have to make AI work well. Right now, AI is, of course, working extremely well, and in many applications AI has become good enough to become useful. It has achieved the plateau of good enough, very useful. However, we want it to be incredibly good; we want it to be very functional. Everything from guardrailing to fine-tuning to skill learning, there are many different things that we still have to improve. Okay, so we know that AI still has a long way to go. That's job number one: to advance the technology. At the same time, we have to advance safety technology. As you know, the planes that we all flew on to come here have autopilot, and autopilot is automatic technology. In order for planes to be safe, a great deal of technology had to be invented to keep the plane safe. Yeah. Also practices to monitor the planes: air traffic control, other planes monitoring the planes, pilots monitoring each other, many different ways to keep autopilot safe. In the future, we'll do the same thing with AI. There will be AIs that watch AIs, there are people that watch AIs, there are guardrails that keep AI guardrailed. And so there's going to be a whole lot of different technologies we need to create for safety. And then third, of course, we need to have good policies for safety, good practices and good policies for safety. Talking about it is very important, so that we can all remind each other that we have to do good science, good engineering, good business practice, good policy practice, good industrial practice. All of those things have to advance. So, perfect strategy. So the conclusion is, one: the more you buy, the more safe. The more you buy, the more
you safe. The more you buy, the more you safe. Yeah. Thank you, Jensen. Thank you. Good job. Thank you, everybody. Thank you. Okay, thank you, thank you, thank you. All right, have a great show. Thank you.
Info
Channel: 三立iNEWS
Views: 48,591
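Keywords: 三立iNEWS, live stream, live news, politics, Big World, Hou Yu-ih, Ko Wen-je, Terry Gou, Lai Ching-te, Trust Taiwan, finance, lifestyle, Tsai Ing-wen, Sanlih (SET), NVIDIA, Jensen Huang, factory setup, Taiwan, AI, father of AI, Little Pink, China, CCP, Charles Liang, Supermicro, AMD, computer, computing, high efficiency, green computing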
Id: oUNMK-uEIyc
Length: 22min 9sec (1329 seconds)
Published: Thu Jun 06 2024