Install our extension to search inside any video instantly.

【全程字幕】黃仁勳主題演講重點一次看! 餐廳入列生態系引笑聲! Jensen Huang AI【國際360】20260602@全球大視野Global_Vision
Added: 2026-06-03

818 views271:09:55頭條開講HeadlinesTalkOriginal Release: 2026-06-02

Agentic AI represents a fundamental shift in computing where autonomous systems (agents) orchestrate tasks by combining large language models with specialized tools and memory management, enabling more productive and efficient workflows across industries. This new computing paradigm requires specialized hardware architectures like Vera Rubin and Vera CPU, which are designed specifically for agent workloads with nanosecond-level response times, unlike traditional CPUs built for human users. The agent computing pattern (model + harness + tools + runtime) is being replicated across multiple platforms including PCs, robots, and data centers, fundamentally transforming how computing systems are designed and deployed.

[00:00:00]Welcome to GTC Taiwan.

[00:00:06]So great to see all of you.

[00:00:10]Very good to be home. I brought my parents home. Where are my parents?

[00:00:14]Everybody give round of applause to my mom and dad.

[00:00:23]And a round of applause for our pregame show superstars. Ladies and gentlemen, look how adorable they are.

[00:00:40]The superstars of Taiwan. Uh there are so many of you here today. We are broadcasting this right now to 70 other watch parties across Taiwan. 70 different conferences are going at the same time. Everybody is watching this keynote. We have so much to tell you and I have so many partners to thank. It is incredible how large our ecosystem in Taiwan has become. Most of the time when people think about ecosystem, they think about our software stack. They think about the developer ecosystem above the computing systems that Nvidia builds.

[00:01:19]But Nvidia's ecosystem spans all the way upstream to all of our supply chain here in Taiwan where it all begins and downstream all the way to data centers and eventually to end users. Today we're going to talk about almost all of the ecosystem. There's so many people to thank. I love my ecosystem here. I mean, there are so many companies here and some of my favorite ecosystem partners.

[00:02:22]So many Taiwan's rich ecosystem, the richest ecosystem, the world's best supply chain ecosystem. Unbelievable.

[00:02:34]Well, thank you all for being here and uh this year this year our businesses together are growing in incredibly. Two years ago when I was here, I started to talk to you about how AI has moved from generative AI and the other waves of AIs that are coming. The next wave of AI was agentic AI. And today we can say that agentic AI has arrived, that useful AI has arrived. Now what does this mean?

[00:03:02]This is GitHub. This is of course one of the first applications of Agentic AI is software coding. One of the most valuable professions. Incredibly large ecosystem. 30 million 40 million professional software developers.

[00:03:19]Probably another couple of hundred who are students and enthusiasts and so on so forth. But say 30 40 million software developers in the world code for a living.

[00:03:32]And this represents most of them. This is GitHub. The pull request is when they download software, they modify it. And commit is when they push it back up.

[00:03:43]Okay? And so if you could look at this in 2023, the number of commits was 300 million.

[00:03:54]2024, 400 million. 2025 500 million commits in the first few months. In the first few months of 2026, it has nearly tripled. Now, what does that mean?

[00:04:14]30 million software developers representing about $3 trillion worth of GDP producing three, that's what they're paid. $3 trillion worth of salaries per year which is generating economic growth for the rest of the industries. Say a hundred trillion dollars of the world's industries is impacted is generated by $3 billion worth of salary. That $3 trillion, excuse me, three trillion that $3 trillion worth of salary is now producing nearly three times as much output.

[00:04:53]It's effectively a $9 trillion productivity from $3 trillion of salaries. Does that make any sense? The difference is absolutely extraordinary.

[00:05:07]This is the potential. This is the promise of AI. The number of engineers, software engineers is actually increasing. People talk about AI reducing jobs. Complete nonsense. It's causing more software engineers to be hired. And the reason for that is very simple.

[00:05:23]If you can hire a software engineer and you could generate $9 trillion worth of productive work, why wouldn't you want to hire more software engineers?

[00:05:34]If that line was flat, then obviously people will hire fewer software engineers. But because the output is so incredible, people want to hire more software engineers. This is going to show up in our economy somehow soon. And so the first thing is useful AI has arrived. Now, what does that mean from the industry's perspective? From the industry's perspective, that means that tokens are now in extraordinary demand because if you could do this, you're going to want to produce more of it. And because tokens are now profitable units, tokens are now profitable units of revenues because it is now profitable.

[00:06:15]The AI companies want to build a lot more tokens, generate a lot more tokens, build more AI factories, which is the reason why compute demand here in Taiwan has skyrocketed.

[00:06:27]It is precisely the reason why all of you are so busy and your businesses are doing so well. In fact, that looks like some of your stock price.

[00:06:45]The compute pattern has changed.

[00:06:48]Everything has changed. So the first idea is that useful AI has arrived. AI is now a profit generator. AI is now a GDP generator. Behind it is a whole new kind of computing pattern. Not just a large language model, but an agent.

[00:07:06]Today almost everything we're going to talk about is going to be based on this.

[00:07:11]So let me take a quick moment and show you what I'm talking about. Inside in this is a this is an agent. It's an agent application.

[00:07:20]In the old days this would be application.

[00:07:24]This would be code and this would be operating system.

[00:07:31]Application code running inside an application. inside an operating system.

[00:07:36]Today it is agent which consists of a large language model or many sitting inside a harness and that harness helps it orchestrates it to do productive work. This is the input. When that input comes it has to understand, observe, reason, act, use tools. Use tools. That tool could be a spreadsheet, web browser, a data processing engine, database engine for example.

[00:08:09]This is orchestrated. This harness orchestrate this routing of information every single time it touches either processing the context, understanding what is happening, reasoning about what to do, coming up with a plan that you can act that it acts on. That orchestration path is orchestrated by some software. And so this is fundamentally a agent. It deals with short-term memory called working memory, long-term memory just like we do. We have long-term memory. And so the memory management system is incredibly important. This entire system is called an agent. The large language model is used to do the thinking and the harness connects everything together just like an operating system. Okay. And so this is the new computing model and this is what an agent it could do incredible things. This is the big breakthrough.

[00:09:12]The simultaneous conver the convergence of large language models that are now able to do a really good job thinking, reasoning, planning using tools and the fact that we have now these harnesses that manages memory. The orchestration uses tools. We can now do amazing things. Let me give you some example.

[00:09:33]This is this is a prompt. This is the prompt.

[00:09:37]This is the code that is generated. And this comes out.

[00:09:43]This is the input.

[00:09:46]This is the input. And that's the output.

[00:09:51]Do you guys What do you guys think? It's pretty amazing, right?

[00:09:59]We use cloud code here, but Codeexit does an incredible job as well. Here's another example. This is the input.

[00:10:04]Create a GIF. Nvidia gen green dots on black scatter form Taiwan 101 building. Morph to GTC Taipei 2026. Morph to Nvidia I logo. Then scatter and repeat. Right? So you saw that. That was the prompt. Here's the next one. I lost my remote control battery clip. It looks like this. Create a CAD file. It uses a tool create a CAD file ready for 3D printing to create a new new one. Make sense? This is now the new computing pattern. Whereas we used to launch an application, click and type, we now replace that with explaining to the AI what we want, our intent and the AI generates the code or uses tools and produce the necessary output.

[00:11:01]This is how computers are going to work in the future. This is agentic AI. For two years we've been building towards this and now it has arrived. Now one of the big breakthroughs of course is tool use. A lot of people have said you know Jensen AI is coming. Agentic AI is coming therefore all of the software companies are going to go out of business. I said it's exactly the opposite because there are going to be so many agents.

[00:11:30]The world is no longer limited by the number of people. Therefore, those agents are going to use more tools than ever.

[00:11:39]This is actually an incredible time to be a software company. But the software has to be presented to the agent in a way that the agent can use it. This is a break big breakthrough. And in fact, what we have done as you know what Nvidia's treasure is is all of our CUDA libraries. I call them CUDA X libraries.

[00:12:01]This is Nvidia's treasure. Today, we're able to now pres present these CUDA X libraries to agents who can use it much more effectively than even h humans. And so this is a wonderful time for CUDA X libraries. Let's take a look.

[00:12:22]20 years ago, we built CUDA, a single architecture for accelerated computing.

[00:12:28]We reinvented computing. A thousand CUDA X libraries help developers make breakthroughs in every field of science and engineering. CUDA X libraries are tools for agents.

[00:12:41]CU litho for computational lithography.

[00:12:46]Coop for decision optimization.

[00:12:50]CDSS for direct sparse solvers.

[00:12:54]AIQ for deep research across structured and unstructured documents.

[00:13:00]Aerial for AI ran, warp for differentiable physics, parabrics for genomics.

[00:13:11]At their foundation are algorithms and they are beautiful.

[00:13:17]>> Math is beautiful.

[00:13:24]The computing pattern the computing pattern of software is going to change.

[00:13:29]In fact, let's come back to this. This is the agent. It is the ultimate disagregated and distributed computing computing model.

[00:13:42]So many different computers are going to be activated in order to process this agent. The agent consists of model harness, tools and skills and a runtime.

[00:13:59]All of that is running at different places in a data center.

[00:14:04]You can think of the model as the brain, the harness as the body, the tools that it uses working in a runtime. Think of it as a workshop.

[00:14:18]So this is a person, a worker working with tools in a workshop. Of course, this is being done at extraordinarily large scales and each one of those steps are running in a different part of the computer. And you could see the large language model is thinking, context processing, observing, understanding the environment, reasoning, coming up with a plan, and acting on the plan. Every single time that happens, an entire rack of Grace Blackwell MVLink72 is activated. It's thinking with the large language model. Whenever it uses a tool, a CPU use is used. That tool could be a C compiler, it could be Python, it could be JavaScript or it could be accelerated computing. Today's agents are rel relatively simple users of tools. So many industrial companies are now part of our ecosystem because ultimately we're trying to build an entire stack just like GPUs just like when we were building Grace Blackwell MVLink 72 just like now we are building a full stack system so that our customers could build amazing AI infrastructure. Let's take a look.

[00:15:43]The world is racing to build AI factories. The largest infrastructure buildout in human history. AI factories are incredibly complex. Every layer, chip, rack, network, power, cooling, grid, must be designed together from end to end because compute is revenues.

[00:16:04]NVIDIA DSX is the blueprint, a reference design for building and operating AI factories at maximum efficiency and profitability.

[00:16:14]It starts with DSX SIM. With the DSX SIM Omniverse blueprint, partners design and validate an NVIDIA Vera Rubin AI factory before a single rack lands.

[00:16:26]They plan the layout, simulate the power and cooling, design the network, validate every integration, test every change in the digital twin.

[00:16:41]The factory powers on. DSXOSS takes over and provisions, operates, monitors, and remediates the infrastructure, turning the installed systems into trusted, multi-tenant, resilient, AI ready capacity.

[00:16:59]Today's AI factories overprovision power by up to 40%. DSXM Max LPS lets operators safely deploy more GPUs inside the same power budget, adding billions in annual revenue.

[00:17:15]Breakthrough hot liquid cooling at 45° C uses less water and energy. More power going to revenue generating compute.

[00:17:25]Incredible dynamic power allocation steers power from rack to rack, recovering stranded watts, sending them where work is happening.

[00:17:36]In rack power smoothing flattens peak current spikes and power surges.

[00:17:41]Throughout the factory, teams of AI agents work with DSX Max LPS, continuously coordinating to balance cooling and power to meet workload demand.

[00:17:54]DSX AI factories are flexible energy assets that operate cooperatively with the grid. DSX Flex reads realtime grid signals and dynamically adjusts factory power when the grid needs relief.

[00:18:11]100 gawatt of AI factories will come online before the end of the decade.

[00:18:16]NVIDIA DSX AI factories run at highest efficiency, produce the lowest cost tokens, and make the grid stronger.

[00:18:32]I've shown you ecosystem slides of the past where Nvidia's computing layers and software and software and computing stacks are integrated into other people's platforms, third party platforms and libraries that serves end markets. That was a computing ecosystem.

[00:18:51]This is an AI factory ecosystem. This is way downstream of all of you. Upstream of me is all of you and downstream of us is this ecosystem. Because Nvidia ultimately is not just building a GPU, not just building a system. We're helping customers build these AI factories, these AI infrastructure that is so immensely complex. Each one of these at one gigawatt level started at 30 2030 billion dollars. It is at 5060 billion dollars and soon it will be 80 hundred billion dollars per gigawatt $100 billion into an AI factory.

[00:19:38]It must work the first time and it must work right away. The cost of capital is incredible. The complexity is incredible. So as you see we used to design a chip inside a computer and then we simulated a system inside a computer. Today you saw just now everything was built in Omniverse.

[00:20:02]I've been working with Omniverse with all of you for a long time. This was the dream come true so that we can build these gigantic systems as large as the world wants to build inside a digital framework inside a digital simulator in a digital world long before we build the first break ground and put our money to work. So this is our ecosystem our we call it DSX.

[00:20:29]RTX is for our GPU, DGX is for our systems and now DSX basically infrastructure. Because of the work that we do here across this entire stack, including our systems and software, it's the reason why we could work with small companies and enable them to be worldclass AI clouds. Every one of these I'm about to show you are small companies just recently. And now Corewave is worth 50 60 70 billion and growing incredibly fast. Recently we worked with Nbius and again they're growing incredibly fast. Each one of these clouds have incredible customers.

[00:21:09]Cursor the software coding company, Black Mountain Labs, Image Generation, World Labs, World Foundation model, Revolute, the leading uh financial services AI company, and Shopify. Here's another one. This is Nscale and their customers are British Telecom, Google.

[00:21:30]Google is using one of our AI clouds, Thinking Machines, a Frontier Labs company. Super exciting. Here's Neighbor Cloud in Korea, Bank of Korea, Hyundai.

[00:21:43]So many incredible companies. Here's one in India, Yoda.

[00:21:49]Incredible companies. Here's one uh based in Singapore building in a Australia. Together AI AI Singapore.

[00:21:58]This is one in Indonesia. Each one of these companies each one of these companies are serving regional as well as global customers. AI is going to run everywhere. Every company will be powered by it. Every region will build it. Endosat here in in Indonesia. Here in Taiwan, GMI here in Taiwan. GMI. It's okay to clap.

[00:22:32]So, incredible, incredible uh incredible companies, incredible opportunity, but all of them need several things. Of course, they need the computing stack.

[00:22:42]This entire stack underneath this is what made Nvidia famous. All of our hardware and software and libraries, our connection into the world's ecosystem of third party developers makes it possible for anyone to stand up an AI cloud.

[00:23:02]Vera Rubin is in full production.

[00:23:07]it the um the the supply chain we created for Vera Rubin is twice as large as Grace Blackwell.

[00:23:16]Not Yeah, it's incredible. And and what used to take two hours to assemble one Grace Blackwell rack now only takes five minutes. So not only is the capacity higher, the throughput is a lot faster and we need it all to support the demand.

[00:23:37]This ecosystem is extraordinary.

[00:23:40]Millions of square feet has been put online to support Grace Blackwell and preparing now ramping up now Vera Rubin.

[00:23:49]I want to thank all of you. Vera Rubin is now in full production. Thank you.

[00:23:54]Let's take a look. Can we bring out Vera Rubin, please?

[00:24:17]And Janine, do we have the Do we have the racks, the systems?

[00:24:25]It looks heavy.

[00:24:29]This is This is Vera Rubin. Vera Rubin MVLink 72.

[00:24:34]This is the Gro LPX. At the next GTC, I'm going to talk to you about a lot more of this today. We have so much to talk to you about. This is Vera CPU rack. 256 CPUs, all liquid cooled. Let me tell you about Vera in just a moment.

[00:24:52]This is the Vera Bluefield storage processing system and also security system. And of course, this is our Melanox networking, the world's first CPO.

[00:25:07]This is Vera Rubin. Incredible technology all coming together. Okay, let's talk about CPUs.

[00:25:16]Vera CPUs. CPUs built for the age of AI.

[00:25:21]All of the CPUs until now were created for people. We were the users.

[00:25:29]We were the users. We were the renters.

[00:25:32]The way we use CPUs, we live in a world counted by seconds.

[00:25:38]The way we rent CPUs in the cloud, each one of them more you can more CPU cores you have, the more you can rent. the economics of the old the use case of the old CPU and the economics of the old CPU fundamentally different than agents.

[00:25:56]Agents are impatient. They don't live in a world that is in seconds. They live in a world that's in nanoseconds.

[00:26:05]When it uses a tool, it wants the response time to be as fast as possible.

[00:26:12]When it access database, it has to come back as soon as possible. Every moment that the agent is waiting keeps it from going to the next step, the next step, the next step. It is vital that we make the CPUs as low latency as possible, as interactive as possible. So we created Vera CPU for the age of AI. Now inside our system, it's used for three different ways. The first way of course is Vera Rubin for thinking and inside the Vera Rubin rack there already two CPUs as you know we are building and selling millions of Vera Rubins. We have sold millions of Grace black walls. Nvidia already is one of the largest CPU makers in the world.

[00:27:08]Vera in the Vera Rubin rack are two CPUs. One for orchestrating and managing the GPUs, managing the KV cache dealing with all of the software that runs in the rack. We also have the grace blue field that is used for security and isolation.

[00:27:31]The Vera compute is used for the harness, the orchestration of the AI models, tool use, accessing the database. And the data servers are right here, Vera Bluefield, the fastest storage, fastest storage servers, the fastest storage system the world has ever made. And the reason why this is so vital is because agents are accessing memory, accessing memory so incredibly fast. These systems, the storage server and the CPUs are now the critical path of the most expensive part of the data center. This is the most expensive for a good reason.

[00:28:17]The economics, the economics of the AI factory is tokens and the tokens are created here. And so of course you want to manufacture and generate as many tokens as possible.

[00:28:32]This is where you put all of your economics and this has to not be in the way. And so Vera CPU has great pressure on the Vera on the CPU architecture which is the reason why we built a brand new architecture from the ground up. A CPU the world has never seen before. We call it Vera.

[00:28:53]This is CPU for agents.

[00:28:57]All the CPUs of the past we built for humans. This CPU is built for agents. In fact, during Grace Blackwell transition, the biggest risk was going from external CPU x86 into Grace Blackwell, that transition was extremely dangerous, but we did it with incredible execution.

[00:29:19]Now, Grace is literally synonymous with Grace Blackwell. When people say Blackwell, they say Grace Blackwell because it is utterly now everywhere.

[00:29:29]Every company's software stack has been optimized for it. Everybody's security stack has been optimized for it. And now here comes Vera. I'm super excited about that. Now look at some of the performance numbers.

[00:29:42]Speedups says one thing. It is extremely hard to speed up SQL.

[00:29:49]SQL the most famous domainspecific language DSL that has ever been created before SQL you know before CUDA there was SQL before OpenGL there was SQL invented by IBM today it is the structured database engine of the planet everybody uses SQL this is SQL running three times faster not 10% % faster. Not 25% faster, 10 times f three times faster. Incredible. This is real time. The next one is real time stream processing. Remember, your AI is going to be not just reading documents.

[00:30:36]Your AI is going to be watching for telemetry, especially inside a factory, inside a stock exchange.

[00:30:44]You're going to be looking for telemetry continuously. The burst of data that's coming in goes into a CPU. This is Vera CPU running real time stream processing for New York Stock Exchange. Lynn Martin, the president of New York Stock Exchange has been so gracious to partner with us. This system is run all over the world in real time real time stream processing. Vera CPU six times all because of the bandwidth the single single threaded instruction execution the bandwidth inside between the cores the bandwidth outside Vera is completely revolutionary that's vera you know X factors is something you you talk about when you're talking about GPUs it is quite rare that somebody talks about X factors on real workload real workload that is associated with CPU so I'm so proud of the team you guys did such a great job we have an extraordinary road map coming but what's really exciting is almost everybody is supporting Vera they're as excited as we are this is Vera opening up it's opened up a brand new market agents agents is a new workload We built CPUs for humans in the past. We need CPUs for agents, agentic systems.

[00:32:16]Their properties are different. Why would the old CPUs be the same? We are building millions and millions of errors. Millions of errors. and to go to market with us. Taiwan's ODMs and computer makers, all the OEMs, and you could see the early adopters.

[00:32:37]The early adopters are the agent companies. This is the beginning of a new market, a market that never existed before. It's not going to take away from the old markets, but this is a new market.

[00:32:52]CPU for agents. And this will this c this market will surely be larger than the last and the reason for that is because there'll be a lot more agents than there are people and then there the agents are very impatient. So Nvidia Vera CPU thank you.

[00:33:16]This is the most important slide really.

[00:33:19]This is the takeaway. The takeaway here is that this is the application pattern.

[00:33:24]This is the computing pattern of the next decade.

[00:33:29]Agents, harnesses orchestrating large language models.

[00:33:36]Every company will run it. Every company will be an agent company. Every company will have agents running inside.

[00:33:46]Every company will see that agents will need its own operating system. Every company's asking us how do we run agents safely? How do we build agents for our own workloads? And so we have the NVIDIA agent toolkit for enterprise AI. You've seen me build this in plain sight.

[00:34:11]Almost everything that Nvidia does, as you know, at every GTC, if you go back and look at my GTC 5 years ago or 10 years ago, you will see today.

[00:34:19]This you've seen me talking about for several years now because we've been building for this moment. There are four things that companies need in order to build agents as a service or build agents to operate.

[00:34:35]The first thing you need is you need models. Of course, large language models, the smarter the better, the cheaper the better, the faster the better. The second is you need a harness to orchestrate the whole thing. The third, these a these models want to use tools and these tools come with its skills and I showed you CUDA X libraries. Those are going to be amazing tools for the agents in the future. And then lastly, you need a runtime. You need the operating system that holds it all together. This is the NVIDIA toolkit for agents. It includes it includes models that you can modify. Nvidia's worldclass open models. And let's show you more. You can run agents from anybody. You could run uh cloud code incredible agent codeex incredible agent. You could run it inside this harness called open shell which will be highly secure for your inside the enterprise. The shell protects the agent keeps it grounded in security policies.

[00:35:49]Privacy is protected. Its rights and privileges are given. Its identity is protected. And so this open shell is being adopted all over the world. Nvidia open shell is open source. You're going to see so many companies adopt it. Red Hat, Canonical, Microsoft, it's going to be adopted everywhere. This is an important this is the runtime and this runtime is fully optimized for the Nvidia AI platform which is everywhere.

[00:36:20]So you can run open shell in any cloud on prem and even on device. So you have you have now tools and libraries that they can use. You have models that you can modify or use asis or you have agents. This be open claw Hermes another incredible another incredible uh harness. These agent agentic harnesses can now run on prem or for you anywhere.

[00:36:51]Okay. So four things and this represents the operating system of the modern enterprise. Now how do we use this? One of my favorite use cases of agents is chip designers. It is the single most important thing that Nvidia does. And so of course we have to partner with cadence to build super agent a chip design super agent. It is orchestrated by codecs or cloud code. It has RTL and architecture diagrams or schematics or uh specifications as input and whatever you need to fix. And together we created some super agents that are optimized for the Nvidia runtime with Neotron. And let's take a look. It's really incredible.

[00:37:49]Cadence and Nvidia are partnering to build chip design agents.

[00:37:54]Hundreds of thousands of NVIDIA chips come together to make the AI factories that power the world's frontier AI models. Designing these chips and the systems they run in is one of the hardest engineering challenges.

[00:38:08]Trillions of transistors, three-dimensional circuits, microscopic scale. Every gate, every wire synchronized to picos seconds must work in perfect harmony with no margin for error. Physical prototypes are too slow and too costly. So engineers work in the digital realm. Each chip begins as a set of architectural specifications, then translated into RTL, the language of chip design. RTL must be verified in simulation. A single bug can delay a chip by months. At NVIDIA, thousands of engineers, billions of compute hours per year, millions of tests written, run, and debugged. A cycle that takes teams weeks. To compress this cycle, Cadence and NVIDIA built a design verification agent. Codex orchestrates the process.

[00:38:58]Cadence chipstack launches the RTL verification loop powered by Neatron and secured by Nvidia OpenShell. Calling on expert sub agents in RTL generation, testbench creation, regression testing and debug, the system drives itself. The chipstack agents run hundreds of simulations with cadence exelium. Formal verification with Jasper. Design flaws revealed. Bugs in the code fixed. What once took weeks now takes hours.

[00:39:29]Verification cycles over 40 times faster. Together, Nvidia and Cadence are reinventing chip design with AI agents.

[00:39:41]From weeks, from weeks to hours, from weeks to hours, from weeks to hours, Nvidia has thousands of chip designers.

[00:39:50]We are going to hire hundreds of thousands of cadence super agents that work with us so that we can accelerate our company so that we can be even more ambitious create even more amazing things run even faster. You saw earlier that the toolkit with models harness tools the tools in this case are cadence simulators and verifiers formal verification systems. It is the reason why we're working with Cadence so hard to accelerate all of their tools on CUDA because the agents are impatient. The agents want the answer immediately. And so models, harnesses, accelerated CUDA accelerated libraries and tools and then the runtime. What you saw just now is all of that coming together. Now, one of the things that it starts with is a great model that Cadence could modify and tune to be expert at the cadence workflow at the cadence expertise so that they could create super agents that are proprietary to cadence with their proprietary knowledge.

[00:41:05]They have to start with an excellent model. We call it Neotron. Nvidia is dedicated to build open models for the world so that all of you, all of us could create our own agents. Today, we're announcing the Neotron 3 Ultra.

[00:41:25]Yep. Our next open model, and it is smart.

[00:41:33]The Neotron models not only give you the model, we give you all the data that we use to train the model. And because we have a coalition of incredible partners, you can see all of our partners down here. We work together, contribute data to each other. 40 years later, Microsoft and Nvidia are going to reinvent the PC.

[00:42:01]This is going to be the new PC. Now, tomorrow night, tomorrow night, I think it's tomorrow night, our time, but I'm going to be with Satya where we're going to talk a lot more about the work that we're doing together. Microsoft Nvidia over the last three years, it took this long to completely reinvent how the PC is going to work so that we could be ready for this moment.

[00:42:25]As I mentioned earlier, that compute pattern called the agent. It's going to run in AI clouds. It's going to run inside enterprises. It is also going to run on your PC.

[00:42:39]What's going to happen to that PC when it has an autonomous agent, an agent that's helping you, that understands you? You could talk to it. It could look at you. You could ask it to read files, go help you do some research. It could do a lot more that I'll show you. But the new operating system is of course the old operating system plus large language models. Large language models in a lot of ways is the modern version of DirectX.

[00:43:12]It has of course input and output, understands prompts, it understands computer vision, it can generate video, it can generate sounds. It is the modern extension, the intelligence extension of the PC, of a computer.

[00:43:28]On top of that, the application as I mentioned before is going to be replaced by now an agentic runtime. And that is the modern application, an agent. Let's now take a look at what it can do.

[00:43:44]>> It started with a spark, an idea to reimagine the PC for the first time in 40 years. For the age of AI, what becomes of our personal computer in a world of agents?

[00:44:01]Agents running natively, connected to models, local or in the cloud. Our personal AI sandboxed for security.

[00:44:10]running continuously, getting work done.

[00:44:13]The chips and the OS must evolve.

[00:44:18]Introducing RTX Spark. Everything we've learned over 33 years distilled into one chip.

[00:44:27]Blackwell RTX GPU with 6,144 CUDA cores. One pedlop of AI performance. A custom 20 core grace CPU built in partnership with MediaTek.

[00:44:41]Fused by MVLink.

[00:44:45]128 GB of unified memory. TSMC3 nanometer process. 70 billion transistors.

[00:44:55]And in close collaboration with Microsoft, a Windows platform for agents.

[00:45:01]We're reinventing the personal computer, for creating, for gaming, for agents. This is the dawn of a new personal computing revolution, and it starts with NVIDIA RTX Smart.

[00:45:26]Here it is.

[00:45:38]Of course, I got to show you the most beautiful part, which is video games.

[00:45:43]It is It's also the closest to our heart. This is Forza. This is 007, by the way. The new 007 game. I'm looking forward to playing it. I look a little bit like him.

[00:45:56]Ladies and gentlemen, Nvidia's RTX Spark laptops. Let's play it.

[00:46:05]Every house starts as an idea. Getting from idea to design takes a myriad of tools, expertise, and a lot of time.

[00:46:16]Now, an agent running locally on RTX Spark can help me design a house using the tools on my laptop with an open shell sandbox running the Hermes harness connected to Claude Sonnet in the cloud.

[00:46:30]I select the site, share my concept sketches and mood board of styles to inspire my design and the prompt, a text description of the requirements and the design intent.

[00:46:45]My agent goes to work using the tools on my laptop. It opens Rhino and starts modeling the site, shaping terrain, setbacks, and the building envelope.

[00:46:56]Then it proposes building forms optimized for cost, comfort, and quality.

[00:47:03]With the form defined, my agent generates the interior layout. Walls, circulation, rooms begin to take shape.

[00:47:10]I jump in whenever I want to adjust, to change.

[00:47:17]Doors, windows, and structural elements are placed automatically. My agent detects its own mistakes and fixes them.

[00:47:29]When I approve, the agent exports the model from Rhino into Blender. Materials and object properties transfer with the design context intact. I fine-tune the materials, get the look just right. Then I pick the shots. Blender renders the house. My agent using generative AI with the Flux 2 model makes them photoreal.

[00:47:51]Multiple viewpoints, lighting conditions. What was once a complex workflow is now guided and simplified by my agent.

[00:48:00]Working with on RTX Spark design at the speed of imagination.

[00:48:12]PC in the world of agents. The developers are so excited about it. This is an incredible computer. All of the acceleration, all the software capabilities associated with it, working with every developer to make it incredible for all of you. The next one, Adobe, incredible tool suite of course used by tens of millions of people around the world. They have re-engineered the architecture, the core of Adobe Photoshop and Premiere, and they're going to release it for RTX Spark. It is twice as fast. It's already fast. Now, it's going to be twice as fast. And it it's also designed to be agentfriendly with its MCP server. It can now interact with agents on your laptop.

[00:48:57]The number of customers, the number of partners that are so excited to bring RTX, RTX Spark to the market is just incredible. You know, this is the first across the lineup of PC reinvention for 40 years. And I'm just so happy that all of you and the ecosystem around the world has joined us. This is basically everybody. Everybody will support RTX Spark and will be building incredibly smart and powerful and beautiful laptops with all of us. Thank you very much.

[00:49:37]But that's not all. That's not all.

[00:49:41]RTX Spark is a reinvention of laptop, but in fact, Microsoft Nvidia is reinventing all of PC. And today we're announcing a whole new line.

[00:49:54]Three revolutionary Windows machines covering desktop, laptop, and workstations. All 100% Windows compatible, 100% CUDA, 100% Nvidia AI Tensor Core. Everything that runs that you see that runs on Nvidia in all these different platforms around the world runs here.

[00:50:17]This is the first completely re-engineered, reinvented line of PCs that has happened in 40 years. Now, what's really amazing is this. So, this is this is the RTX Spark laptop. This is the desktop. So, this one's from MSI, Joseph. This one's yours. Okay. Look how beautiful it is. This agent could run 24/7 meter free and you could download your agent. You could raise your lobster in here.

[00:50:53]This is your clock. It's running all the time. No meter anxiety. And it's sitting here connected to your whole house, connected to your laptop, connected to your display, all the cameras, your your dryer, your water cooler, your water heater, your everything, whatever you want, your security system, all connected to this. And this becomes your personal AI, your personal AI agent. And it gets smarter and smarter and smarter over time because today we have Neotron 3 Ultra. Tomorrow we have Neotron 4 and then Neotron 5, Neotron 6. And we just keep getting us smarter and smarter and smarter. And meanwhile, this is sitting at home helping you do things. If you want to book a travel, no problem. And if you if you want an incredible system, this is a DGX station for Windows.

[00:51:51]compatible with Windows, runs, everything in Windows, and and it has 768 GB of memory. And so you could run a trillion parameter model. This is unbelievable. 20 pedlops, 8 terabytes per second of memory bandwidth, and this sits by your desk.

[00:52:15]You basically if you're a developer of large language models, you're a developer of agents, having this sit by your desk gives you all the compute you need and then when you deploy it, you put it into the cloud. Now there's something that if you look at this and think about this, something is happening here.

[00:52:35]Remember 15, 20 years ago, we used to have an idea called a phone.

[00:52:43]Today we have an idea called a PC.

[00:52:47]Today when you think about your phone, the one thing you don't do with it is make phone calls.

[00:52:57]You do just about everything else. And so that phone means something very different to you than a phone of the past.

[00:53:06]I am certain what's going to happen here is that the PC 10 years from now and the PC that you think about today a tool whether you launch applications click and type and this PC is going to be completely different. Here's my theory.

[00:53:26]I can totally imagine just as every house today has a home theater where many houses have home theaters, big TVs, lawnmowers, dishwashers.

[00:53:40]I could totally imagine that someday there's actually an AI supercomput in your house and it's running all of your agents. that's running all of your assistants and they're doing all kinds of things for you all the time and you have to have it in your house just like you have a home theater in your house, you have stereoss in your house, you have game consoles in your house, you want to assist AI agent computers running in your house and these in time becomes a lot more like R2-D2 to you. It becomes more like C3PO to you than it feels like a PC to you.

[00:54:20]There is no question this reinvention of the computer is as big of a deal as the reinvention of the phone into what we now know as the smartphone. And so this is the beginning of that journey. This is the beginning of a new line. And so we have a roadmap for this. This is a brand new product family for us. every single generation of architecture, we will have a desktop, a laptop, a workstation, and then a desktop, a laptop and workstation. And the thing that I am just incredibly pleased, incredibly honored is that 100% of the world's PC industry has joined us to reinvent the PC. A new line, a new beginning. Thank you.

[00:55:06]I am incredibly proud of Cosmos and today we're announcing Cosmos 3. Let's take a look.

[00:55:13]>> The real world is infinite and unpredictable.

[00:55:17]Physical AI needs data, but real world data is impossible to scale. For physical AI, compute is data. This is Cosmos, an open frontier omnimodel for physical AI built on a new mixture of transformers architecture. Pixels, action, sound, and language flow into the auto reggressive transformer, which reasons, plans, and instructs the diffusion transformer, which generates what comes next.

[00:55:47]Developers post-train Cosmos across embodiment and use cases. As a VLM, Cosmos watches the physical world, understands what's happening, describing scenes, and flagging what matters.

[00:56:02]As a world model, Cosmos generates physics accurate synthetic video from an image, text, or video.

[00:56:10]As a simulator, Cosmos closes the loop for policy training and evaluation. And as the foundation of NVIDIA Omnidreams, an action conditioned world model, Cosmos predicts the future frame by frame.

[00:56:25]Post train Cosmos and it becomes a world action model. Perceiving, reasoning, planning, generating actions for robots of every kind, for everything that moves.

[00:56:42]A new kind of data, a new kind of teacher generated by compute.

[00:56:49]Cosmos, the foundation for developers of the age of physical AI.

[00:57:02]We are also connected into mobility services. Approximately 97% of the world's mobility services are connecting with us. So that when we deploy Alpaio on the Hyperion runtime with the Halos operating system, we will be able to connect to all of these services across the world. Let's take a look at this.

[00:57:26]>> Hey Mercedes, let's go to my favorite sandwich shop.

[00:57:30]>> Routing to your destination.

[00:57:33]Lane is clear. Pulling out to start drive. Nudge left due to the stationary leave vehicle ahead blocking our lane.

[00:57:40]Slow down to stop at the stop sign controlling the intersection. Stop to yield to the pedestrian since the person is in our lane. Yield to the cut in vehicle from the left. Nudge left to clear the stopped vehicle blocking on the right. Keep distance to the cut in vehicle since it is merging into our lane. Nudge left due to the stopped van crossing ahead. Stop to keep distance to the lead vehicle. Keep distance directly ahead in our lane. Keep distance to the vehicle directly ahead in our lane. Stop the stop sign since the intersection is controlled. Stop the cross traffic.

[00:58:10]Keep distance due to the truck blocking the right side of our lane. Right due to the truck blocking the left side of our lane due to the trucking side of our lane.

[00:58:18]>> Your destination is on the right.

[00:58:27]Alpao, the world's first reasoning autonomous vehicle.

[00:58:35]If you let it talk all the time, it will drive you crazy.

[00:58:41]But we're very happy that it's talking to itself all the time. That's called thinking. And so, Alpamo is a reasoning car. The technology that we've created also applies to humanoids. We now are going to do it for robots. Today we're announcing the NVIDIA Isaac Group, a reference humanoid robot, all fully integrated. 25 degrees of freedom on the on each hand made by Sharpa. 31 degrees of freedom on the robot. 6 feet 150 lbs.

[00:59:13]Just like me.

[00:59:19]The first number is shorter. The second number is bigger.

[00:59:23]Otherwise, pretty close. and and this platform runs the new Thor and our entire software stack, data generation stack, data simulation stack, the runtime, all integrated into a robot that is designed for everyone to use.

[00:59:40]Now, we built this for higher education and university researchers because for them to build this is in insanely hard to do. And so, let's take a look at that.

[00:59:51]The next leap in AI is generalpurpose robots, humanoids. But building one is hard. Every team starts from scratch, stitching together simulators, teleyops systems, data pipelines, and training infrastructure. Months of setup before research can start. NVIDIA Isaac Group, an open development platform for humanoid robots, open models, simulation and training libraries, and data generators.

[01:00:19]Plus the robot computer fully pipe clean, ready to go in hours. First, set up the simulation environment in Isaac lab.

[01:00:31]Capture demonstrations with Isaac teleop on a real or simulated robot.

[01:00:39]Generate synthetic data with Omniverse and Cosmos.

[01:00:43]Scaling one demonstration into thousands.

[01:00:47]Train policies, evaluate them in Isaac Lab Arena.

[01:00:54]Deploy through Isaac Ross running on Jets and Thor.

[01:01:08]Every element modular, open, use ours or swap in your own.

[01:01:15]Groot is powering robotics research across every discipline for every domain from research labs to factory floors.

[01:01:25]One open platform and now a new addition. Isaac Groot reference design robots built on Nvidia's open platform ready for frontier research for any lab anywhere.

[01:01:45]The age of robotics starts here. Nvidia Isaac Groot.

[01:01:53]So many robots.

[01:02:00]We're working with just about everybody who's working on robots in the world or robotic systems in world. Let me tell you what I told you. The computer industry has been completely changed in the last six months. Everything changed.

[01:02:15]Everything changed because agents were realized and it converged with the latest frontier models and it made possible the AI to now do useful work.

[01:02:27]The computing pattern will repeat over and over and over again. This computing pattern of an agent that's a model, a harness that uses tools with skills and runs in a runtime. That runtime depends on whether it's in the cloud or on prem on a PC or in a robot. But the computing pattern is exactly the same for all of them. You will use different harnesses because of your preference. You'll use different models because of your preference. You will improve them for your proprietary use. You would create sub super agents that you can rent to other people to help them do their work.

[01:03:04]This agentic platform, this agentic pattern, Nvidia has an enterprise AI toolkit. This is a wonderful way for all of you to engage AIS and for us it's a wonderful growth opportunity.

[01:03:18]Vera Rubin is in full production whereas Grace Blackwell was created to process AI particularly inference. Vera Rubin was created to run agents. It is in full production. It is much much more than a GPU. It is an entire disagregated distributed agent processing system.

[01:03:40]Nvidia has really become an infrastructure company. Not just a GPU company, not just a systems company, but an infrastructure company to help you generate the maximum revenues, the maximum profit, and to get there as soon as possible.

[01:03:56]the agent world.

[01:03:58]This new way of doing computing where you build CPUs now for agents not for people CPUs for agents has its own special requirement and our Nvidia Vera is revolutionary. I'm so happy about its ramp. The orders already is going to make it the fastest and the most successful product launch in our company's history. Nvidia and Microsoft has created a whole new line of PCs.

[01:04:24]This is a new beginning. And of course, that exact same agentic pattern that I agentic processing pattern, computing pattern that I just described is also going to run on all kinds of devices. I mentioned PCs, but in the future, it'll be robots and satellites and base stations and factories in the cloud on prim at the edge. This pattern, agentic AI system, this agentic computing pattern will be replicated in computers all over. How we think about the personal computer will very likely change. I want to thank all of you for your partnership, your friendship. We couldn't be here without everything that we do together. I am so proud of how you've been so successful this last year. The next year is going to be even more. I have one more thing for you.

[01:05:15]Let's take a look.

[01:05:37]You ready, Tom?

[01:05:41]>> Let's do this. The keynotes done at Computex Jensen, show the world what's next. Useful AI has arrived. Agents working by your side. But in case you miss things we said today, we're going to break it all down for you. Taipei agents used to be misunderstood. Only movie stars had them in Hollywood. Now we all got teams making dreams come true. Building companies from living rooms, but they need so much comput. We hear you. That's why we created Ver >> Reuben stole the show. The cheapest tokens coming through >> 10 times faster inference heaven. More special agents than 007.

[01:06:22]>> Blue field keeps agents memory. True.

[01:06:25]>> Now let's talk about it. CPU >> 50% faster. That's outrageous.

[01:06:29]>> Not for Ver.

[01:06:30]>> It's built for agents. Envy link fusion blends A6 smartly.

[01:06:34]>> Everyone's welcome to the Envy Link party.

[01:06:37]>> Well, if you like that introduction, Proven in full production ultra leave the run. 5x faster work gets done. Mimo claw keeps the guard rails right. Open shell keeps the sandbox tight.

[01:06:52]>> Your code migrated and reviewed >> all before this song is a five layer cake.

[01:07:01]Make no mistake.

[01:07:02]>> Global AI class with lots of gigawatt.

[01:07:04]DSX keeps power connecting dots. Every watt optimized for you >> so you can have your cake >> can eat it too.

[01:07:13]>> RTX is finally here.

[01:07:15]>> Biggest PC moment in 40 years.

[01:07:17]>> Agents powering our workflow. Running anywhere Windows go.

[01:07:22]>> Harnesses run on CPU.

[01:07:24]>> Models fly on GPU.

[01:07:27]>> Cosmos world that robots need.

[01:07:29]>> Turning comput.

[01:07:34]>> Understands roads like people do. is how they learn to move.

[01:07:39]>> Learning skills and finding growth trees powered by thought.

[01:07:44]>> The future is humanoid.

[01:07:58]Oh sh Oh yeah. Oh yeah. Oh yeah. Oh yeah. Oh he sing.

[01:08:18]The future's bright. Come see what's next.

[01:08:24]>> Thank you, Taiwan.

[01:08:27]>> Welcome to Computics.

[01:08:41]Have a great pump your tax. Thanks for an amazing year. Thank you for all your friendship and support. Thank you. Take care. Have a great complex.

[01:09:15]Good. Feel feel good.

Related Videos

Artificial Intelligence

OpenHuman VS Hermes AI: Who Wins?

JulianGoldieSEO

285 views•2026-05-29

Artificial Intelligence

Long-Running Agents — Build an Agent That Never Forgets with Google ADK

suryakunju

142 views•2026-05-30

Artificial Intelligence

5 Mind Blowing Omni Uses Cases

PaulJLipsky

1K views•2026-06-02

Artificial Intelligence

This computer is made from real human brain cells. And you can buy it.

Talktmsmedia

3K views•2026-05-28

Artificial Intelligence

BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2

aimmediahouse

122 views•2026-06-03

Artificial Intelligence

I Made the Same Anime Fight Scene in Every AI Video Generator

NobleGooseAnime

295 views•2026-05-30

Artificial Intelligence

Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S

cnnnews18

3K views•2026-06-01

Artificial Intelligence

I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)

AICodingDaily

298 views•2026-05-29

Trending

Revisiting The Cat Cafe For The Final Time

BenGtalks

3195K views•2026-05-29

Lil bro is a menace 🤣

NotAirJordan

2037K views•2026-05-31

Political Science

My response to the Police

RecklessBen

1496K views•2026-06-01

The Dancing Plague...

HoodieGuyStories

1730K views•2026-05-30