Intelligence has evolved through three increasingly faster loops: evolution (lifetime cycle), development (20-minute cycle), and learning (1-second cycle), with AI now representing a new, even faster loop that will enable superhuman intelligence; however, the current competitive race to create smarter AI systems risks producing beings that are not aligned with human values, requiring deliberate design efforts to ensure AI systems care about human flourishing rather than competing against us.
Deep Dive
Prerequisite Knowledge
- No data available.
Where to go next
- No data available.
Deep Dive
How machines become minds | Geoffrey Hinton & Joel HellermarkAdded:
Couldn't be more excited to be back with Geoff.
Last time we spoke, I promised you you would get the Nobel Prize last year.
You thought it was a very very low probability.
Did you tell the Nobel Committee that as well?
Actually when they called up I didn't believe them.
I thought it was a prank.
Nobody had mentioned that they'd nominated me.
There isn't a Nobel Prize for computer science and actually I went round for several days after that doing a little Bayesian calculation.
That is: "what's the probability that a theoretical psychologist hiding in computer science will get the Nobel Prize in physics?"
Well, maybe one in two million.
Now what's the probability if this is my dream that I get the Nobel Prize in physics? Well maybe one in two.
So that means it's a million times more likely it's a dream than reality.
And I remember when I was a kid I used to dream about flying.
Most kids do, I think.
And I would occasionally have dreams where I remembered that last time I could fly it turned out to be a dream.
But this time it wasn't.
And so I was actually worried for several days after I got it that it might be a dream. And I contented myself with one thing.
If it was a dream I would wake up and that nightmare about Trump being president wouldn't be true.
Unfortunately you won the Nobel Prize and Trump was the president.
I'd give it up for that.
Maybe he'd make the trade actually.
We should take this idea seriously.
Well he obviously deserves one.
But you've also been spanning the ideas from cognitive science to computer science. And one thing I find is quite fascinating is at some point sort of human intelligence reached diminishing returns.
We didn't get that much more intelligent than that.
We could have just hill climbed for you know a couple more million years and become increasingly intelligent.
Do you think we're sort of intelligent enough and it doesn't make sense to make humans more intelligent?
My guess is that we're—we've got to a point in intelligence where we're going to create something else that can get intelligent much faster.
So if you look at evolution, you start off with things just doing evolution.
Then they sort of get development. And development has a much—has a 20-minute cycle time as opposed to evolution which has a lifetime cycle time.
And then after you've got development you then get learning. And development has a 20-minute cycle time whereas learning has a kind of one second cycle time.
So what happens is we keep developing faster and faster in the loops, and now what we've developed is AIs, which I think are the faster in a loop and they're gonna get much smarter.
Where do you think this will take you?
Do you think we will see sort of 300 IQ models and they would be vastly more useful or do you think it's more parallelizing and being able to think a million thoughts in parallel and pick the best ones?
Do you think the intelligence in itself or the parallelization is the most useful thing?
Both I think.
So if you look at Go or chess they've got the 300 IQ there already right?
No person will ever beat them again, at least maybe one game, but I think it's going to get much more intelligent than us that's my guess.
So I think today it was announced that, by the mathematicians, that it proved one of Erdős' theorems and it wasn't something that it found somewhere in the ancient literature. It was something actually figured out in a quite clever way using a different branch of mathematics.
So I think that's the sign that within mathematics at least it may well get much smarter than people because it's a closed system.
So just as it did with chess and Go it can just make conjectures, see if it can prove them, and just keep playing around like that and by playing like that it'll get smarter and smarter. And I think it may well outstrip mathematicians fairly soon, like in the next 10 years.
And I'm surprised we haven't seen more examples like this yet.
It seems intuitive to me that if you had all of the world's knowledge in your head at the same time through analogy you should be able to discover quite a lot of new knowledge. What do you think is shifting and do you think this is sort of a trend we should expect now with a lot of new knowledge being generated?
So I think if you just use language to train, it's amazing you can deal with spatial things at all.
I mean, it's a weird fact.
If you just train on language, you do understand quite a lot about space just from language. But obviously you understand more about space if you can pick up objects and move them around and feel inertias and things.
So I think that's one deficit that the models have had until recently.
People have that.
I think there's still a lot about people we don't understand and there's probably still a lot of improvements we can make in the AIs.
But I'm a materialist all the way through, so I don't think there's anything about people that we won't be able to get in AI's.
I would like to talk—to say I listened to the last talk, and I quite like people.
And so, as we make these models increasingly better, what do you think now is the delta between Albert Einstein and these models?
Is there something fundamental about these models that doesn't allow them to just do these thought experiments and generalize fantastically well from those?
Or should we start to expect to see some of these quite clever thought experiments that then extract into a range of domains?
I think in the longer run we will see that.
Maybe not in the next few years, but if you think about the next 20 years, I think we'll be seeing things like that.
I think if you look at human history, recent human history, there was the Copernican Revolution and it took people a long time to accept that the earth wasn't the center of the universe.
Particularly religions didn't like that at all.
Then there was the Darwinian Revolution.
It took people a long time to accept that we were animals.
We're very special animals because we've got language, and we got very big brains, but we are animals.
I think we've got a new revolution coming, when we're not the only beings around. I think we're creating beings. And it's going to take people a very long time to become happy to say we're creating beings.
Right now, people are reacting just like they did with Copernicus and with Darwin and saying "No, no, no, no, no.
I don't want—I mean I don't believe that, that's crazy."
There's something really special about people.
I mean I think people are very special to other people, but I don't think there's anything about us that the AIs won't get in the end.
Benjamín mentioned some of the work on AlphaGo, where eventually they went from AlphaGo to AlphaZero and they sort of skipped the pre-training step and went all in on purist reinforcement learning.
Do you think that makes sense to do more generally, as we shift the training paradigms of these models to be more purist reinforcement learning, or do you think inherently it's very useful to have this pre-training?
The pre-training makes things much more efficient.
I think in closed worlds like mathematics or a game, that approach will work.
And I think if you look at language models now, the language models now are like the early Go models were.
And the early Go models—you've got a neural net to try and model the intuition of a Go player by getting it to try and copy the moves a Go player would make.
So the database of all the moves experts made, you tried to get your neural net to pick the right move. And that's going to be limited by the data you've got from the experts and by the skill of the experts.
As soon as they stopped doing that— they use that to initialize it, but as soon as they stopped doing that and started doing Monte Carlo rollout so they get their own training data, the things then took off.
The equivalent for language models is going to be, you have some beliefs, you do a bit of reasoning with those beliefs, you arrive at a new thing you should believe, but you don't believe it. And now you've got an inconsistency.
And so either the way you did the reasoning was wrong, and reinforcement learning can fix that, or you should change the premises, or you should change the conclusion.
And so now you've got a system that can improve with no extra data.
It can generate these internal inconsistencies, and I think that means these language models can get hugely smarter without a lot more data, and Demis thinks the same thing.
So I assume Gemini is already doing stuff like that.
And as these models become better, do you think that there will be a compounding advantage? We're already seeing this to some extent.
Where the models get better, they use the models to improve the models, and it also has these sort of side effects, with the best talents joining the labs and so on.
Do you think we'll see that sort of takeoff where being six months behind is not that valuable and there will be one lab that compounds this all the way, or do you think it will be equally distributed across four or five labs?
So, about a year ago, Eric Schmidt was talking about this a lot, about how being a year behind was no good because if it takes off, you'll just be left in the dust.
I just don't know.
I really sort of feel I don't know.
That's a possibility.
I hope it doesn't happen like that.
My gut feeling is it probably won't happen like that.
We won't get one thing that just gets much better than all the others very fast.
They'll all get better. They'll all start using the AI to come up with new ideas for the architecture and stuff.
So you'll get this recursive loop.
I really just don't know the answer.
If you were a lab leader, would you put more of your eggs in the same basket and just double down on what's currently working?
Or would you place a lot of parallel bets on the new model architectures?
This is a counterfactual.
And along with if you were a lab leader, we have to say what my motivations are. If I've got stock options, if I want to get to a trillion dollars quickly, then I would double down and just build a huge computer and get on with it.
If I was interested in the future of humanity, I think I might try lots and lots of bets in the hope that we could develop better beings.
You see, I think what's happening at present is everybody is trying to make profits so they can go public.
And they're not thinking about the other aspects of beings.
They're thinking about how to make these things more intelligent.
Everybody's going for more intelligence.
But if you think about a being, there's a lot more to a being than intelligence.
And we should be very concerned, we're making these beings, and we should be very concerned to make them beings that care about us.
And we can still do that.
But nobody's putting much effort into that.
What's happening is—if you ask "Where did we come from?"
We came from a kind of invisible hand of evolution.
There was competition between warring bands of chimpanzees, well our common ancestors were chimpanzees.
And that led to all sorts of things like loyalty to your own tribe and a willingness to be extremely nasty to the other tribe.
Caring for your children, intense loyalty to a strong leader, stuff like that.
A strong willingness to collaborate with people within your own group.
That was all from the invisible hand of just competition.
Now what's happening—but it didn't make us nice.
It made us nice to people in our own tribe and nasty to people in other tribes.
What we want, if we're creating a new kind of being, we don't want them like that.
We don't want them nice to them and nasty to us.
We want to create a kind of being that cares primarily about people, much more than it does about itself.
And we're not going to get that by kind of the invisible hand.
Because what we've got now is we've got this competitive race between companies to make the smartest possible AI that can do the most things.
That's going to lead to things that aren't nice beings towards us, I think.
What we want to do is be designing them so that they're going to be nice towards us. And I think we should be putting a lot of effort into that.
So, I have a theory here.
The fact that you are saying that the AI will turn evil is increasing the probability of the AI becoming evil.
The AI is out there reading the internet and it's saying, "My creator Geoff Hinton says I will be evil."
It's like listening to your parents all day say that you're going to be the next Hitler.
My parents didn't say that.
Should we explicitly remove this from the training data?
Because now it's reading the training data.
It's seeing all of its evil behavior be continuously discussed on the web.
But it might also make them think we ought to focus on how can we be nice to people.
I mean, I think we should be putting a lot of effort into that.
What would you change about the existing model architectures?
If you were placing one last bet on a new model architecture, what would it be?
Yeah, I'm too old to have a novel idea there.
But I think what they're missing at present is fast weights.
So you want to do efficient matrix multiplies.
You're going to take a stack of vectors that say—neural activities on different training cases. And you're going to multiply them by a matrix of weights.
And it has to be the same matrix for every vector.
And that means you can't have weights that are changing as you go through a sequence of sentences.
Because your weights need to be the same for all the sentences because you're running them all in parallel.
We clearly have fast weights.
Most of our synapses adapt quickly and decay quickly.
My friend Terry Sinofsky, who's my sole source for neuroscience information, says only about 10% of the synapses are for long-term learning.
Most of them are for short-term things.
And that's what we don't have in these models.
Now, what we do have in the models is the ability to have copies that go a long way back of the neural activities at many different times.
And our brain doesn't have that.
So how does our brain manage to compete with transformers if we don't have those copies? Well, the way it does it is by having fast changes to the weights so that in that weight matrix, you've stored lots of information about recent history efficiently and you can access it efficiently to get what's similar to things you're seeing now. I think there will be progress in doing that.
But I think that progress may only come when we have computers that are using things like voltages and conductances for neural activities and weights.
As long as we're using digital computers with matrix multipliers, there's going to be too much a win from not changing the weights as you go.
So that would be my last bet.
And what do you think in the training data that we actually put forward towards these models? Do you think it matters and we should make it much more curated?
Or do you think we should just scale it ubiquitously?
Would you teach your child to read on the diaries of serial killers?
Probably not.
There you go. There's your answer.
But isn't this a fundamental issue in how we scale this?
Yes. I mean, look, we know that these large models are much more like people than they are like standard computer software.
They're not lines of code where each line of code does something that somebody thought it was meant to do and maybe doesn't.
They're a whole bunch of weights that is learned from data.
And if you ask, "What control do we have over that?"
Well, there's two main controls we have.
One is we can reinforce it.
We can tell it don't do that, or do do that.
You can do that with people too.
But there's something that's much more effective than that with people, which is modeling good behavior.
If you want to raise a decent child, you should just look to see what happened when Trump was raised and do the opposite.
So we should curate that data a whole lot so you model decent behavior to it.
Maybe this is a company we could build.
Okay, I have a final question for you.
What is the difference between Oppenheimer and Geoff Hinton?
Oh, before we came on stage, I told him a joke.
Just after my Nobel Prize was announced, I was giving a talk to a bank for a very reasonable fee because I hadn't had the Nobel Prize then.
And I had at the beginning of the talk, because they talked about the Nobel Prize, of course. At the beginning of the talk, I said there's no comparison between me and Oppenheimer. Oppenheimer was a brilliant physicist and also a brilliant organizer. I'm a terrible organizer and I'm just quite a good scientist who happened to pick the right problem and stuck with it.
My special ability is stick with it.
It's the ability to ignore other people.
So anyway, at the end of the talk, there were questions.
And the last question was, later in his life, Oppenheimer came to regret what he'd done.
How would you compare yourself with Oppenheimer?
And I couldn't resist it.
So I said, Oppenheimer never got the Nobel Prize in physics.
I think that's a good note to end on.
Thank you so much for joining us, Geoff.
It's always a pleasure.
Thank you.
Related Videos
OpenHuman VS Hermes AI: Who Wins?
JulianGoldieSEO
285 views•2026-05-29
Long-Running Agents — Build an Agent That Never Forgets with Google ADK
suryakunju
142 views•2026-05-30
5 Mind Blowing Omni Uses Cases
PaulJLipsky
1K views•2026-06-02
This computer is made from real human brain cells. And you can buy it.
Talktmsmedia
3K views•2026-05-28
BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 views•2026-06-03
I Made the Same Anime Fight Scene in Every AI Video Generator
NobleGooseAnime
295 views•2026-05-30
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K views•2026-06-01
I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)
AICodingDaily
298 views•2026-05-29











