Podcasts CienciasThe 80,000 Hours Podcast on Artificial Intelligence

Escucha gratis este podcast en la aplicación:

radio.es

Sleep timer

Favoritos

Descarga gratuita en la App Store

The 80,000 Hours Podcast on Artificial Intelligence

80,000 Hours

Ciencias Cultura y sociedad

Último episodio

14 episodios

One: Will MacAskill on AI causing a “century in a decade” — and how we’re completely unprepared
05/06/2026 | 3 h 57 min
The 20th century saw unprecedented change: nuclear weapons, satellites, the rise and fall of communism, third-wave feminism, the internet, postmodernism, game theory, genetic engineering, the Big Bang theory, quantum mechanics, widespread birth control, and more. Now imagine all of it compressed into just 10 years.
That’s the future Will MacAskill — philosopher, founding figure of effective altruism, and now researcher at Forethought Research — argues we need to prepare for in his paper “Preparing for the intelligence explosion.” Not in the distant future, but probably in 3–7 years.
The reason: AI systems are rapidly approaching human-level capability in scientific research and intellectual tasks. Once AI exceeds human abilities in AI research itself, we’ll enter a recursive self-improvement cycle — creating wildly more capable systems. Soon after, by improving algorithms and manufacturing chips, we’ll deploy millions, then billions, then trillions of superhuman AI scientists working 24/7 without human limitations. These systems will collaborate across disciplines, build on each discovery instantly, and conduct experiments at unprecedented scale and speed — compressing a century of progress into years.
Will compares this to a mediaeval king suddenly needing to upgrade from bows and arrows to nuclear weapons to deal with an ideological threat from a kingdom he’s never heard of, while simultaneously learning he’s descended from monkeys and his god doesn’t exist.
What makes this acceleration perilous is that while technology can speed up almost arbitrarily, human institutions and decision making are much more fixed.
Consider the case of nuclear weapons: in this compressed timeline, there would have been just a three-month gap between the Manhattan Project’s start and the Hiroshima bombing, and the Cuban Missile Crisis would have lasted just over a day.
Robert Kennedy Sr, who helped navigate the actual Cuban Missile Crisis, once said that if they’d had to make decisions faster — like in 24 hours rather than 13 days — they would likely have taken much more aggressive, much riskier actions.
So there’s reason to worry about our capacity to make wise choices quickly. And in his paper, Will lays out 10 “grand challenges” we’ll need to navigate to avoid things going wrong.
Will now believes we’re entering one of the most critical periods for humanity ever — with decisions made in the next few years potentially determining outcomes millions of years into the future. In this wide-ranging conversation, Will and host Rob Wiblin discuss:
Why leading AI safety researchers now think there’s dramatically less time before AI is transformative than they’d previously thought
The three different types of intelligence explosions that occur in order
Will’s list of resulting grand challenges — including destructive technologies, space governance, concentration of power, and digital rights
How to prevent ourselves from accidentally “locking in” mediocre futures for all eternity
Ways AI could radically improve human coordination and decision making
Why we should aim for truly flourishing futures, not just avoiding extinction
Learn more and read the full transcript on the 80,000 Hours website.
This episode was originally released in March 2025.
Chapters:
Cold open (00:00:00)
Who’s Will MacAskill? (00:00:46)
Why Will now just works on AGI (00:01:02)
Will was wrong(ish) on AI timelines and hinge of history (00:04:10)
A century of history crammed into a decade (00:08:59)
Science goes super fast; our institutions don't keep up (00:15:41)
Is it good or bad for intellectual progress to 10x? (00:21:03)
An intelligence explosion is not just plausible but likely (00:22:53)
Intellectual advances outside technology are similarly important (00:28:57)
Counterarguments to intelligence explosion (00:31:31)
The three types of intelligence explosion (software, technological, industrial) (00:37:29)
The industrial intelligence explosion is the most certain and enduring (00:40:23)
Is a 100x or 1,000x speedup more likely than 10x? (00:51:50)
The grand superintelligence challenges (00:55:37)
Grand challenge #1: Many new destructive technologies (00:59:17)
Grand challenge #2: Seizure of power by a small group (01:06:45)
Is global lock-in really plausible? (01:08:37)
Grand challenge #3: Space governance (01:18:53)
Is space truly defence-dominant? (01:28:43)
Grand challenge #4: Morally integrating with digital beings (01:32:20)
Will we ever know if digital minds are happy? (01:41:01)
“My worry isn't that we won't know; it's that we won't care” (01:46:31)
Can we get AGI to solve all these issues as early as possible? (01:49:40)
Politicians have to learn to use AI advisors (02:02:03)
Ensuring AI makes us smarter decision-makers (02:06:10)
How listeners can speed up AI epistemic tools (02:09:38)
AI could become great at forecasting (02:13:09)
How not to lock in a bad future (02:14:37)
AI takeover might happen anyway — should we rush to load in our values? (02:25:29)
ML researchers are feverishly working to destroy their own power (02:34:37)
We should aim for more than mere survival (02:37:53)
By default the future is rubbish (02:49:04)
No easy utopia (02:56:55)
What levers matter most to utopia (03:06:32)
Bottom lines from the modelling (03:20:09)
People distrust utopianism; should they distrust this? (03:24:10)
What conditions make eventual eutopia likely? (03:28:49)
The new Forethought Centre for AI Strategy (03:37:21)
How does Will resist hopelessness? (03:50:13)
Outro (03:57:12)
Video editing: Simon Monsour
Audio engineering: Ben Cordell, Milo McGuire, Simon Monsour, and Dominic Armstrong
Camera operator: Jeremy Chevillotte
Transcriptions and web: Katy Moore
Two: Ajeya Cotra on accidentally teaching AI models to deceive us
05/06/2026 | 2 h 49 min
Imagine you’re an orphaned eight-year-old whose parents left you a $1 trillion company, with no trusted adult to guide you. You have to hire a smart adult to run that company, guide your life the way a parent would, and administer your vast wealth. You have to hire them based on a work trial or interview that you design. You don’t get to see any resumes or do reference checks. And because you’re so rich, tonnes of people apply — for all sorts of reasons.
Ajeya Cotra argues this peculiar setup resembles the situation humanity finds itself in as we train very general and very capable AI models using current deep learning methods. Ajeya was a senior research analyst at Coefficient Giving at the time of this interview, and she now works at METR (Model Evaluation & Threat Research).
As she explains, this eight-year-old faces a challenging problem. In the candidate pool there are likely some truly nice people, who sincerely want to help and make decisions that are in your interest. But there are probably other characters too — like people who will pretend to care while you’re monitoring them, but intend to exploit the job to enrich themselves as soon as they think they can get away with it.
Like a child trying to judge adults, at some point humans will need to judge the trustworthiness and reliability of machine learning models that are as goal-oriented as people, and greatly outclass them in knowledge, experience, breadth, and speed. Tricky!
Can’t we rely on models' performance during training tasks to guide us? Ajeya worries this won’t work. The trouble is that three different sorts of models will all produce the same output during training, but could behave very differently once deployed in a setting that allows their true colours to come through. She describes three such motivational archetypes:
Saints — models that care about doing what we really want
Sycophants — models that just want us to say they’ve done a good job, even if they get that praise by taking actions they know we wouldn’t want them to
Schemers — models that don’t care about us or our interests at all, who are just pleasing us so long as that serves their own agenda
In principle, a machine learning training process based on reinforcement learning could spit out any of these three attitudes, because all three would perform roughly equally well on the tests we give them, and ‘performs well on tests’ is how these models are selected.
But while that’s true in principle, maybe it’s not something that could plausibly happen in the real world. After all, if we train an agent based on positive reinforcement for accomplishing X, shouldn’t the training process produce a model that just does X and doesn’t have complex thoughts and goals beyond that?
According to Ajeya, this is one thing we don’t know, and should be trying to test empirically as these models get more capable. For reasons she explains in the interview, the Sycophant or Schemer models may in fact be simpler and easier for the learning algorithm to creep towards than their Saint counterparts.
But there are also ways we could end up actively selecting for motivations that we don’t want.
For example, let’s say you train an agentic AI model to run a small business, selecting for behaviours that make money and measuring success by the balance in its bank account. During training, a highly capable model may experiment with the strategy of tricking its trainers into thinking it has made money legitimately when it hasn’t. Maybe instead it steals some money and covers that up. This isn’t a hypothetical worry: models often come up with creative — sometimes undesirable — approaches during training that their developers didn’t anticipate.
If such deception isn’t caught, a model like this may be rated as particularly successful, and the training process will reinforce its tendency to engage in deceptive behaviour. A model that could deceive without being caught would, in effect, have a competitive advantage.
What if deception is picked up, but just some of the time? Would the model then learn that honesty is the best policy? Perhaps. But it might learn a different lesson instead: that deception does pay, as long as it’s done selectively and carefully enough to avoid detection. Would that actually happen? We don’t yet know, but it’s possible.
In this conversation, Ajeya and host Rob Wiblin discuss the above, as well as:
How to predict the motivations a neural network will develop through training
Whether AIs in training will functionally understand that they’re AIs being trained
Stories of AI misalignment that Ajeya doesn’t buy
Analogies for AI, from octopuses to aliens to can openers
Why it’s smarter to have separate ‘planning AIs’ and ‘doing AIs’
The benefits of only following through on AI-generated plans that make sense to human beings
Which approaches for fixing alignment problems Ajeya is most excited about, and which she thinks are overrated
How one might demo actually scary AI failure mechanisms
Learn more and read the full transcript on the 80,000 Hours website.
This episode was originally released in May 2023, but we still think it’s one of the best episodes we have at explaining core risks from power-seeking AI.
Chapters:
Rob’s intro (00:00:00)
The interview begins (00:02:38)
How Ajeya’s views have changed since 2020 (00:05:09)
Are neural networks more like a sped-up version of evolution, or a slower version of human learning? (00:17:42)
Situational awareness (00:26:10)
Misalignment stories Ajeya doesn't buy (00:42:03)
The orphan heir with a trillion-dollar fortune (00:59:14)
Saints, Sycophants, and Schemers (01:03:41)
Ways to train safer AI systems (01:23:20)
Aliens and other analogies (01:38:22)
Moral patienthood (01:53:21)
ARC Evaluations (01:55:35)
Interpretability research (02:09:25)
Rewarding models based on how good and sensible their plans seem to us (02:17:48)
Overrated approaches (02:25:49)
Demos of actually scary alignment failures (02:30:57)
Skills to develop for doing useful work (02:37:23)
Rob’s outro (02:47:24)
Producer: Keiran Harris
Audio mastering: Ryan Kessler and Ben Cordell
Transcriptions: Katy Moore
Three: Carl Shulman on the economy and national security after AGI
05/06/2026 | 4 h 14 min
The human brain does what it does with a shockingly low energy supply: just 20 watts — a fraction of a cent worth of electricity per hour. What would happen if AI technology merely matched what evolution already managed, and could accomplish the work of top human professionals given a 20-watt power supply?
Many people have sort of considered this hypothetical, but perhaps nobody has followed through and considered all the implications as much as Carl Shulman. Behind the scenes, his work has greatly influenced how leaders in artificial general intelligence (AGI) picture the world they’re creating.
Carl simply follows the logic to its natural conclusions, leading to a world where:
$0.01 of electricity can be turned into medical advice, company management, or scientific research that would cost hundreds of dollars today, resulting in a scramble to manufacture chips and apply them to the most lucrative forms of intellectual labour
Given their incredible hourly salaries, the supply of outstanding AI researchers quickly goes from 10,000 to 10 million or more, enormously accelerating progress in the field
Companies operated entirely by AIs are much faster and more cost effective than those that lean on humans for decision making, and the latter are progressively driven out of business
The technical challenges of controlling robots are rapidly overcome — leading to strong, fast, precise, and tireless robot workers able to accomplish any physical work the economy requires, and a rush to build billions of them and cash in
Overnight, the number of humans becomes irrelevant to economic growth, which is now driven by how quickly the entire machine economy can replicate its components. Given how quickly complex biological systems can reproduce — some in a matter of days — a doubling every few months may be a conservative estimate
Any country that delays participating in this economic explosion risks being outpaced and ultimately disempowered by rivals whose economies grow to be 10-fold, 100-fold, and then 1,000-fold larger than its own
As the economy grows, each person could afford the equivalent of a team of hundreds of machine ‘people’ to help them with every aspect of their lives.
And with growth rates this high, it doesn’t take long to run up against Earth’s physical limits — the toughest to engineer around being Earth’s ability to release waste heat. If this machine economy and its insatiable demand for power generates more heat than the Earth radiates into space, the planet will rapidly heat up and become uninhabitable for humans and other animals.
This eventually creates pressure to move economic activity off-planet. There’s little need for computer chips to be on Earth, and solar energy and minerals are more abundant in space. So you could develop populations of billions of scientific researchers operating on computer chips orbiting in space, sending the results of their work — such as drug designs — back to Earth.
These are just some of the wild implications if AGI could accomplish everything the most productive humans can, using the same energy supply.
In this interview with host Rob Wiblin, Carl explains the above, and Rob pushes back on whether that’s realistic or just a cool story:
If we’re heading towards the above, how come economic growth remains slow now and isn’t really increasing?
Why have computers and computer chips had so little effect on economic productivity so far?
Are self-replicating biological systems a good comparison for self-replicating machine systems?
Isn’t this just too crazy and weird to be plausible?
What bottlenecks would be encountered in supplying energy and natural resources to this growing economy?
Might there not be severely declining returns to bigger ‘brains’ and more training?
Wouldn’t humanity get scared and pull the brakes if such a transformation kicked off?
If this is right, how come economists don’t agree?
For the last section of the episode, Carl addresses the moral status of machine minds themselves. Would they be conscious or otherwise have a claim to moral rights? And how might humans and machines coexist with neither side dominating or exploiting the other?
Learn more and read the full transcript on the 80,000 Hours website.
This episode is the first part of Rob’s marathon interview with Carl Shulman in 2024. The second episode is on government and society after AGI, and you can listen to them in either order.
Chapters:
Cold open (00:00:00)
Rob's intro (00:01:00)
The interview begins (00:04:43)
Transitioning to a world where AI systems do almost all the work (00:05:20)
Economics after an AI explosion (00:14:24)
Objection: Shouldn’t we be seeing economic growth rates increasing today? (00:59:11)
Objection: Speed of doubling time (01:07:32)
Objection: Declining returns to increases in intelligence? (01:11:58)
Objection: Physical transformation of the environment (01:17:37)
Objection: Should we expect an increased demand for safety and security? (01:29:13)
Objection: “This sounds completely whack” (01:36:09)
Income and wealth distribution (01:48:01)
Economists and the intelligence explosion (02:13:30)
Baumol effect arguments (02:19:11)
Denying that robots can exist (02:27:17)
Semiconductor manufacturing (02:32:06)
Classic economic growth models (02:36:10)
Robot nannies (02:48:25)
Slow integration of decision-making and authority power (02:57:38)
Economists’ mistaken heuristics (03:01:06)
Moral status of AIs (03:11:44)
Rob's outro (04:11:46)
Producer and editor: Keiran Harris
Audio engineering lead: Ben Cordell
Technical editing: Simon Monsour, Milo McGuire, and Dominic Armstrong
Transcriptions: Katy Moore
Four: Rose Hadshar on why automating human labour will break our political system
05/06/2026 | 2 h 14 min
The most important political question in the age of advanced AI might not be who wins elections. It might be whether elections continue to matter at all.
That’s the view of Rose Hadshar, researcher at Forethought Research, who believes we could see extreme, AI-enabled power concentration without a coup or dramatic ‘end of democracy’ moment.
She foresees something more insidious: an elite group with access to such powerful AI capabilities that the normal mechanisms for checking elite power — law, elections, public pressure, the threat of strikes — cease to have much effect. Those mechanisms could continue to exist on paper, but become ineffectual in a world where humans are no longer needed to execute even the largest-scale projects.
Almost nobody wants this to happen — but we may find ourselves unable to prevent it.
If AI disrupts our ability to make sense of things, will we even notice power getting severely concentrated, or be able to resist it? Once AI can substitute for human labour across the economy, what leverage will citizens have over those in power? And what does all of this imply for the institutions we’re relying on to prevent the worst outcomes?
Rose has answers, and they’re not all reassuring.
But she’s also hopeful we can make society more robust against these dynamics. We’ve got literally centuries of thinking about checks and balances to draw on. And there are some interventions she’s excited about — like building sophisticated AI tools for making sense of the world, or ensuring multiple branches of government have access to the best AI systems.
In this conversation, Rose and host Zershaaneh Qureshi discuss all of this, and more:
Three dynamics that could reshape political power in the AI era
How AI gives small groups the productive power of millions
Why AI-powered tyranny would be uniquely difficult to topple
How power concentration compares to ‘gradual disempowerment’ by AI
Slower-moving scenarios that could still get scary
Which interventions could genuinely work — and which could backfire
Rose's most promising approaches to fighting back against power concentration
Why a ‘Manhattan Project’ approach to AI should worry you — and why international projects aren’t automatically safe either
Learn more and read the full transcript on the 80,000 Hours website.
This episode was originally released in March 2026.
Chapters:
Cold open (00:00:00)
Who’s Rose Hadshar? (00:01:05)
Three dynamics that could reshape political power in the AI era (00:02:37)
AI gives small groups the productive power of millions (00:12:49)
Dynamic 1: When a software update becomes a power grab (00:20:41)
Dynamic 2: When AI labour means governments no longer need their citizens (00:31:20)
How democracy could persist in name but not substance (00:45:15)
Dynamic 3: When AI filters our reality (00:54:54)
Good intentions won’t stop power concentration (01:08:27)
Slower-moving worlds could still get scary (01:23:57)
Why AI-powered tyranny will be tough to topple (01:31:53)
How power concentration compares to “gradual disempowerment” (01:38:18)
Some interventions are cross-cutting — and others could backfire (01:43:54)
What fighting back actually looks like (01:55:15)
Why power concentration researchers should avoid getting too “spicy” (02:04:10)
Why the “Manhattan Project” approach should worry you — but truly international projects might not be safe either (02:09:18)
Rose wants to keep humans around! (02:12:06)
Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon Monsour
Music: CORBIT
Coordination, transcripts, and web: Nick Stockton and Katy Moore
Five: Helen Toner on the geopolitics of AI in China and the Middle East
05/06/2026 | 2 h 20 min
With the US racing to develop AGI and superintelligence ahead of China, you might expect the two countries to be negotiating how they’ll deploy AI, including in the military, without coming to blows. But according to Helen Toner, director of the Center for Security and Emerging Technology in DC, “the US and Chinese governments are barely talking at all.”
In her role as a founder, and now leader, of DC’s top think tank focused on the geopolitical and military implications of AI, Helen has been closely tracking the US’s AI diplomacy since 2019.
“Over the last couple of years there have been some direct [US–China] talks on some small number of issues, but they’ve also often been completely suspended.” China knows the US wants to talk more, so “that becomes a bargaining chip for China to say, ‘We don’t want to talk to you. We’re not going to do these military-to-military talks about extremely sensitive, important issues, because we’re mad.'”
Helen isn’t sure the groundwork exists for productive dialogue in any case. “At the government level, [there’s] very little agreement” on what artificial general intelligence (AGI) is, whether it’s possible soon, and whether it poses major risks. Without shared understanding of the problem, negotiating solutions is very difficult.
Another issue is that so far the Chinese Communist Party doesn’t seem especially ‘AGI-pilled.’ While a few Chinese companies like DeepSeek are betting on scaling, she sees little evidence Chinese leadership shares Silicon Valley’s conviction that AGI will arrive any minute now, and export controls have made it very difficult for them to access compute to match US competitors.
To find an autocracy that’s truly bought into the AGI vision, we might need to look at nominal US allies. The US has approved massive data centres in the UAE and Saudi Arabia with “hundreds of thousands of next-generation Nvidia chips” — delivering colossal levels of computing power.
When OpenAI announced this deal with the UAE, they celebrated that it was “rooted in democratic values,” and would advance “democratic AI rails” and provide “a clear alternative to authoritarian versions of AI.”
But the UAE scores 18 out of 100 on Freedom House’s democracy index. “This is really not a country that respects rule of law,” Helen observes. Political parties are banned, elections are fake, dissidents are persecuted.
If AI access really determines future national power, handing world-class supercomputers to Gulf autocracies seems pretty questionable. The justification is typically that “if we don’t sell it, China will” — a transparently false claim, given severe Chinese production constraints. It also raises eyebrows that Gulf countries conduct joint military exercises with China and their rulers have “very tight personal and commercial relationships with Chinese political leaders and business leaders.”
In this conversation in Washington, DC, host Rob Wiblin and Helen discuss the above, plus much more.
Learn more and read the full transcript on the 80,000 Hours website.
This episode was originally released in November 2025.
Chapters:
Cold open (00:00:00)
Who’s Helen Toner? (00:01:02)
Helen’s role on the OpenAI board, and what happened with Sam Altman (00:01:31)
The Center for Security and Emerging Technology (CSET) (00:07:35)
CSET’s role in export controls against China (00:10:43)
Does it matter if the world uses US AI models? (00:21:24)
Is China actually racing to build AGI? (00:27:10)
Could China easily steal AI model weights from US companies? (00:38:14)
The next big thing is probably robotics (00:46:42)
Why is the Trump administration sabotaging the US high-tech sector? (00:48:17)
Are data centres in the UAE “good for democracy”? (00:51:31)
Will AI inevitably concentrate power? (01:06:20)
“Adaptation buffers” vs non-proliferation (01:28:16)
Will the military use AI for decision-making? (01:36:09)
“Alignment” is (usually) a terrible term (01:42:51)
Is Congress starting to take superintelligence seriously? (01:45:19)
AI progress isn't actually slowing down (01:47:44)
What's legit vs not about OpenAI’s restructure (01:55:28)
Is Helen unusually “normal”? (01:58:57)
How to keep up with rapid changes in AI and geopolitics (02:02:42)
What CSET can uniquely add to the DC policy world (02:05:51)
Talent bottlenecks in DC (02:13:26)
What evidence, if any, could settle how worried we should be about AI risk? (02:16:28)
Is CSET hiring? (02:18:22)
Video editing: Luke Monsour and Simon Monsour
Audio engineering: Milo McGuire, Simon Monsour, and Dominic Armstrong
Music: CORBIT
Coordination, transcriptions, and web: Katy Moore

Más podcasts de Ciencias

Acerca de The 80,000 Hours Podcast on Artificial Intelligence

10 experts, 10 episodes: a crash course on transformative AI and what you can do to help shape its trajectory. This compilation features 10 key episodes of The 80,000 Hours Podcast to help listeners — particularly those new to the topic — get to grips with the potential upsides and downsides of powerful, transformative AI.

Sitio web del podcast

Ciencias Cultura y sociedad