You Are Being Told Contradictory Things About AI: 8 examples
With headlines of an imminent job apocalypse, code red for ChatGPT and recursive self-improvement, at the same time as Anthropic's CEO yesterday saying we know how to scale to AGI, and Gemini 3 DeepThink out today, it is easy to get lost among the narratives and counter-narratives. So here are both, plus the facts behind them, for you to decide.https://epoch.ai/data/data-centersEpoch AI is the sponsor of today’s video, and my views, and those expressed in this video, do not necessarily reflect Epoch AI’s views in any way.Chapters: 00:00 - Introduction00:42 - Job Apocalypse?01:45 - Scaling to AGI04:15 - Recursive Self-Improvement Needed, or Not09:57 - OpenAI Code Red vs Gemini 3 DeepThink vs Claude Opus 4.513:27 - DeepSeek Speciale vs Mistral Large v316:45 - Claude Soul Documenthttps://lmcouncil.ai/AI Insiders ($9!): https://www.patreon.com/AIExplainedGuardian Interview: https://www.theguardian.com/technology/ng-interactive/2025/dec/02/jared-kaplan-artificial-intelligence-train-itselfMIT Study on Jobs/Tasks: https://iceberg.mit.edu/report.pdfvs https://www.cnbc.com/2025/11/26/mit-study-finds-ai-can-already-replace-11point7percent-of-us-workforce.htmlAmodei on Scaling: https://www.youtube.com/watch?v=FEj7wAjwQIkClaude Soul Document: https://www.lesswrong.com/posts/vpNG99GhbBoLov9og/claude-4-5-opus-soul-documentCapabilities Original Stance: https://www.anthropic.com/news/core-views-on-ai-safetyIlya Interview: https://www.dwarkesh.com/p/ilya-sutskever-2Ricursive Intelligence: https://x.com/RicursiveAI/status/1995932204703346946Economist Worker Usage of GenAI: https://www.economist.com/finance-and-economics/2025/11/26/investors-expect-ai-use-to-soar-thats-not-happening#selection-1409.94-1413.42Mistral v3 Large: https://docs.mistral.ai/models/mistral-large-3-25-12Compute Slowdown Paper: https://joel-becker.com/images/publications/forecasting_time_horizon_under_compute_slowdown.pdfhttps://x.com/joel_bkr/status/1993023436541903155METR Chart: https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/https://www.theinformation.com/articles/openais-350-billion-computing-cost-problem?rc=sy0ihqOpenAI Code Red: https://www.anthropic.com/news/core-views-on-ai-safetyRocket Company: https://www.independent.co.uk/news/world/americas/sam-altman-rocket-elon-musk-spacex-b2878351.htmlDeepSeek Paper: https://arxiv.org/html/2512.02556v1DeepSeek Crowdstrike CCP: https://www.crowdstrike.com/en-us/blog/crowdstrike-researchers-identify-hidden-vulnerabilities-ai-coded-software/https://simple-bench.com/Patreon Post: https://www.patreon.com/c/aiexplained/postsRobot: https://x.com/jloganolson/status/1985850115379351799
--------
20:15
--------
20:15
Gemini 3 is Here: 11 Details You Might Have Missed
Gemini 3 Pro is out, and records fell like snowflakes in Svalbard. No long description, chapters or links today, huge technical difficulties, including with audio, so just want to publish asap.https://app.grayswan.ai/ai-explainedhttps://lmcouncil.aiAI Insiders ($9!): https://www.patreon.com/AIExplainedNon-hype Newsletter: https://signaltonoise.beehiiv.com/Podcast: https://aiexplainedopodcast.buzzsprout.com/
--------
21:42
--------
21:42
Is GPT-5.1 Really an Upgrade? But Models Can Auto-Hack Govts, so … there’s that
A lot just got released in the last 36 hours, and it will all affect hundreds of millions of people. 10 details you would miss if you just read the headlines, from GPT 5.1 regressions, to how Claude hacked Govt Agencies, to SIMA 2, and Musical Turing Tests.https://assemblyai.com/aiexplainedChapters:00:00 - Introduction00:56 - GPT 5.1 Smarter?01:47 - Some Regressions03:22 - Sycophancy?05:22 - Claude Auto-Hacking 06:16 - Jailbreaking through Granularity08:22 - This Will be Re-used09:30 - Hallucinating Hacker09:57 - Surprisingly Neutral Tone12:18 - SIMA 214:10 - Alpha Parallels17:24 - AI MusicGPT 5.1 Announcement: https://openai.com/index/gpt-5-1/System Card: https://cdn.openai.com/pdf/4173ec8d-1229-47db-96de-06d87147e07e/5_1_system_card.pdfBenchmarks: https://openai.com/index/gpt-5-1-for-developers/Simple Bench: https://lmcouncil.ai/benchmarksAuto-Hacking: https://x.com/AnthropicAI/status/1989033793190277618https://www.anthropic.com/news/disrupting-AI-espionageReport: https://assets.anthropic.com/m/ec212e6566a0d47/original/Disrupting-the-first-reported-AI-orchestrated-cyber-espionage-campaign.pdfSima 2 Announcement: https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d-worlds/https://x.com/amoufarek/status/1988986075331858693Scepticism: https://www.technologyreview.com/2025/11/13/1127921/google-deepmind-is-using-gemini-to-train-agents-inside-goat-simulator-3/Voyager: https://voyager.minedojo.org/Reuters Music: https://www.reuters.com/legal/litigation/are-you-listening-bots-survey-shows-ai-music-is-virtually-undetectable-2025-11-12/
--------
18:26
--------
18:26
Bubble or No Bubble, AI Keeps Progressing (ft. Relentless Learning + Introspection)
Don’t let headlines about bubbles distract you from the real avenues of progress being explored in AI every week, including what had been thought to be a long-term blocker - continual learning (learning on the fly). https://app.grayswan.ai/ai-explainedThis, plus models introspecting (hesitate before you berate), Nano Banana 2 possibly spotted, Chinese imagen and more.AI Insiders ($9!): https://www.patreon.com/AIExplainedChapters:00:00 - Introduction01:26 - Continual Learning (Nested Learning / HOPE)07:00 - Introspection10:54 - Image-Gen ProgressNested Learning Post: https://research.google/blog/introducing-nested-learning-a-new-ml-paradigm-for-continual-learning/Nested Learning Paper: https://abehrouz.github.io/files/NL.pdfOriginal Titans Paper: https://arxiv.org/pdf/2501.00663Siri News: https://www.bloomberg.com/news/articles/2025-11-05/apple-plans-to-use-1-2-trillion-parameter-google-gemini-model-to-power-new-siriIntrospection: https://www.anthropic.com/research/introspectionFull Paper: https://transformer-circuits.pub/2025/introspection/index.html#mechanismsEarlier Work: https://www.anthropic.com/research/mapping-mind-language-modelhttps://transformer-circuits.pub/2024/scaling-monosemanticity/index.htmlRelease Post: https://x.com/AnthropicAI/status/1983584136972677319https://lmcouncil.ai Non-hype Newsletter: https://signaltonoise.beehiiv.com/Podcast: https://aiexplainedopodcast.buzzsprout.com/
--------
12:53
--------
12:53
Sora 2 - It will only get more realistic from here
Sora 2 - the start of the infinite slop-feed or a key step to a generalist agent? Better than VEO 3 or over-hyped? I bring out 6 details you may have missed, contrast the announcement to Periodic Labs and even squeeze in some Claude Sonnet 4.5 analysis. Maybe I should make my videos longer…https://80000hours.org/aiexplainedAI Insiders ($9!): https://www.patreon.com/AIExplainedChapters:00:00 - Introduction00:40 - Two models?01:15 - Rollout Details01:43 - Versus Sora 1 / Veo 304:30 - Sora App / Social Media06:40 - Masterplan09:30 - Generalist Agent? Periodic Labs12:05 - Claude Sonnet 4.513:42 - Future OutlookAnnouncement: https://openai.com/index/sora-2/Launch Video: https://www.youtube.com/live/gzneGhpXwjUSystem Card: https://cdn.openai.com/pdf/50d5973c-c4ff-4c2d-986f-c72b5d0ff069/sora_2_system_card.pdfSam Altman Blog Post on Sora App: https://blog.samaltman.com/sora-2Most Intelligent Claim: https://x.com/willdepue/status/1973089331284681110GTA: https://x.com/AndrewCurran_/status/1973298436536766666Meta Vibes: https://x.com/alexandr_wang/status/1971295156411433228?s=46Altman on Regulations: https://www.lesswrong.com/posts/5jjk4CDnj9tA7ugxr/openai-email-archives-from-musk-v-altmanOpenAI Profit: https://www.theinformation.com/articles/openais-first-half-results-4-3-billion-sales-2-5-billion-cash-burn?rc=sy0ihqPeriodic Labs: https://periodic.com/https://www.nytimes.com/2025/09/30/technology/ai-meta-google-openai-periodic.htmlhttps://x.com/LiamFedus/status/1973055380193431965https://baincapitalventures.com/insight/we-must-know-we-will-know/?s=09Sonnet 4.5: https://www.anthropic.com/news/claude-sonnet-4-5https://simple-bench.com/Non-hype Newsletter: https://signaltonoise.beehiiv.com/Podcast: https://aiexplainedopodcast.buzzsprout.com/
Covering the biggest news of the century - the arrival of smarter-than-human AI. From the author of Simple Bench, which reveals the remaining gap between LLM and human reasoning. Hype-free, and the British accent is a freebie bonus.