
Gemini Exponential, Demis Hassabis' ‘Proto-AGI’ coming, but …
19/12/2025 | 19 min
The condensed highlights of hours of AI lab leader interviews, model releases, Gemini 3 Flash insights (plus it’s hidden flaw), Hassabis’ ‘proto-AGI’ and much more…https://matsprogram.org/apply?utm_source=ai-explained&utm_medium=youtube&utm_campaign=s26 Also, do check out my new app: https://lmcouncil.aiChapters: 00:00 - Introduction00:50 - Results02:44 - But… the Flaw04:49 - So Benchmarks are fake? No07:37 - Spatial Reasoning + Hassabis10:06 - Proto-AGI12:07 - Minimal AGI15:07 - Compute Slowdown17:56 - New Data ParadigmGemini 3 Flash: https://deepmind.google/models/gemini/flash/Hassabis Interview: https://www.youtube.com/watch?v=PqVbypvxDtoLegg Interview: https://www.youtube.com/watch?v=l3u_FAv33G0Pre-training Lead Interview: https://www.youtube.com/watch?v=cNGDAqFXvewAltman Interview: https://www.youtube.com/watch?v=2P27Ef-LLuQBrockman Video: https://x.com/OpenAI/status/2001336514786017417Post-Training Reveal: https://x.com/OfficialLoganK/status/2001742530472534442Hallucinations Paper: https://cdn.openai.com/pdf/d04913be-3f6f-4d2b-b283-ff432ef4aaa5/why-language-models-hallucinate.pdfPatreon Hallucinations Vid: https://www.patreon.com/posts/blockers-to-and-139264812AA-Omniscience Benchmark: https://artificialanalysis.ai/evaluations/omnisciencehttps://arxiv.org/pdf/2511.13029lmcouncil.ai/benchmarks https://simple-bench.com/https://x.com/scaling01/status/19996205877448132055.2 Codex Drop: https://cdn.openai.com/pdf/ac7c37ae-7f4c-4442-b741-2eabdeaf77e0/oai_5_2_Codex.pdfOpenAI Compute Trend: https://www.theinformation.com/articles/openais-350-billion-computing-cost-problem?rc=sy0ihqCramer Tweet/Response: https://x.com/BorisMPower/status/2001440650210976018OpenAI Valuation: https://www.theinformation.com/articles/openai-discussed-raising-tens-billions-valuation-around-750-billion?rc=sy0ihqIndian Data: https://www.reuters.com/world/india/with-freebies-openai-google-vie-indian-users-training-data-2025-12-17/TheInformation Data: https://x.com/theinformation/status/2001421225751351778Genie 3: https://deepmind.google/blog/genie-3-a-new-frontier-for-world-models/Sima 2: https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d-worlds/Veo 3.1: https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d-worlds/METR: https://metr.org/blohttps://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/2025-03-19-measuring-ai-ability-to-complete-long-tasks/AI Insiders ($9!): https://www.patreon.com/AIExplainedNon-hype Newsletter: https://signaltonoise.beehiiv.com/

GPT 5.2: OpenAI Strikes Back
12/12/2025 | 17 min
Full GPT-5.2 breakdown - did OpenAI reclaim the crown? A story of tokens, time and cost, plus 9 details you wouldn’t get just from reading the headlines.https://www.youtube.com/@eightythousandhoursAI Insiders ($9!): https://www.patreon.com/AIExplainedhttps://lmcouncil.aiChapters:00:00 - Introduction00:55 - Better than Human @ Professional Tasks?04:42 - Test time Compute07:05 - Benchmark Selection09:32 - Simple Results + council comparison13:01 - Long Context13:52 - Self-Improvement15:00 - 10 Years + New ModelsRelease Page: https://openai.com/index/introducing-gpt-5-2/GPT 5.2 Benchmark Comparison: https://www.reddit.com/r/singularity/comments/1pka1y9/gpt52_all_20_benchmarks_rankings_and_pricing/https://storage.googleapis.com/gweb-uniblog-publish-prod/original_images/gemini_3_table_final_HLE_Tools_on.gifhttps://lmcouncil.ai/benchmarksCharxiv: https://charxiv.github.io/#leaderboardGDPval: https://arxiv.org/pdf/2510.04374My vid: https://www.youtube.com/watch?v=oK5LxMaROSAKilpatrick: https://x.com/OfficialLoganK/status/1999270402712023158/photo/1Noam Brown: https://x.com/polynoamial/status/1999189845164667132New Model in New Year: https://www.theinformation.com/articles/openai-developing-garlic-model-counter-googles-recent-gains?rc=sy0ihq10 Years of OpenAI: https://openai.com/index/ten-years/GPQA: https://x.com/idavidrein/status/1841265634170278063ARC-AGI 1-2: https://arcprize.org/arc-agi/2/Sunday Robotics: https://x.com/tonyzzhao/status/1991204839578300813Non-hype Newsletter: https://signaltonoise.beehiiv.com/https://lmcouncil.ai

You Are Being Told Contradictory Things About AI: 8 examples
05/12/2025 | 20 min
With headlines of an imminent job apocalypse, code red for ChatGPT and recursive self-improvement, at the same time as Anthropic's CEO yesterday saying we know how to scale to AGI, and Gemini 3 DeepThink out today, it is easy to get lost among the narratives and counter-narratives. So here are both, plus the facts behind them, for you to decide.https://epoch.ai/data/data-centersEpoch AI is the sponsor of today’s video, and my views, and those expressed in this video, do not necessarily reflect Epoch AI’s views in any way.Chapters: 00:00 - Introduction00:42 - Job Apocalypse?01:45 - Scaling to AGI04:15 - Recursive Self-Improvement Needed, or Not09:57 - OpenAI Code Red vs Gemini 3 DeepThink vs Claude Opus 4.513:27 - DeepSeek Speciale vs Mistral Large v316:45 - Claude Soul Documenthttps://lmcouncil.ai/AI Insiders ($9!): https://www.patreon.com/AIExplainedGuardian Interview: https://www.theguardian.com/technology/ng-interactive/2025/dec/02/jared-kaplan-artificial-intelligence-train-itselfMIT Study on Jobs/Tasks: https://iceberg.mit.edu/report.pdfvs https://www.cnbc.com/2025/11/26/mit-study-finds-ai-can-already-replace-11point7percent-of-us-workforce.htmlAmodei on Scaling: https://www.youtube.com/watch?v=FEj7wAjwQIkClaude Soul Document: https://www.lesswrong.com/posts/vpNG99GhbBoLov9og/claude-4-5-opus-soul-documentCapabilities Original Stance: https://www.anthropic.com/news/core-views-on-ai-safetyIlya Interview: https://www.dwarkesh.com/p/ilya-sutskever-2Ricursive Intelligence: https://x.com/RicursiveAI/status/1995932204703346946Economist Worker Usage of GenAI: https://www.economist.com/finance-and-economics/2025/11/26/investors-expect-ai-use-to-soar-thats-not-happening#selection-1409.94-1413.42Mistral v3 Large: https://docs.mistral.ai/models/mistral-large-3-25-12Compute Slowdown Paper: https://joel-becker.com/images/publications/forecasting_time_horizon_under_compute_slowdown.pdfhttps://x.com/joel_bkr/status/1993023436541903155METR Chart: https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/https://www.theinformation.com/articles/openais-350-billion-computing-cost-problem?rc=sy0ihqOpenAI Code Red: https://www.anthropic.com/news/core-views-on-ai-safetyRocket Company: https://www.independent.co.uk/news/world/americas/sam-altman-rocket-elon-musk-spacex-b2878351.htmlDeepSeek Paper: https://arxiv.org/html/2512.02556v1DeepSeek Crowdstrike CCP: https://www.crowdstrike.com/en-us/blog/crowdstrike-researchers-identify-hidden-vulnerabilities-ai-coded-software/https://simple-bench.com/Patreon Post: https://www.patreon.com/c/aiexplained/postsRobot: https://x.com/jloganolson/status/1985850115379351799

Gemini 3 is Here: 11 Details You Might Have Missed
19/11/2025 | 21 min
Gemini 3 Pro is out, and records fell like snowflakes in Svalbard. No long description, chapters or links today, huge technical difficulties, including with audio, so just want to publish asap.https://app.grayswan.ai/ai-explainedhttps://lmcouncil.aiAI Insiders ($9!): https://www.patreon.com/AIExplainedNon-hype Newsletter: https://signaltonoise.beehiiv.com/Podcast: https://aiexplainedopodcast.buzzsprout.com/

Is GPT-5.1 Really an Upgrade? But Models Can Auto-Hack Govts, so … there’s that
14/11/2025 | 18 min
A lot just got released in the last 36 hours, and it will all affect hundreds of millions of people. 10 details you would miss if you just read the headlines, from GPT 5.1 regressions, to how Claude hacked Govt Agencies, to SIMA 2, and Musical Turing Tests.https://assemblyai.com/aiexplainedChapters:00:00 - Introduction00:56 - GPT 5.1 Smarter?01:47 - Some Regressions03:22 - Sycophancy?05:22 - Claude Auto-Hacking 06:16 - Jailbreaking through Granularity08:22 - This Will be Re-used09:30 - Hallucinating Hacker09:57 - Surprisingly Neutral Tone12:18 - SIMA 214:10 - Alpha Parallels17:24 - AI MusicGPT 5.1 Announcement: https://openai.com/index/gpt-5-1/System Card: https://cdn.openai.com/pdf/4173ec8d-1229-47db-96de-06d87147e07e/5_1_system_card.pdfBenchmarks: https://openai.com/index/gpt-5-1-for-developers/Simple Bench: https://lmcouncil.ai/benchmarksAuto-Hacking: https://x.com/AnthropicAI/status/1989033793190277618https://www.anthropic.com/news/disrupting-AI-espionageReport: https://assets.anthropic.com/m/ec212e6566a0d47/original/Disrupting-the-first-reported-AI-orchestrated-cyber-espionage-campaign.pdfSima 2 Announcement: https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d-worlds/https://x.com/amoufarek/status/1988986075331858693Scepticism: https://www.technologyreview.com/2025/11/13/1127921/google-deepmind-is-using-gemini-to-train-agents-inside-goat-simulator-3/Voyager: https://voyager.minedojo.org/Reuters Music: https://www.reuters.com/legal/litigation/are-you-listening-bots-survey-shows-ai-music-is-virtually-undetectable-2025-11-12/



AI Explained Official Podcast