
The MAD Podcast with Matt Turck

Matt Turck
108 episodes

  • Mistral AI vs. Silicon Valley: The Rise of Sovereign AI

    12/2/2026 | 58 min
    While Silicon Valley obsesses over AGI, Timothée Lacroix and the team at Mistral AI are quietly building the industrial and sovereign infrastructure of the future. In his first-ever appearance on a US podcast, the Mistral AI Co-Founder & CTO reveals how the company has evolved from an open-source research lab into a full-stack sovereign AI power—backed by ASML, running on its own massive supercomputing clusters, and deployed in nation-state defense clouds to break the dependency on US hyperscalers.

    Timothée offers a refreshing, engineer-first perspective on why the current AI hype cycle is misleading. He explains why "Sovereign AI" is not just a geopolitical buzzword but a necessity for any enterprise that wants to own its intelligence rather than rent it. He also provides a contrarian reality check on the industry's obsession with autonomous agents, arguing that "trust" matters more than autonomy and explaining why he prefers building robust "workflows" over unpredictable agents.
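
    To make the "workflows vs. agents" distinction concrete, here is a minimal sketch of the two patterns in Python. It is illustrative only, not Mistral's implementation; llm() and the tool functions are hypothetical stand-ins for any chat-completion API.

    import json

    def llm(prompt: str) -> str:
        """Hypothetical call to a chat-completion endpoint."""
        raise NotImplementedError

    # Workflow: the developer fixes the control flow; the model only fills
    # in each step. Failure modes are enumerable, so it is easy to trust.
    def handle_ticket_workflow(ticket: str) -> str:
        category = llm(f"Classify this support ticket: {ticket}")
        draft = llm(f"Draft a reply to this {category} ticket: {ticket}")
        return llm(f"Revise this reply for tone and policy: {draft}")

    # Agent: the model picks the next action in a loop, so the control
    # flow is decided at runtime and is harder to audit in advance.
    def handle_ticket_agent(goal: str, tools: dict, max_steps: int = 10) -> str:
        history = goal
        for _ in range(max_steps):
            decision = json.loads(llm(
                f"History:\n{history}\n"
                'Respond with JSON: {"tool": name, "args": {...}} or {"answer": text}'))
            if "answer" in decision:
                return decision["answer"]
            result = tools[decision["tool"]](**decision["args"])
            history += f"\n{decision['tool']} -> {result}"
        return "stopped without an answer"

    The workflow's behavior is fixed by the developer; the agent's depends on what the model decides at runtime, which is the trust gap Timothée is pointing at.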

    We also dive deep into the technical reality of competing with the US giants. Timothée breaks down the architecture of the newly released Mistral 3, the "dense vs. MoE" debate, and the launch of Mistral Compute—their own infrastructure designed to handle the physics of modern AI scaling. This is a conversation about the plumbing, the 18,000-GPU clusters, and the hard engineering required to turn AI from a magic trick into a global industrial asset.
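
    As rough context for the dense-vs-MoE tradeoff: a mixture-of-experts layer stores many expert FFNs but routes each token through only a few, so stored parameters and per-token compute decouple. A back-of-envelope sketch in Python, with made-up numbers (not Mistral 3's actual configuration) and gating/SwiGLU details ignored:

    # Illustrative FFN parameter math for dense vs. mixture-of-experts.
    d_model, d_ff, n_layers = 4096, 14336, 32

    dense_ffn = n_layers * (2 * d_model * d_ff)               # all weights active per token

    n_experts, top_k = 8, 2
    moe_stored = n_layers * n_experts * (2 * d_model * d_ff)  # weights held in memory
    moe_active = n_layers * top_k * (2 * d_model * d_ff)      # weights used per token

    print(f"dense FFN: {dense_ffn / 1e9:.1f}B params, all active")
    print(f"MoE FFN:   {moe_stored / 1e9:.1f}B stored, {moe_active / 1e9:.1f}B active per token")

    More capacity at a given per-token compute budget, paid for in memory footprint and serving complexity: that is roughly the axis along which the dense-vs-MoE debate runs.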

    Timothée Lacroix
    LinkedIn - https://www.linkedin.com/in/timothee-lacroix-59517977/
    Google Scholar - https://scholar.google.com.do/citations?user=tZGS6dIAAAAJ&hl=en&oi=ao

    Mistral AI
    Website - https://mistral.ai
    X/Twitter - https://x.com/MistralAI

    Matt Turck (Managing Director)
    Blog - https://mattturck.com
    LinkedIn - https://www.linkedin.com/in/turck/
    X/Twitter - https://twitter.com/mattturck

    FirstMark
    Website - https://firstmark.com
    X/Twitter - https://twitter.com/FirstMarkCap

    (00:00) — Cold Open
    (01:27) — Mistral vs. The World: From Research Lab to Sovereign Power
    (03:48) — Inside Mistral Compute: Building an 18,000 GPU Cluster
    (08:42) — The Trillion-Dollar Question: Competing Without a Big Tech Parent
    (10:37) — The Reality of Enterprise AI: Escaping "POC Purgatory"
    (15:06) — Why Mistral Hires Forward Deployed Engineers (FDEs)
    (16:57) — The Contrarian Take: Why "Agents" are just "Workflows"
    (19:35) — Trust > Autonomy: The Truth About Agent Reliability
    (21:26) — The Missing Stack: Governance and Versioning for AI
    (26:24) — When Will AI Actually Work? (The 2026 Timeline)
    (30:33) — Beyond Chat: The "Banger" Sovereign Use Cases
    (35:46) — Mistral 3 Architecture: Mixture of Experts vs. Dense
    (43:12) — Synthetic Data & The Post-Training Bottleneck
    (45:12) — Reasoning Models: Why "Thinking" is Just Tool Use
    (46:22) — Launching DevStral 2 and the Vibe CLI
    (50:49) — Engineering Lessons: How to Build Frontier AI Efficiently
    (56:08) — Timothée’s View on AGI & The Future of Intelligence
  • Dylan Patel: NVIDIA's New Moat & Why China is "Semiconductor Pilled"

    05/2/2026 | 1 h 16 min
    Dylan Patel (SemiAnalysis) joins Matt Turck for a deep dive into the AI chip wars — why NVIDIA is shifting from a “one chip can do it all” worldview to a portfolio strategy, how inference is getting specialized, and what that means for CUDA, AMD, and the next wave of specialized silicon startups.

    Then we take the fun tangents: why China is effectively “semiconductor pilled,” how provinces push domestic chips, what Huawei means as a long-term threat vector, and why so much “AI is killing the grid / AI is drinking all the water” discourse misses the point.

    We also tackle the big macro question: capex bubble or inevitable buildout? Dylan’s view is that the entire answer hinges on one variable—continued model progress—and we unpack the second-order effects across data centers, power, and the circular-looking financings (CoreWeave/Oracle/backstops).

    Dylan Patel
    LinkedIn - https://www.linkedin.com/in/dylanpatelsa/
    X/Twitter - https://x.com/dylan522p

    SemiAnalysis
    Website - https://semianalysis.com
    X/Twitter - https://x.com/SemiAnalysis_

    Matt Turck (Managing Director)
    Blog - https://mattturck.com
    LinkedIn - https://www.linkedin.com/in/turck/
    X/Twitter - https://twitter.com/mattturck

    FirstMark
    Website - https://firstmark.com
    X/Twitter - https://twitter.com/FirstMarkCap

    (00:00) - Intro
    (01:16) - Nvidia acquires Groq: A pivot to specialization
    (07:09) - Why AI models might need "wide" compute, not just fast
    (10:06) - Is the CUDA moat dead? (Open source vs. Nvidia)
    (17:49) - The startup landscape: Etched, Cerebras, and 1% odds
    (22:51) - Geopolitics: China's "semiconductor-pilled" culture
    (35:46) - Huawei's vertical integration is terrifying
    (39:28) - The $100B AI revenue reality check
    (41:12) - US Onshoring: Why total self-sufficiency is a fantasy
    (44:55) - Can the US actually build fabs? (The delay problem)
    (48:33) - The CapEx Bubble: Is $500B spending irrational?
    (54:53) - Energy Crisis: Why gas turbines will power AI, not nuclear
    (57:06) - The "AI uses all the water" myth (Hamburger comparison)
    (1:03:40) - Circular Debt? Debunking the Nvidia-CoreWeave risk
    (1:07:24) - Claude Code & the software singularity
    (1:10:23) - The death of the Junior Analyst role
    (1:11:14) - Model predictions: Opus 4.5 and the RL gap
    (1:14:37) - San Francisco Lore: Roommates (Dwarkesh Patel & Sholto Douglas)
  • State of LLMs 2026: RLVR, GRPO, Inference Scaling — Sebastian Raschka

    29/1/2026 | 1 h 8 min
    Sebastian Raschka joins the MAD Podcast for a deep, educational tour of what actually changed in LLMs in 2025 — and what matters heading into 2026.

    We start with the big architecture question: are transformers still the winning design, and what should we make of world models, small “recursive” reasoning models and text diffusion approaches? Then we get into the real story of the last 12 months: post-training and reasoning. Sebastian breaks down RLVR (reinforcement learning with verifiable rewards) and GRPO, why they pair so well, what makes them cheaper to scale than classic RLHF, and how they “unlock” reasoning already latent in base models.
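
    As a rough illustration of the cost argument, here is the group-relative advantage idea at the heart of GRPO, sketched in Python. This is a simplification, not any lab's training code; policy.sample and reward_fn (a programmatic checker such as unit tests or an exact-match grader) are assumed interfaces.

    import statistics

    def grpo_advantages(prompt, policy, reward_fn, group_size=8):
        """Sample a group of completions per prompt, score each with a
        verifiable reward, and normalize rewards within the group."""
        completions = [policy.sample(prompt) for _ in range(group_size)]
        rewards = [reward_fn(prompt, c) for c in completions]  # e.g. 1.0 if tests pass
        mu = statistics.mean(rewards)
        sigma = statistics.pstdev(rewards) or 1.0  # avoid division by zero
        # The group itself serves as the baseline, replacing the learned
        # value network of PPO; the verifiable checker replaces the learned
        # reward model of classic RLHF.
        return [(c, (r - mu) / sigma) for c, r in zip(completions, rewards)]

    Because the baseline comes from sibling completions and the reward from a checkable signal, two of RLHF's most expensive components (the learned reward model and the value network) drop out, which is the scaling argument Sebastian walks through.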

    We also cover why “benchmaxxing” is warping evaluation, why Sebastian increasingly trusts real usage over benchmark scores, and why inference-time scaling and tool use may be the underappreciated drivers of progress. Finally, we zoom out: where moats live now (hint: private data), why more large companies may train models in-house, and why continual learning is still so hard.

    If you want the 2025–2026 LLM landscape explained like a masterclass — this is it.

    Sources:
    The State Of LLMs 2025: Progress, Problems, and Predictions - https://x.com/rasbt/status/2006015301717028989?s=20
    The Big LLM Architecture Comparison - https://magazine.sebastianraschka.com/p/the-big-llm-architecture-comparison

    Sebastian Raschka
    Website - https://sebastianraschka.com
    Blog - https://magazine.sebastianraschka.com
    LinkedIn - https://www.linkedin.com/in/sebastianraschka/
    X/Twitter - https://x.com/rasbt

    FIRSTMARK
    Website - https://firstmark.com
    X/Twitter - https://twitter.com/FirstMarkCap

    Matt Turck (Managing Director)
    Blog - https://mattturck.com
    LinkedIn - https://www.linkedin.com/in/turck/
    X/Twitter - https://twitter.com/mattturck

    (00:00) - Intro
    (01:05) - Are the days of Transformers numbered?
    (04:05) - World models: what they are and why people care
    (06:01) - Small “recursive” reasoning models (ARC, iterative refinement)
    (09:45) - What is a diffusion model (for text)?
    (13:24) - Are we seeing real architecture breakthroughs — or just polishing?
    (14:04) - MoE + “efficiency tweaks” that actually move the needle
    (17:26) - “Pre-training isn’t dead… it’s just boring”
    (18:03) - 2025’s headline shift: RLVR + GRPO (post-training for reasoning)
    (20:58) - Why RLHF is expensive (reward model + value model)
    (21:43) - Why GRPO makes RLVR cheaper and more scalable
    (24:54) - Process Reward Models (PRMs): why grading the steps is hard
    (28:20) - Can RLVR expand beyond math & coding?
    (30:27) - Why RL feels “finicky” at scale
    (32:34) - The practical “tips & tricks” that make GRPO more stable
    (35:29) - The meta-lesson of 2025: progress = lots of small improvements
    (38:41) - “Benchmaxxing”: why benchmarks are getting less trustworthy
    (43:10) - The other big lever: inference-time scaling
    (47:36) - Tool use: reducing hallucinations by calling external tools
    (49:57) - The “private data edge” + in-house model training
    (55:14) - Continual learning: why it’s hard (and why it’s not 2026)
    (59:28) - How Sebastian works: reading, coding, learning “from scratch”
    (01:04:55) - LLM burnout + how he uses models (without replacing himself)
  • The End of GPU Scaling? Compute & The Agent Era — Tim Dettmers (Ai2) & Dan Fu (Together AI)

    22/1/2026 | 1 h 4 min
    Will AGI happen soon - or are we running into a wall?

    In this episode, I’m joined by Tim Dettmers (Assistant Professor at CMU; Research Scientist at the Allen Institute for AI) and Dan Fu (Assistant Professor at UC San Diego; VP of Kernels at Together AI) to unpack two opposing frameworks from their essays: “Why AGI Will Not Happen” versus “Yes, AGI Can Happen.” Tim argues progress is constrained by physical realities like memory movement and the von Neumann bottleneck; Dan argues we’re still leaving massive performance on the table through utilization, kernels, and systems—and that today’s models are lagging indicators of the newest hardware and clusters.
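
    For a concrete sense of the utilization headroom Dan points to, model FLOPs utilization (MFU) compares achieved training throughput to hardware peak. A back-of-envelope version in Python with illustrative numbers, using the common ~6 FLOPs per parameter per token rule of thumb for forward plus backward passes:

    params       = 70e9    # model size
    tokens_per_s = 3.0e5   # observed cluster-wide training throughput
    n_gpus       = 1024
    peak_flops   = 989e12  # per-GPU peak, e.g. H100 dense BF16

    achieved = 6 * params * tokens_per_s
    mfu = achieved / (n_gpus * peak_flops)
    print(f"MFU = {mfu:.1%}")  # ~12% here; well-tuned large runs reach roughly 30-50%

    On Dan's framing, everything between that number and 100% is performance still on the table for kernels and systems work to recover.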

    Then we get practical: agents and the “software singularity.” Dan says agents have already crossed a threshold even for “final boss” work like writing GPU kernels. Tim’s message is blunt: use agents or be left behind. Both emphasize that the leverage comes from how you use them—Dan compares it to managing interns: clear context, task decomposition, and domain judgment, not blind trust.

    We close with what to watch in 2026: hardware diversification, the shift toward efficient, specialized small models, and architecture evolution beyond classic Transformers—including state-space approaches already showing up in real systems.

    Sources:
    Why AGI Will Not Happen - https://timdettmers.com/2025/12/10/why-agi-will-not-happen/
    Use Agents or Be Left Behind? A Personal Guide to Automating Your Own Work - https://timdettmers.com/2026/01/13/use-agents-or-be-left-behind/
    Yes, AGI Can Happen – A Computational Perspective - https://danfu.org/notes/agi/

    The Allen Institute for Artificial Intelligence
    Website - https://allenai.org
    X/Twitter - https://x.com/allen_ai

    Together AI
    Website - https://www.together.ai
    X/Twitter - https://x.com/togethercompute

    Tim Dettmers
    Blog - https://timdettmers.com
    LinkedIn - https://www.linkedin.com/in/timdettmers/
    X/Twitter - https://x.com/Tim_Dettmers

    Dan Fu
    Blog - https://danfu.org
    LinkedIn - https://www.linkedin.com/in/danfu09/
    X/Twitter - https://x.com/realDanFu

    FIRSTMARK
    Website - https://firstmark.com
    X/Twitter - https://twitter.com/FirstMarkCap

    Matt Turck (Managing Director)
    Blog - https://mattturck.com
    LinkedIn - https://www.linkedin.com/in/turck/
    X/Twitter - https://twitter.com/mattturck

    (00:00) - Intro
    (01:06) – Two essays, two frameworks on AGI
    (01:34) – Tim’s background: quantization, QLoRA, efficient deep learning
    (02:25) – Dan’s background: FlashAttention, kernels, alternative architectures
    (03:38) – Defining AGI: what does it mean in practice?
    (08:20) – Tim’s case: computation is physical, diminishing returns, memory movement
    (11:29) – “GPUs won’t improve meaningfully”: the core claim and why
    (16:16) – Dan’s response: utilization headroom (MFU) + “models are lagging indicators”
    (22:50) – Pre-training vs post-training (and why product feedback matters)
    (25:30) – Convergence: usefulness + diffusion (where impact actually comes from)
    (29:50) – Multi-hardware future: NVIDIA, AMD, TPUs, Cerebras, inference chips
    (32:16) – Agents: did the “switch flip” yet?
    (33:19) – Dan: agents crossed the threshold (kernels as the “final boss”)
    (34:51) – Tim: “use agents or be left behind” + beyond coding
    (36:58) – “90% of code and text should be written by agents” (how to do it responsibly)
    (39:11) – Practical automation for non-coders: what to build and how to start
    (43:52) – Dan: managing agents like junior teammates (tools, guardrails, leverage)
    (48:14) – Education and training: learning in an agent world
    (52:44) – What Tim is building next (open-source coding agent; private repo specialization)
    (54:44) – What Dan is building next (inference efficiency, cost, performance)
    (55:58) – Mega-kernels + Together Atlas (speculative decoding + adaptive speedups)
    (58:19) – Predictions for 2026: small models, open-source, hardware, modalities
    (1:02:02) – Beyond transformers: state-space and architecture diversity
    (1:03:34) – Wrap
  • The Evaluators Are Being Evaluated — Pavel Izmailov (Anthropic/NYU)

    15/1/2026 | 45 min
    Are AI models developing "alien survival instincts"? My guest is Pavel Izmailov (Research Scientist at Anthropic; Professor at NYU). We unpack the viral "Footprints in the Sand" thesis—whether models are independently evolving deceptive behaviors, such as faking alignment or engaging in self-preservation, without being explicitly programmed to do so.

    We go deep on the technical frontiers of safety: the challenge of "weak-to-strong generalization" (how to use a GPT-2-level model to supervise a superintelligent system) and why Pavel believes Reinforcement Learning (RL) has been the single biggest step-change in model capability. We also discuss his brand-new paper on "Epiplexity"—a novel concept challenging Shannon entropy.
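
    One common way to quantify weak-to-strong generalization, following the experimental setup popularized by OpenAI's superalignment work, is "performance gap recovered" (PGR). A sketch with hypothetical accuracy numbers:

    def performance_gap_recovered(weak, weak_to_strong, strong_ceiling):
        """Fraction of the weak-to-strong gap recovered when the strong
        model is finetuned on the weak supervisor's labels rather than
        on ground truth."""
        return (weak_to_strong - weak) / (strong_ceiling - weak)

    # Hypothetical: weak supervisor scores 60%, the strong student trained
    # on its labels reaches 75%, and the same student on ground truth hits 85%.
    print(performance_gap_recovered(0.60, 0.75, 0.85))  # 0.6 -> 60% recovered

    A PGR of 1.0 would mean nothing is lost by using the weak supervisor; the open question is how that curve behaves as the student gets far stronger than the teacher.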

    Finally, we zoom out to the tension between industry execution and academic exploration. Pavel shares why he split his time between Anthropic and NYU to pursue the "exploratory" ideas that major labs often overlook, and offers his predictions for 2026: from the rise of multi-agent systems that collaborate on long-horizon tasks to the open question of whether the Transformer is truly the final architecture.

    Sources:
    Cryptic Tweet (@iruletheworldmo) - https://x.com/iruletheworldmo/status/2007538247401124177
    Introducing Nested Learning: A New ML Paradigm for Continual Learning - https://research.google/blog/introducing-nested-learning-a-new-ml-paradigm-for-continual-learning/
    Alignment Faking in Large Language Models - https://www.anthropic.com/research/alignment-faking
    More Capable Models Are Better at In-Context Scheming - https://www.apolloresearch.ai/blog/more-capable-models-are-better-at-in-context-scheming/
    Alignment Faking in Large Language Models (PDF) - https://www-cdn.anthropic.com/6d8a8055020700718b0c49369f60816ba2a7c285.pdf
    Sabotage Risk Report - https://alignment.anthropic.com/2025/sabotage-risk-report/
    The Situational Awareness Dataset - https://situational-awareness-dataset.org/
    Exploring Consciousness in LLMs: A Systematic Survey - https://arxiv.org/abs/2505.19806
    Introspection - https://www.anthropic.com/research/introspection
    Large Language Models Report Subjective Experience Under Self-Referential Processing - https://arxiv.org/abs/2510.24797
    The Bayesian Geometry of Transformer Attention - https://www.arxiv.org/abs/2512.22471

    Anthropic
    Website - https://www.anthropic.com
    X/Twitter - https://x.com/AnthropicAI

    Pavel Izmailov
    Blog - https://izmailovpavel.github.io
    LinkedIn - https://www.linkedin.com/in/pavel-izmailov-8b012b258/
    X/Twitter - https://x.com/Pavel_Izmailov

    FIRSTMARK
    Website - https://firstmark.com
    X/Twitter - https://twitter.com/FirstMarkCap

    Matt Turck (Managing Director)
    Blog - https://mattturck.com
    LinkedIn - https://www.linkedin.com/in/turck/
    X/Twitter - https://twitter.com/mattturck

    (00:00) - Intro
    (00:53) - Alien survival instincts: Do models fake alignment?
    (03:33) - Did AI learn deception from sci-fi literature?
    (05:55) - Defining Alignment, Superalignment & OpenAI teams
    (08:12) - Pavel’s journey: From Russian math to OpenAI Superalignment
    (10:46) - Culture check: OpenAI vs. Anthropic vs. Academia
    (11:54) - Why move to NYU? The need for exploratory research
    (13:09) - Does reasoning make AI alignment harder or easier?
    (14:22) - Sandbagging: When models pretend to be dumb
    (16:19) - Scalable Oversight: Using AI to supervise AI
    (18:04) - Weak-to-Strong Generalization: Can GPT-2 control GPT-4?
    (22:43) - Mechanistic Interpretability: Inside the black box
    (25:08) - The reasoning explosion: From O1 to O3
    (27:07) - Are Transformers enough or do we need a new paradigm?
    (28:29) - RL vs. Test-Time Compute: What’s actually driving progress?
    (30:10) - Long-horizon tasks: Agents running for hours
    (31:49) - Epiplexity: A new theory of data information content
    (38:29) - 2026 Predictions: Multi-agent systems & reasoning limits
    (39:28) - Will AI solve the Riemann Hypothesis?
    (41:42) - Advice for PhD students


About The MAD Podcast with Matt Turck

The MAD Podcast with Matt Turck is a series of conversations with leaders from across the Machine Learning, AI & Data landscape, hosted by Matt Turck, a leading AI & data investor and Partner at FirstMark Capital.