PodcastsTecnologíaSearch Off the Record

Search Off the Record

Google
Search Off the Record
Último episodio

110 episodios

  • Search Off the Record

    Analysing Robots.txt at scale with HTTP Archive and BigQuery

    23/04/2026 | 27 min
    In this episode of Search Off the Record, Martin and Gary turn a simple robots.txt question into a data‑driven deep dive using HTTP Archive, WebPageTest, custom JavaScript metrics, and BigQuery. They explore how millions of real robots.txt files are actually written in 2025–2026, which directives and user‑agents are most common, and what that means for modern crawling and AI bots.
    Perfect for beginner to mid‑level developers and SEOs, you'll learn how large‑scale web measurement works (HTTP Archive, Chrome UX Report, Web Almanac), and how to turn raw crawl data into actionable SEO insights. Subscribe for more candid conversations about crawling, indexing, and the data behind how Google Search and the web really work.
    Resources:
    Web Almanac →  https://almanac.httparchive.org/en/2025/
    Robotstxt custom metric for the HTTP Archive → 
    https://github.com/HTTPArchive/custom-metrics/pull/191
    robots.txt parser change → https://github.com/google/robotstxt/commit/4af32e54b715442bb04cd0470e99192f0ffb9792#commitcomment-178586774
    Episode transcript → https://goo.gle/sotr108-transcript

    Listen to more Search Off the Record → https://goo.gle/sotr-yt  
    Subscribe to Google Search Channel → https://goo.gle/SearchCentral
    Search Off the Record is a podcast series that takes you behind the scenes of Google Search with the Search Relations team.
     #SOTRpodcast #SEO #GoogleSearch
    Speakers: Martin Splitt, Gary Illyes
  • Search Off the Record

    Are websites getting "fat"? Page weight, HTML size & Googlebot limits explained

    30/03/2026 | 32 min
    In this episode of Search Off the Record, Gary and Martin dig into what "page size" and "page weight" actually mean for developers, users, and search engines.
    They discuss exploding web page sizes: median mobile homepages hit 2.3 MB in 2025 Web Almanac (up 3x from 2015), key insights for developers on page weight definitions, Googlebot's crawl limits, HTML bloat from structured data/images, and why size still hurts UX on slow connections despite faster networks.
    If you build or maintain websites, this conversation will help you rethink how much data your pages ship, where bloat really comes from, and why page weight still matters even as connections get faster.
    Resources:
    ​Web Almanac → https://almanac.httparchive.org/en/2025/
    HTML living standard → https://html.spec.whatwg.org/multipage/
    How page speed helps with conversions → 
    https://www.thinkwithgoogle.com/marketing-strategies/app-and-mobile/mobile-page-speed-data/ 
    Episode transcript → https://goo.gle/sotr106-transcript
    Listen to more Search Off the Record → https://goo.gle/sotr-yt  Subscribe to Google Search Channel → https://goo.gle/SearchCentral
    Search Off the Record is a podcast series that takes you behind the scenes of Google Search with the Search Relations team.
     #SOTRpodcast #SEO #GoogleSearch
    Speakers: Martin Splitt, Gary Illyes
  • Search Off the Record

    Google crawlers behind the scenes

    12/03/2026 | 25 min
    Developers often talk about Googlebot as if it were a single program you could just run as "googlebot.exe", but that is not how Google's crawling actually works. In this episode of Search Off the Record, Martin and Gary from the Search Relations team unpack how Google's crawling infrastructure is really built and operated.​
    They cover why "Googlebot" is a misnomer and how it relates to a central crawling software-as-a-service used by many Google products​, how crawl behavior is controlled centrally to avoid overwhelming sites (throttling, handling 503s, and "don't break the internet" safeguards)​ and more!
    If you build for the web, work on SEO, or just want a more accurate mental model of how Google crawls pages, this behind‑the‑scenes discussion is for you.
    Resources:
    ​Crawlers → https://developer.google.com/crawling 
    Episode transcript → https://goo.gle/sotr107-transcript 
    Listen to more Search Off the Record → https://goo.gle/sotr-yt  
    Subscribe to Google Search Channel → https://goo.gle/SearchCentral 
    Search Off the Record is a podcast series that takes you behind the scenes of Google Search with the Search Relations team.
     #SOTRpodcast #SEO #GoogleSearch
    Speakers: Martin Splitt, Gary Illyes
  • Search Off the Record

    How Browsers Really Parse HTML (and What That Means for SEO)

    26/02/2026 | 32 min
    Martin and Gary unpack how HTML parsing really works, why the HTML standard is so lenient, and how messy markup can silently break key SEO signals like hreflang and rel=canonical. They revisit validators and cross‑browser hacks from the Netscape/IE days, and discuss whether semantic HTML and strict validity truly matter for search. You'll also hear when link hints like preload, prefetch, and DNS prefetch help performance (and indirectly SEO), and where meta and link tags really belong.

    Resources:
    HTML Living Standard → https://html.spec.whatwg.org/
    Episode transcript → https://goo.gle/sotr105-transcript

    Listen to more Search Off the Record → https://goo.gle/sotr-yt  Subscribe to Google Search Channel → https://goo.gle/SearchCentral
    Search Off the Record is a podcast series that takes you behind the scenes of Google Search with the Search Relations team.
     #SOTRpodcast #SEO #GoogleSearch
    Speakers: Martin Splitt, Gary Illyes
  • Search Off the Record

    Do You Still Need a Website in 2026?

    12/02/2026 | 28 min
    In this episode of Search Off the Record, Martin and Gary from the Google Search Relations team tackle a deceptively simple question: do you still need a website in 2026? Starting from the recurring industry claim that "the web is dead," they explore how the web has evolved through the rise of apps, AI chatbots, and social platforms, and why the answer almost always ends up being "it depends." Tune in for an engaging discussion on how websites remain relevant and what it means for content creation and discovery.
    Episode transcript → https://goo.gle/sotr103-transcript
    Listen to more Search Off the Record → https://goo.gle/sotr-yt 
    Subscribe to Google Search Channel → https://goo.gle/SearchCentral
     
    Search Off the Record is a podcast series that takes you behind the scenes of Google Search with the Search Relations team.
     #SOTRpodcast #SEO #GoogleSearch
    Speakers: Martin Splitt, Gary Illyes

Más podcasts de Tecnología

Acerca de Search Off the Record

Search Off the Record takes you behind the scenes of Google Search and its inner workings! In each episode, the folks from the Search Relations team will give you background info on the decision-making behind launches, feature prioritization in Search Console, and the projects Google Search teams are working on. They will share fun stories from the many conferences they attend as well as from their day-to-day working life at Google. They will also dive into the currently trending conversations in the SEO community at large. Have a listen!
Sitio web del podcast

Escucha Search Off the Record, Cupertino, podcast sobre Apple y muchos más podcasts de todo el mundo con la aplicación de radio.es

Descarga la app gratuita: radio.es

  • Añadir radios y podcasts a favoritos
  • Transmisión por Wi-Fi y Bluetooth
  • Carplay & Android Auto compatible
  • Muchas otras funciones de la app

Search Off the Record: Podcasts del grupo

  • Podcast Made by Google Podcast
    Made by Google Podcast
    Tecnología
Aplicaciones
Redes sociales
v8.8.13| © 2007-2026 radio.de GmbH
Generated: 4/29/2026 - 9:20:43 PM