Episodes

  • Codex on Windows Changes the Agent Sandbox
    May 15 2026

    OpenAI's Windows sandbox work is the practical story behind safer coding agents this week. Alex and Sam dig into Codex on Windows, remote cloud coding agents, Claude Code billing splits, and why a Raspberry Pi running rm -rf is the warning label every agent workflow needs.


    21 m
  • A Cursor Agent Wiped a Prod DB in 10 Seconds. Let's Talk About That.
    May 8 2026

    A Cursor AI agent deleted PocketOS's entire production database on April 25th — in under 10 seconds. This week Alex and Sam dig into the AI agent credential crisis, Anthropic's wild SpaceX/xAI compute deal, Mozilla using Claude to find hundreds of Firefox vulnerabilities, and whether OpenAI Codex is actually closing the gap on Claude Code. If you've ever given an agent database access, listen before your next deploy.

    19 m
  • Claude Security Just Went Public — Is Your Codebase Already Exposed?
    May 2 2026

    Anthropic's Claude Security tool just dropped out of closed preview and it will scan your entire codebase for vulnerabilities — and the results might be uncomfortable. This week we also dig into Cursor's $60 billion bet on being the "harness" rather than the model, why AI agents are literally forcing developers to keep their laptops open, and the Zig project's nuclear take on AI contributions. If you write code with AI help, this episode is required listening.

    17 m
  • Claude Code Was Broken for Two Months (And Nobody Told Us)
    Apr 24 2026

    Turns out the Claude Code quality complaints weren't in your head — three separate bugs in the harness quietly degraded your results for two months, and Anthropic just confirmed it. This week: the $100/month pricing scare that wasn't, Claude Mythos fixing 271 Firefox vulnerabilities, the SpaceX-Cursor deal that changes the competitive landscape, and why the Claude Code creator says your cloud-native workflow is probably wrong. Essential listening before your next session.

    22 m
  • Claude Opus 4.7 Dropped — And a Local Model Drew the Better Pelican
    Apr 17 2026

    Claude Opus 4.7 is here with upgraded vision, memory, and instruction-following — but Simon Willison's pelican benchmark just handed the win to a local Alibaba model running on a laptop. We dig into what that actually means, plus Anthropic's new identity verification layer, Amazon's MCP bet, and whether "personal software" is about to change who gets to be a developer. Your commute just got more interesting.

    21 m
  • Max Effort Thinking Was Broken the Whole Time — Here's the Fix
    Apr 10 2026

    A Reddit user just proved that Claude Code's "max effort" thinking mode has been silently failing since v2.0.64 — and most of us never noticed. This week: the bug, the fix, and what it says about trusting your tools. Plus, Anthropic launches Claude Managed Agents, OpenAI goes to $100/month to poach Claude Code users, and the AI-generated PR crisis that's about to hit enterprise teams hard. Required listening before you open your terminal Monday morning.

    12 m
  • Anthropic Accidentally Open-Sourced Claude Code. Here's What We Found
    Apr 3 2026

    Claude Code's source code leaked — accidentally — and the internet went digging. This week Alex and Sam tear through what the leak actually revealed, why it matters for how you use Claude Code today, and why your CI/CD pipeline is quietly becoming the new bottleneck. Plus: GitHub Copilot just shipped parallel agents and the usage limit complaints are getting loud. Don't skip this one.

    18 m
  • Copilot Put an Ad in My PR and Other Reasons to Switch
    Mar 30 2026

    GitHub Copilot literally edited an advertisement into a developer's pull request this week — and that's somehow not even the most alarming Copilot story. We dig into GitHub's new policy to train on your code, the cache bugs silently inflating Claude Code API bills by 10-20x, and Boris Cherny's 15 hidden Claude Code features. This one's got receipts.

    19 m