Episodes

  • Episode 55: Codex 0.132.0, Claude Code 2.1.145, Gemini Managed Agents, and WebMCP
    May 20 2026
    AgentStack Daily EP055 opens with the operator release readout: Codex CLI 0.132.0 adds first-class Python SDK authentication, simpler text turns, richer turn results, schema-constrained `codex exec resume`, faster TUI startup, auth-backed remote executor registration, image-fidelity preservation, goal-loop brakes, multi-session MCP replay fixes, remote websocket keepalives, and Windows install hardening. Claude Code CLI 2.1.145 adds `claude agents --json`, agent IDs in OpenTelemetry spans, GitHub repository and pull-request status in status-line JSON, richer plugin discovery before install, awaiting-input counts in terminal titles, hook payloads for background tasks and session crons, and several permission, MCP, terminal, review, plugin, and skill-loop fixes. Then the episode covers six concrete AgentStack topics: Google Gemini 3.5 Flash GA and Managed Agents, Chrome WebMCP, Google AI Studio's Workspace and Android build updates, Chrome DevTools for agents, and GitHub making GPT-5.3-Codex the base model for Copilot Business and Enterprise. Show notes: https://tobyonfitnesstech.com/podcasts/episode-55/
    41 mins
  • Episode 54: Claude Code 2.1.144, Cursor Composer 2.5, Stainless, Notion, Vercel AI SDK, and Cloudflare Mesh
    May 19 2026
    AgentStack Daily EP054 opens on concrete release work: Claude Code CLI 2.1.144 stabilizes background and detached agent sessions, fixes a long startup hang when the API endpoint is unreachable, repairs MCP pagination and unsupported-image handling, adds background-session resume and a session-scoped model picker, and tightens read-before-edit and search-error behavior. Then five more builder-relevant moves: Cursor Composer 2.5, a Kimi K2.5-based coding model at roughly a tenth of frontier per-token cost; Anthropic acquiring Stainless and pulling SDK code generation in-house; Notion's Developer Platform turning the workspace into a hosted agent runtime with Workers and an External Agent API; the Vercel AI SDK rewriting its LangChain and LangGraph adapter; and Cloudflare Mesh putting zero-trust networking and identity under the agent lifecycle. Show notes: https://tobyonfitnesstech.com/podcasts/episode-54/
    41 mins
  • Episode 53: OpenClaw 2026.5.18, Codex 0.131.0, Copilot Remote Agents, and Claude Search Grounding
    May 19 2026
    AgentStack Daily EP053 opens with concrete release work: OpenClaw v2026.5.18 adds typed plugin tooling, faster gateway readiness, dialog-aware browser automation, runtime parity QA, realtime Android Talk Mode, safer media handling, stronger channel delivery, Codex app-server repairs, proxy TLS support, and operator-facing Mac app polish. OpenAI Codex CLI 0.131.0 adds richer TUI controls, unified mentions, plugin marketplace and sharing commands, daemon-managed remote control, configured remote environments, an `openai-codex` Python SDK, `codex doctor`, and tougher sandbox, auth, app-server, and state handling. Then the episode moves to GitHub's May 18 Copilot agent updates for remote CLI steering, cheaper model choices, and one-click Actions repair, followed by Anthropic's API update that gives Claude's web search tool richer SEC filing data for cited financial research workflows. Show notes: https://tobyonfitnesstech.com/podcasts/episode-53/
    41 mins
  • Episode 52: Local Agents Get Their Hardware Week
    May 18 2026
    This episode follows six concrete changes in the agent stack: Ollama pushing deeper into local coding-agent runtimes, LM Studio improving Apple Silicon vision inference and remote local servers, NVIDIA positioning DGX Spark as a serious local-agent machine, EXO showing where distributed local inference still needs hardening, xAI shipping Grok Build while redirecting older model slugs to Grok 4.3, and LiteLLM plus Envoy AI Gateway tightening the routing layer that sits between agents and models. Show notes: https://tobyonfitnesstech.com/podcasts/episode-52/
    39 mins
  • Episode 51: OpenClaw 2026.5.12, Hermes Foundation, Claude Code Background Controls, and Gemini Agent Deployments
    May 16 2026
    AgentStack Daily EP051 opens with an agent-stack release readout: OpenClaw v2026.5.12 trims core installs, hardens Telegram, Codex, plugin, gateway, browser, and config paths, and improves reply delivery; Hermes Agent 2026.5.16 adds native Windows beta, PyPI installation, faster startup, a local OpenAI-compatible proxy, vision, video, browser, LSP, and verification upgrades; Claude Code 2.1.143 and 2.1.142 tighten plugin dependencies, background-session flags, PowerShell behavior, worktree isolation, MCP timeout handling, and agent-dashboard defaults. Then the episode turns to Google Cloud's Gemini Enterprise Agent Platform release notes for immutable agent revisions, traffic splitting, and Priority PayGo, and to Google's Interactions API breaking-change guide for the new `steps` timeline and `response_format` migration. Show notes: https://tobyonfitnesstech.com/podcasts/episode-51/
    41 mins
  • Episode 50: AgentStack Daily EP050 — What's New in Agent Releases
    May 15 2026
    This AgentStack Daily episode covers what is new in LLM and agent tooling: Hermes Agent v2026.5.7 adds durable boards, worker health checks, checkpoint pruning, gateway resume, no-agent cron, provider plugins, platform allowlists, and MCP fixes; Claude Code v2.1.129 through v2.1.141 add the agent view, hook JSON updates, plugin and workload-identity controls, MCP repairs, and background-agent permission fixes; Google ADK documents pause-and-resume agents with persisted state; and GitHub exposes Copilot agent tasks through REST endpoints. Show notes: https://tobyonfitnesstech.com/podcasts/episode-50/
    42 mins
  • Episode 49: Gemini Deep Research, Agents SDK Sandbox Boundaries, vLLM Kernel Fixes, and Strands Runtime Controls
    May 12 2026
    EP049 goes deep on Google’s Gemini Deep Research Agent in the Interactions API, OpenAI Agents SDK sandbox and session fixes, vLLM’s DeepSeek V4 serving patch, and Strands Agents TypeScript runtime controls for hooks, MCP, compression, retries, and human interruption. Show notes: https://tobyonfitnesstech.com/podcasts/episode-49/
    38 mins
  • Episode 48: Codex Remote Control, Agent RCE Hardening, Copilot Session Hooks, and Microsoft Agent Framework 1.5
    May 10 2026
    OpenClaw Daily EP048 opens with OpenAI Codex 0.130.0 and its remote-control app-server entrypoint, paged thread views, plugin hook metadata, config refresh, turn-diff accuracy, multi-environment image resolution, and telemetry changes. The episode then explains Microsoft’s Semantic Kernel RCE case study, GitHub Copilot SDK session hooks and diagnostics, and Microsoft Agent Framework 1.5 changes around Magentic orchestration, WebBrowsingTool allowlists, reasoning events, todo-state injection, and wire-format fixes. Show notes: https://tobyonfitnesstech.com/podcasts/episode-48/
    37 mins