AI DAILY / DEV
FRIDAY
May 29, 2026

    Anthropic Ships Claude Opus 4.8 — and Dynamic Workflows Up to 1,000 Subagents

    • SWE-Bench Pro jumps from 64.3% to 69.2%, ahead of GPT-5.5 and Gemini 3.1 Pro on Anthropic's runs; agentic computer use 82.8% → 83.4%, knowledge-work Elo 1753 → 1890.
    • Anthropic says Opus 4.8 is ~4× less likely than 4.7 to let its own code flaws slip past unremarked, and is more willing to flag uncertainty mid-task.
    • Same $5/$25 per million token pricing as 4.7; fast mode is ~2.5× quicker, plus a new claude.ai 'effort' control.
    • Claude Code v2.1.154 ships Dynamic Workflows: Claude writes a JS orchestration script, runs up to 16 concurrent and 1,000 total subagents per run.
    • HN front page within hours; an 'Is Opus 4.8 broken?' thread on file-reading reliability is already trending alongside it.
    models anthropic.com

    xAI Drops Grok Build 0.1 onto the API in Public Beta

    • Same model that powers the Grok Build CLI: agentic coding focus, 256K context, text + image input, always-on reasoning.
    • Priced at $1/$2 per million tokens in/out and served at 100+ tokens/second — undercuts Opus 4.8 and GPT-5.5 by ~5×.
    • 70.8% on SWE-Bench Verified on xAI's internal harness — respectable for v1, still 15–18 points behind Claude Opus and GPT-5.5.
    • Built-in MCP support, function calling, and structured outputs; available via xAI API and OpenRouter from today.
    models x.ai

    Cursor, Windsurf and GitHub Copilot All Ship Opus 4.8 Within Hours

    • Cursor: 'more efficient than Opus 4.7' on CursorBench and 'more persistent on harder tasks,' rolled out same day.
    • Windsurf added Opus 4.8 at unchanged pricing plus a new Fast Mode priced at $25/M output tokens.
    • GitHub Copilot made Opus 4.8 generally available across Pro, Pro+, Business and Enterprise on launch day.
    • Day-one model parity across every major coding IDE is now the norm — and the differentiator is harness quality, not model access.
    tools x.com

    OpenAI Wires AP Election Results and SynthID Into ChatGPT for the Midterms

    • Starting this fall in the US and Brazil, ChatGPT will surface live Associated Press results on election night.
    • OpenAI is offering Codex Security and its Trusted Access for Cyber program free to registered US voting-system manufacturers.
    • SynthID watermarks now embedded in images generated by ChatGPT, Codex and the OpenAI API — survives screenshots and resaves.
    • Tools restricted from political impersonation, voter suppression and deceptive campaign use; Democracy Works partnership covers registration info.
    industry openai.com

    Apple's iOS 27 Siri Becomes a Chatbot With Its Own App

    • Bloomberg report ahead of WWDC June 8: Siri overhaul is the centerpiece of iOS 27, iPadOS 27 and macOS 27.
    • New dedicated Siri app, plus a system-wide 'Search or Ask' bar in the Dynamic Island that lets users swap in ChatGPT or Gemini.
    • Web-grounded answers with bullet points and large images — Apple's first serious shot at ChatGPT/Claude/Gemini parity.
    • Ships alongside a revamped Image Playground, systemwide AI grammar checker and AI-generated wallpapers.
    industry bloomberg.com

    Anthropic Opens Milan Office With Generali, Pirelli and Enel Already Onboard

    • Sixth European office in a year — joining London, Dublin, Paris, Zurich and Munich.
    • Named Italian customers include Generali, Unipol, Pirelli, Enel, Angelini Pharma, Bracco, Bending Spoons and Satispay.
    • Office spans sales, technical pre/post-sales and policy; Anthropic plans to triple its international workforce.
    • Lands four days after Anthropic co-founder Chris Olah presented at the Vatican alongside Pope Leo's Magnifica Humanitas encyclical.
    industry anthropic.com