AI Daily Dev — Week 15

Anthropic Launches Managed Agents in Public Beta

Anthropic launched Claude Managed Agents, a suite of composable APIs for building cloud-hosted agents at scale. Handles sandboxed code execution, state management, credential vaults, and scoped permissions out of the box. Early adopters include Notion, Asana, Rakuten, Sentry, and Vibecode.

tools anthropic.com

Meta Debuts Muse Spark from New Superintelligence Labs

Meta released Muse Spark, the first model from Alexandr Wang's Meta Superintelligence Labs. The natively multimodal model supports visual chain-of-thought and multi-agent orchestration. Meta says AI capex will hit $115-135B in 2026, nearly double last year.

models cnbc.com

Anthropic Releases Advisor Strategy — Opus Supervises Sonnet for Less

Anthropic shipped the Advisor Strategy for the Messages API, pairing Opus 4.6 as an advisor to Sonnet or Haiku. Scores 2.7 percentage points higher on SWE-bench Multilingual than Sonnet alone while costing 11.9% less per task. A practical pattern for production agent deployments.

tools anthropic.com

Anthropic Revenue Crosses $30B, Surpassing OpenAI for First Time

Anthropic's annualized revenue run rate hit $30B, overtaking OpenAI's $25B for the first time. Over 1,000 enterprise customers now spend more than $1M/year on Claude, doubling from 500 in February. The company's growth rate is unprecedented in enterprise SaaS.

industry anthropic.com

Google Releases Gemma 4 Under Apache 2.0 — Most Capable Open Model Family

Google shipped Gemma 4 in four sizes with native vision, audio, function calling, 256K context, and 140+ language support. The flagship 31B model scores 80% on LiveCodeBench and 89.2% on AIME 2026. Apache 2.0 license means zero restrictions for commercial use.

models blog.google

MiniMax Ships M2.7 — Open-Source Self-Evolving Agent Model

MiniMax released M2.7, hitting 56.22% on SWE-Pro and 57.0% on Terminal Bench 2. The model's three-component system (short-term memory, self-feedback, self-optimization) lets it improve over 24-hour windows. Medal rate of 66.6% matches Gemini-3.1, trailing only Claude Opus-4.6 (75.7%) and GPT-5.4 (71.2%).

open-source minimax.io

Cursor Ships Remote Agent Control — Run Agents from Your Phone

Cursor launched remote AI agent control, letting developers kick off coding agents on any machine and monitor them from a phone or other device. Enables asynchronous workflows where agents run on remote dev boxes while you're away from your desk.

tools cursor.com

OpenAI, Anthropic, Google Unite Against Chinese Model Theft

The three major AI labs announced intelligence sharing through the Frontier Model Forum to combat Chinese companies stealing models via adversarial distillation. The joint effort marks the first coordinated defense against systematic model extraction.

industry bloomberg.com

Visa and Nevermined Enable AI Agent Card Payments

Visa partnered with Nevermined to let AI agents make autonomous card purchases within cardholder-defined policies, using Visa Intelligent Commerce and Coinbase's x402 protocol. Agents can now spend money on your behalf with configurable limits.

industry visa.com

OpenAI Launches $100/Month Pro Tier with 5x Codex Usage

OpenAI introduced a Pro tier at $100/month offering 5x more Codex usage than Plus, with a limited-time 10x promotion through May 31. Aimed at power users and developers who've been hitting Plus limits on agent-heavy workflows.

tools openai.com