- Z.ai (Zhipu) drops the full 753B-parameter (40B active) MoE on Hugging Face under an MIT license — usable 1M-token context, two thinking-effort levels.
- SWE-bench Pro 62.1 (vs 58.6 for GPT-5.5) and FrontierSWE 74.4% (vs 72.6%), with API pricing at roughly one-sixth of GPT-5.5's.
- HN front page #3: 647 points, 368 comments in 20 hours — top comment: 'grateful to Chinese labs for being open with their work after the Fable 5 fiasco.'
- Lands the same week US export controls keep Anthropic's Fable 5 and Mythos 5 offline, sharpening the 'open model nobody can ban' framing.
- Swami Sivasubramanian's 11am ET keynote anchors a confirmed lineup: Kiro Pro Max tier, AWS FinOps Agent in preview, AgentCore updates, and Amazon Quick (the Q Business replacement) generally available.
- Kiro Pro Max adds higher usage caps and access to frontier models inside the spec-driven IDE that replaced Amazon Q Developer; international rollout extends the May 7 US launch.
- FinOps Agent investigates cost anomalies, opens Jira tickets from Cost Optimization Hub recommendations, and posts findings to Slack on a schedule.
- Gemma 4 (31B dense, 26B-A4B MoE, E2B) now live on Bedrock with native function calling, 256K context, and multimodal text/image/video/audio input.
- New method replays a representative slice of real ChatGPT traffic against an unreleased model to forecast misbehavior rates pre-launch.
- Built on ~1.3 million de-identified conversations across GPT-5 Thinking through GPT-5.4 deployments (Aug 2025 – Mar 2026).
- OpenAI says it sidesteps the narrow-prompt-set problem of classic evals, but acknowledges it can't reliably catch behaviors rarer than 1 in 200,000 messages.
- Published June 16 — first concrete look at how OpenAI plans to gate GPT-5.5/5.6-class launches now that the model spec ships faster than human review.
- Google DeepMind and Google.org partner with Imperial College London, IAS, IHES, Simons Institute, and TIFR to apply Gemini and AlphaEvolve to open math problems.
- Builds on Gemini Deep Think hitting IMO gold-medal level (5 of 6 problems) and AlphaEvolve improving solutions on 20% of more than 50 open problems.
- Programme funds joint research, tooling, and residencies — first cohort of mathematicians starts working alongside DeepMind staff this summer.
- Announced June 16 on the Google blog; pitched as the math-focused counterpart to last year's AI co-scientist push.
- Economic Index post (June 16) analysing 500,000 coding interactions across Claude.ai and Claude Code.
- Domain experts succeed more often and recover faster from agent mistakes; the gap between domain experts and intermediates is modest, while coding-only proficiency barely moves the needle.
- 79% of Claude Code conversations are full automation (agent acts) rather than augmentation (human + agent edits).
- Lands as Anthropic argues to enterprises that hiring 'people who deeply understand the problem' beats stacking junior coders in front of the agent.
- No restoration date announced; senior Anthropic engineers still in Washington meeting Commerce and the National Cyber Director's office.
- Polymarket pricing 'restored before July 1' near 80% but slipping intraday; refund window for users who upgraded June 9–14 closes June 20.
- GLM-5.2's MIT-weight drop today amplifies the developer narrative that the directive accelerated migration to open frontier models.
- Anthropic livestreamed from AWS Summit NYC at 9am EDT — Fable 5 conspicuously absent from the demo lineup, which leaned on Claude Cowork and Opus 4.8.
01
GLM-5.2 Lands With MIT Weights — Beats GPT-5.5 on Coding for One-Sixth the Cost
open-source venturebeat.com
02
AWS Summit NYC: Kiro Pro Max Goes International, FinOps Agent Hits Public Preview
tools techtimes.com
03
OpenAI Paper: Predicting Model Behavior by Simulating Deployment Before Release
research openai.com
04
DeepMind Launches AI for Math Initiative With Five Top Research Institutes
research blog.google
05
Anthropic Finds Domain Expertise Outweighs Coding Skill for Claude Code Users
research anthropic.com
06
Fable 5 Enters Day 5 Offline as Anthropic–Commerce Talks Drag On
industry explainx.ai