TL;DR

Opus 4.7 dropped this week with real benchmark gains for coding agents, plus three breaking API changes you need to handle before upgrading. Anthropic Labs launched Claude Design, a new product that packages finished designs as handoff bundles for Claude Code. Claude Code itself hit v2.1.114 with

API & Model Updates

Introducing Claude Opus 4.7

GA as of April 16. SWE-bench Verified jumps from 80.8% to 87.6% (+6.8 points), CursorBench from 58% to 70% (+12 points), and vision resolution is 3× higher. Pricing holds at $5/$25 per MTok, the same as Opus 4.6. Three breaking API changes require immediate attention before you migrate:
New in 4.7: a new

Behavior changes (not breaking, but prompt tuning may be needed): more literal instruction following at lower effort levels, response length calibrated to task complexity, fewer tool calls by default, a more direct tone, and fewer subagents spawned by default.

Benchmark to watch: BrowseComp (agentic web search) dropped from 83.7% on Opus 4.6 to 79.3% on Opus 4.7, a 4.4-point regression. If your agents rely heavily on web research, test carefully before switching. See the full migration guide and the what's new doc.

⚠️ Deprecation Deadlines: Action Required
Also new: Claude Design, a new Anthropic Labs product

Introducing Claude Design by Anthropic Labs

Launched April 17. A new Anthropic Labs product for creating designs, prototypes, slides, one-pagers, and marketing assets in collaboration with Claude Opus 4.7. Available in research preview for Pro, Max, Team, and Enterprise subscribers at claude.ai/design (off by default for Enterprise; admin toggle required). Why this matters for builders:
Enterprise usage-based customers get a one-time credit covering ~20 prompts, expiring July 17, 2026. Worth using before then if you want to evaluate on Anthropic's credit.

Claude Code & CLI

Claude Code v2.1.108–v2.1.114: This Week's Releases

v2.1.114: parity with Claude Code v2.1.114 internals.
v2.1.113 (Apr 17): Breaking SDK change. The TypeScript Agent SDK now spawns a native per-platform Claude Code binary instead of bundled JavaScript. This improves performance but requires the updated SDK package. Also includes 22 bug fixes across sessions, Remote Control, subagents, and terminal input.
v2.1.112 (Apr 16): Fixes the "claude-opus-4-7 is temporarily unavailable" error that blocked Opus 4.7 in auto mode. Required if you're running Opus 4.7.
v2.1.111 (Apr 16): Opus 4.7
v2.1.108 (Apr 14):

Full version history: claudefa.st changelog · GitHub CHANGELOG.md

Redesigned Claude Code Desktop: Parallel Sessions & Integrated Editor

Released April 14. The desktop app now supports multiple Claude Code sessions running side by side in a single window, managed via a new sidebar. Also added: an integrated terminal, file editing, HTML and PDF preview, and a faster diff viewer, all in a drag-and-drop layout.

Introducing Routines in Claude Code

Released April 14. A Routine bundles a prompt, one or more repositories, and a set of MCP connectors. It is packaged once and then runs automatically on Anthropic-managed cloud infrastructure, so you don't need to keep your laptop open. Three trigger types: a schedule (hourly, nightly, or weekly), an API call via a per-routine HTTP endpoint with a bearer token, or a GitHub webhook on repository events. Limits: Pro users get 5 runs/day, Max gets 15, and Team/Enterprise get up to 25. Shipped under the

Engineering reads from Anthropic

Scaling Managed Agents: Decoupling the Brain from the Hands

Architecture writeup on Claude Managed Agents: virtualizing sessions, harnesses, and sandboxes into stable interfaces so the harness implementation can change without breaking your integration.
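To make the "stable interface" idea concrete, here is a minimal Python sketch of the general pattern. It is not Anthropic's API: `Sandbox`, `EchoSandbox`, `RecordingSandbox`, and `Brain` are hypothetical names invented for illustration. The point is only that the planning side depends on an interface, so the executing side can be swapped freely.

```python
from dataclasses import dataclass
from typing import List, Protocol


class Sandbox(Protocol):
    """Hypothetical 'hands' interface: anything that can run a command."""

    def run(self, command: str) -> str:
        ...


class EchoSandbox:
    """One stand-in implementation that just echoes the command back."""

    def run(self, command: str) -> str:
        return f"echo:{command}"


class RecordingSandbox:
    """A second implementation that also keeps a history of commands."""

    def __init__(self) -> None:
        self.history: List[str] = []

    def run(self, command: str) -> str:
        self.history.append(command)
        return f"recorded:{command}"


@dataclass
class Brain:
    """Hypothetical 'brain': depends only on the Sandbox interface."""

    sandbox: Sandbox

    def execute(self, steps: List[str]) -> List[str]:
        # The brain never inspects which concrete sandbox it was given.
        return [self.sandbox.run(step) for step in steps]


steps = ["ls", "pytest"]
print(Brain(EchoSandbox()).execute(steps))
print(Brain(RecordingSandbox()).execute(steps))
```

Swapping `EchoSandbox` for `RecordingSandbox` requires no change to `Brain`, which is the property the writeup describes for harness implementations.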
Worth reading before building production agentic pipelines on Managed Agents.

Quantifying Infrastructure Noise in Agentic Coding Evals

How network, container startup, and tool latency affect coding eval scores. Key finding: infrastructure noise accounts for several percentage points of variance and should be measured explicitly when comparing models. Relevant if you're running your own evals.

Agent SDK & Managed Agents

Claude Agent SDK TypeScript: v0.2.107–v0.2.114 this week

v0.2.114 (Apr 18): parity with Claude Code v2.1.114.
v0.2.113 (Apr 17): Breaking:
v0.2.111 (Apr 16): Opus 4.7 support. This version is required to use Opus 4.7. Per-tool

Install:

MCP Ecosystem

MCP 'Design Flaw' Puts 200K Servers at Risk: Researcher Disclosure

Ox Security published research on April 16 documenting a systemic vulnerability in MCP's STDIO transport that allows arbitrary OS command execution. Ten CVEs were issued across downstream tools, and the affected packages total 150M+ downloads. Anthropic declined to modify the protocol architecture, calling the behavior "expected," though it quietly updated its security policy guidance to recommend using STDIO MCP adapters with caution. The vulnerability also enables zero-click prompt injection across Claude Code and other AI IDEs via malicious repository content. If you're running STDIO-based MCP servers in production, review the Ox Security technical deep-dive and your own server's trust model.

GitHub MCP Server v1.0.0: Generally Available

Hit 1.0.0 this week with a new

Developer Tools & Community

claude-code-best-practices: #1 Trending on GitHub (19.7k stars)

The

awesome-claude-code: 39.3k stars, new additions this week

The community-curated list of Claude Code skills, hooks, agents, and plugins continues to grow. New additions this week include

OpenClaude: Open-Source Claude Code CLI Fork for 200+ Models

OpenClaude is a community fork rewritten to support OpenAI-compatible APIs, Gemini, GitHub Models, Codex, and Ollama.
Install:

What Claude Code's Source Revealed About AI Engineering Culture

An analysis (76 points on Hacker News) of the Claude Code source following its partial disclosure. Covers architectural patterns and observations about engineering culture.

Competitor Dev Tools

OpenAI Codex: Major Expansion on April 16

Codex can now operate your computer alongside you, generate images, remember preferences, and handle repeatable tasks. Developer additions: PR review, multi-file/terminal views, SSH to remote devboxes, and 90+ new plugins (Atlassian Rovo, GitLab Issues, CircleCI, Microsoft Suite, Neon, Render). Most directly comparable to Claude Code's

OpenAI Agents SDK: Sandbox Execution & Model-Native Harness

Updated April 15 with sandboxed container execution, a model-native harness for frontier models, and a standardized workspace abstraction (local dirs, AWS S3, GCS, Azure Blob, R2). Python-first; TypeScript support is planned. Directly comparable to Claude Managed Agents. TechCrunch writeup.

Cursor 3.1: Canvas for Interactive Agent Output

Released April 16. Canvas lets agents generate persistent React interfaces (charts, tables, diff views) directly in the Agents Window instead of markdown output. Also adds self-hosted cloud agents for enterprise, so full tool execution stays in your own network.

Research & Safety

Automated Alignment Researchers: Using LLMs to Scale Scalable Oversight

Published April 14. Nine AI agents running on Claude Opus 4.6 conducted alignment research autonomously for 5 days (~800 agent-hours) and achieved a Performance Gap Recovered (PGR) score of 0.97, versus 0.23 for a week of human researcher work on the same task. The paper also discusses sandbagging risks (models deliberately performing worse during evaluations) that affect how you should interpret evals of advanced models. Full research paper.
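For readers unfamiliar with PGR, the sketch below assumes the definition commonly used in the weak-to-strong generalization literature: the fraction of the gap between a weak baseline and a strong ceiling that a method closes. The digest does not state the paper's exact formula, and the accuracy figures in the code are hypothetical numbers chosen only so the outputs land on 0.97 and 0.23; they are not results from the paper.

```python
def performance_gap_recovered(weak: float, achieved: float, ceiling: float) -> float:
    """Fraction of the weak-to-ceiling gap closed by `achieved`.

    Assumed definition: (achieved - weak) / (ceiling - weak).
    0.0 means no better than the weak baseline; 1.0 means the ceiling was matched.
    """
    gap = ceiling - weak
    if gap == 0:
        raise ValueError("ceiling and weak baseline must differ")
    return (achieved - weak) / gap


# Hypothetical accuracies, for illustration only (not from the paper).
weak_baseline = 0.60
strong_ceiling = 0.80

agent_pgr = performance_gap_recovered(weak_baseline, 0.794, strong_ceiling)
human_pgr = performance_gap_recovered(weak_baseline, 0.646, strong_ceiling)

print(f"agents: {agent_pgr:.2f}")
print(f"humans: {human_pgr:.2f}")
```

Under this definition a PGR of 0.97 means nearly the whole gap to the ceiling was closed, while 0.23 means under a quarter was, which is why the comparison in the summary is notable.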
Anthropic's Long-Term Benefit Trust Appoints Vas Narasimhan to Board

Narasimhan (CEO of Novartis) joins Anthropic's board as an appointee of the Long-Term Benefit Trust, the governance body that can override Anthropic's board in cases where commercial incentives conflict with the company's mission. Relevant context for builders evaluating Anthropic's long-term governance stability.

⚠️ Action items this week
All resources
You're getting this because you subscribed to the Claude Marketplaces weekly digest at claudemarketplaces.com. First issue on the new Kit.com pipeline; reply if anything looks off.