This issue appears related to #253126 (chat-no-response) but differs in the following ways:
Claude Sonnet 4 works fine in the same environment.
Quota is still deducted despite the failed response.
- Copilot Chat Extension Version: 0.31.2
- VS Code Version: 1.104.1
- OS Version: Windows 11 25H2
- Feature: ask mode
- Selected model: Claude Opus 4.1
Steps to Reproduce:
- Open VS Code with GitHub Copilot Chat extension installed.
- Sign in with a Copilot Pro+ account.
- Select Claude Opus 4.1 from the model picker and confirm selection.
- Enter a prompt that should return a multi‑paragraph answer.
- Observe that the output is truncated and then displays “Sorry, no response was returned”.
- Check quota usage — it decreases despite the failed response.
Expected Behavior:
Claude Opus 4.1 should return the full output without truncation or error, and failed responses should not consume quota.
Actual Behavior:
Output is truncated and then displays “Sorry, no response was returned”, and quota is still deducted.
But Claude Sonnet 4 works fine in the same environment.

This issue appears related to #253126 (chat-no-response) but differs in the following ways:
Claude Sonnet 4 works fine in the same environment.
Quota is still deducted despite the failed response.
Steps to Reproduce:
Expected Behavior:
Claude Opus 4.1 should return the full output without truncation or error, and failed responses should not consume quota.
Actual Behavior:
Output is truncated and then displays “Sorry, no response was returned”, and quota is still deducted.
But Claude Sonnet 4 works fine in the same environment.