feat(opencode): respect provider/model `streaming: false` to disable response streaming by sebdanielsson · Pull Request #31357 · anomalyco/opencode

sebdanielsson · 2026-06-08T12:30:18Z

Issue for this PR

Closes #785

Type of change

Bug fix
New feature
Refactor / code improvement
Documentation

What does this PR do?

Some OpenAI-compatible backends either don't support streaming or return broken streamed output. In my case a self-hosted vLLM (Gemma) corrupts streamed tool-call arguments (duplicates characters), so every edit comes back garbled. The existing options.streaming config wasn't actually consumed, so there was no way to opt out.

This makes options.streaming: false (per-model or per-provider) actually work. When set, it adds the AI SDK's simulateStreamingMiddleware, which calls doGenerate (stream: false on the wire) and replays the result as a simulated stream, so the rest of the pipeline is unchanged. Streaming stays on by default, so existing behavior is untouched.

{ "provider": { "vllm": { "options": { "streaming": false } } } }
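
To make the "generate once, replay as a stream" idea concrete, here is a minimal, dependency-free sketch. It is not the AI SDK's implementation of simulateStreamingMiddleware: the `Completion` and `StreamPart` shapes and the `replayAsStream` and `collectText` helpers are hypothetical and heavily simplified, purely for illustration.

```typescript
// Hypothetical, simplified shapes; the real AI SDK result and stream parts differ.
type Completion = { text: string; finishReason: string }
type StreamPart =
  | { type: "text-delta"; delta: string }
  | { type: "finish"; finishReason: string }

// Turn one complete (non-streamed) result into the sequence of parts
// that a streaming consumer expects to see.
function replayAsStream(result: Completion): StreamPart[] {
  const parts: StreamPart[] = []
  if (result.text.length > 0) {
    // The whole text arrives as a single delta: nothing was streamed on the wire.
    parts.push({ type: "text-delta", delta: result.text })
  }
  parts.push({ type: "finish", finishReason: result.finishReason })
  return parts
}

// A consumer written against stream parts works unchanged on the replayed parts.
function collectText(parts: StreamPart[]): string {
  let text = ""
  for (const part of parts) {
    if (part.type === "text-delta") text += part.delta
  }
  return text
}
```

The point of the sketch is that downstream code only ever sees stream parts, which is why the rest of the pipeline would not need to change.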

Only covers the default AI SDK path. The experimental experimentalNativeLlm runtime is a separate path and isn't handled here.

How did you verify your code works?

Added a test in test/session/llm.test.ts that sets streaming: false and has the mock server return a non-streaming JSON completion. That response only parses if the request was non-streaming (a streamed request expects SSE and fails), and the test also asserts that body.stream isn't true.
bun test test/session/llm.test.ts -> 27 pass, plus typecheck and oxlint clean.
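
As a rough illustration of the request-body check described above, the sketch below shows one way such an assertion could be expressed. The `CapturedBody` type and `isNonStreamingRequest` helper are hypothetical and are not taken from the actual test file.

```typescript
// Hypothetical shape of the request body captured by a mock server.
type CapturedBody = { stream?: unknown; [key: string]: unknown }

// Returns true only when the request did not ask for streaming.
// Testing truthiness (rather than `!== true`) also rejects values such as 1 or "true".
function isNonStreamingRequest(body: CapturedBody): boolean {
  return !body.stream
}
```

A missing `stream` field and an explicit `stream: false` both count as non-streaming here; any truthy value is treated as a streaming request.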

Screenshots / recordings

No UI changes.

Checklist

I have tested my changes locally
I have not included unrelated changes in this PR

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

This PR adds a configuration-based way to disable true provider streaming while keeping the rest of the pipeline streaming-compatible by simulating a stream.

Changes:

Add simulateStreamingMiddleware() to opt out of on-the-wire streaming when options.streaming === false.
Add a test asserting that provider requests are not made with stream: true when streaming is disabled.
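
The opt-out condition in the first change can be sketched as follows. This is a minimal illustration, not the code in llm.ts: the `Options` type and `streamingDisabled` helper are hypothetical, and the rule that a model-level setting overrides a provider-level one is an assumption the PR text does not confirm.

```typescript
// Hypothetical options shape; only the `streaming` key matters for this sketch.
type Options = { streaming?: unknown }

// Assumption: a model-level setting wins over a provider-level one.
function streamingDisabled(providerOptions: Options = {}, modelOptions: Options = {}): boolean {
  const value =
    modelOptions.streaming !== undefined ? modelOptions.streaming : providerOptions.streaming
  // Strict comparison: only an explicit `false` opts out.
  // A missing value, or any other value, keeps streaming on.
  return value === false
}
```

Using `=== false` rather than a falsy check keeps the default (streaming on) when the option is absent.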

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
packages/opencode/src/session/llm.ts	Adds streaming opt-out logic and conditionally injects simulated streaming middleware.
packages/opencode/test/session/llm.test.ts	Adds coverage to confirm provider requests are non-streaming when configured.


github-actions · 2026-06-08T13:40:10Z

Thanks for updating your PR! It now meets our contributing guidelines. 👍

sebdanielsson · 2026-06-08T13:41:37Z

Just ran this version in a GitHub Actions workflow, and it worked without the stream-stripping proxy we're currently using to get around this limitation. 👍

lmeyerov · 2026-06-08T18:44:59Z

We hit the Bedrock side of this. Meta's Llama models on Bedrock reject tool use over /converse-stream (ValidationException: This model doesn't support tool use in streaming mode), so with tools they're unusable today. streaming: false fixes it: verified against real Llama 4 Maverick, which then made native tool calls and answered correctly, with streaming models unaffected (Claude on Bedrock still streamed, test/session/llm.test.ts green).

One Bedrock-specific gap I ran into: the prompt-transform middleware only runs for args.type === "stream", so under doGenerate the Bedrock message transform gets skipped. Widening that guard to also run when streaming is off makes the Bedrock path correct under streaming: false. Happy to send a patch against this branch.
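
The guard change proposed above can be sketched like this. It is only an illustration of the condition, assuming the call type takes the values "generate" and "stream"; the function names and the `streamingDisabled` flag are hypothetical and do not come from the opencode source.

```typescript
// Assumed call types passed to the prompt-transform middleware.
type CallType = "generate" | "stream"

// Guard as described in the comment: only streamed calls get the transform.
function shouldTransformBefore(type: CallType): boolean {
  return type === "stream"
}

// Widened guard: also transform calls made while streaming is switched off,
// which reach the provider through doGenerate instead of the streaming path.
function shouldTransformAfter(type: CallType, streamingDisabled: boolean): boolean {
  return type === "stream" || streamingDisabled
}
```

With the widened guard, a "generate" call is transformed only when streaming has been disabled, so other generate-style calls keep their current behavior.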

…ponse streaming

Some OpenAI-compatible backends don't support streaming or return broken streamed output (e.g. self-hosted vLLM corrupting streamed tool-call args). The existing options.streaming config wasn't consumed, so there was no way to opt out.

Honor options.streaming:false (per-model or per-provider) by adding the AI SDK's simulateStreamingMiddleware, which calls doGenerate (stream:false on the wire) and replays the result as a simulated stream, leaving the rest of the pipeline unchanged. Defaults to streaming on.

Fixes anomalyco#785

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

sebdanielsson · 2026-06-08T22:39:27Z

> We hit the Bedrock side of this. Meta's Llama models on Bedrock reject tool use over /converse-stream (ValidationException: This model doesn't support tool use in streaming mode), so with tools they're unusable today. streaming: false fixes it: verified against real Llama 4 Maverick, which then made native tool calls and answered correctly, with streaming models unaffected (Claude on Bedrock still streamed, test/session/llm.test.ts green).
>
> One Bedrock-specific gap I ran into: the prompt-transform middleware only runs for args.type === "stream", so under doGenerate the Bedrock message transform gets skipped. Widening that guard to also run when streaming is off makes the Bedrock path correct under streaming: false. Happy to send a patch against this branch.

Feel free to send it, thanks for testing!👍

Copilot AI review requested due to automatic review settings June 8, 2026 12:30

github-actions Bot added the needs:compliance label (this means the issue will auto-close after 2 hours) Jun 8, 2026

Copilot AI reviewed Jun 8, 2026

Comment thread packages/opencode/src/session/llm.ts

Comment thread packages/opencode/test/session/llm.test.ts

github-actions Bot removed the needs:compliance label (this means the issue will auto-close after 2 hours) Jun 8, 2026

sebdanielsson force-pushed the feat/disable-streaming-option branch from 0da9544 to fda049c Compare June 8, 2026 13:40

sebdanielsson and others added 2 commits June 9, 2026 00:34

test(opencode): tighten streaming-disabled assertion to reject truthy…

b96bf57

… stream values Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

sebdanielsson force-pushed the feat/disable-streaming-option branch from fda049c to b96bf57 Compare June 8, 2026 22:35

github-actions Bot mentioned this pull request Jun 9, 2026

📊 AI CLI Tools Community Daily Report 2026-06-09 zx0828/big_model_radar#94

Open

sebdanielsson wants to merge 2 commits into anomalyco:dev from sebdanielsson:feat/disable-streaming-option
