Return reasoning content from model output by zoeshawwang · Pull Request #112 · Serverless-Devs/agentrun-sdk-python

zoeshawwang · 2026-06-01T13:02:56Z

Summary

Remove protocol-level MODEL_PARAMETER_RULES gating from OpenAI Chat Completions and AG-UI reasoning output conversion.
Emit reasoning only when the returned reasoning_content payload is non-empty.
Keep reasoning extraction from additional_kwargs.reasoning_content, while stripping the nested transport-only field from protocol payloads.

Validation

uv run --python 3.11 --dev --extra server pytest -q tests/unittests/server/test_openai_protocol.py tests/unittests/server/test_agui_protocol.py tests/unittests/server/test_reasoning.py
uv run --python 3.11 --dev --extra server pytest -q tests/unittests/server
uv run --python 3.11 --dev --extra server ruff check agentrun/server/openai_protocol.py agentrun/server/agui_protocol.py tests/unittests/server/test_openai_protocol.py tests/unittests/server/test_agui_protocol.py
git diff --check

Notes

MODEL_PARAMETER_RULES can still be used by the runtime/model-call layer to control whether the model thinks. The SDK response protocol layer now trusts the actual model output: if reasoning is returned, it is surfaced; if reasoning is absent or empty, no reasoning field/event is emitted.

Protocol conversion should use the returned reasoning payload as the source of truth. MODEL_PARAMETER_RULES can still control model-side thinking, but OpenAI and AG-UI responses should not hide non-empty reasoning_content when the env flag says thinking is false. Constraint: User requested removal of protocol-level thinking_enabled = is_thinking_enabled_from_env() gating. Rejected: Keep env-based response suppression | runtime parameters can drift from the model output and hide returned reasoning. Confidence: high Scope-risk: narrow Directive: Do not reintroduce protocol-level reasoning env gates; only emit reasoning when returned reasoning_content is non-empty. Tested: uv run --python 3.11 --dev --extra server pytest -q tests/unittests/server/test_openai_protocol.py tests/unittests/server/test_agui_protocol.py tests/unittests/server/test_reasoning.py Tested: uv run --python 3.11 --dev --extra server pytest -q tests/unittests/server Tested: uv run --python 3.11 --dev --extra server ruff check agentrun/server/openai_protocol.py agentrun/server/agui_protocol.py tests/unittests/server/test_openai_protocol.py tests/unittests/server/test_agui_protocol.py Tested: git diff --check Change-Id: I638efa7ca19bf8ed9417fb1922d43205d4d52b65 Not-tested: Remote GitHub CI result pending after push.

Copilot

Pull request overview

This PR adjusts the OpenAI Chat Completions and AG-UI protocol handlers to surface reasoning_content based on actual model output (non-empty) rather than gating at the protocol layer via MODEL_PARAMETER_RULES. It also strips transport-only additional_kwargs.reasoning_content from emitted protocol payloads while still promoting it into the standardized reasoning field/event when present.

Changes:

Removed protocol-layer MODEL_PARAMETER_RULES gating for reasoning emission in OpenAI and AG-UI protocol conversions.
Promoted reasoning_content from additional_kwargs (and stripped the nested transport field), emitting reasoning only when non-empty.
Updated unit tests to assert reasoning is included even when MODEL_PARAMETER_RULES disables thinking.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File	Description
`agentrun/server/openai_protocol.py`	Removes protocol gating; promotes and cleans reasoning fields in OpenAI streaming/non-stream responses.
`agentrun/server/agui_protocol.py`	Removes protocol gating; emits AG-UI reasoning events when non-empty and strips transport-only reasoning fields from additions.
`tests/unittests/server/test_openai_protocol.py`	Updates/extends unit coverage for reasoning emission and promotion when thinking is disabled.
`tests/unittests/server/test_agui_protocol.py`	Updates unit coverage for reasoning events and ordering when thinking is disabled.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

            if event.event == EventType.REASONING:
-                if thinking_enabled:
-                    reasoning_content = event.data.get("delta", "")
-                    if reasoning_content:
-                        has_text = True
-                        yield self._build_chunk(
-                            context,
-                            {"reasoning_content": reasoning_content},
-                        )
+                reasoning_content = event.data.get("delta", "")
+                if reasoning_content:
+                    has_text = True
+                    yield self._build_chunk(


            if event.event == EventType.TEXT:
                content_parts.append(event.data.get("delta", ""))
                reasoning_content = get_reasoning_content(event.addition or {})
-                if thinking_enabled and reasoning_content:
+                if reasoning_content:
                    reasoning_parts.append(reasoning_content)


zoeshawwang requested review from OhYee and Copilot June 1, 2026 13:41

Copilot started reviewing on behalf of zoeshawwang June 1, 2026 13:41 View session

Copilot AI reviewed Jun 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return reasoning content from model output#112

Return reasoning content from model output#112
zoeshawwang wants to merge 1 commit into
mainfrom
support_reasoning_content

zoeshawwang commented Jun 1, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

zoeshawwang commented Jun 1, 2026

Summary

Validation

Notes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants