Python: fix: filter history providers in handoff cloning to prevent duplicate messages by LEDazzio01 · Pull Request #5214 · microsoft/agent-framework

LEDazzio01 · 2026-04-10T21:53:17Z

Summary

Fixes #4695 — Supersedes #4714 (rebased onto current main).

When a HandoffAgentExecutor clones an agent via _clone_chat_agent(), history providers from
the original agent are copied verbatim. These providers re-inject previously stored messages on
each agent.run() call, causing the entire conversation to appear twice — once from the
handoff's _full_conversation and again from the history provider.

Fix

Filter out BaseHistoryProvider instances during cloning and replace them with a no-op
InMemoryHistoryProvider(load_messages=False, store_inputs=False, store_outputs=False)
to prevent the agent from auto-injecting a default one at runtime.

+ from agent_framework._sessions import AgentSession, BaseHistoryProvider, InMemoryHistoryProvider
  ...
+       filtered_providers = [
+           p for p in agent.context_providers
+           if not isinstance(p, BaseHistoryProvider)
+       ]
+       filtered_providers.append(
+           InMemoryHistoryProvider(
+               load_messages=False,
+               store_inputs=False,
+               store_outputs=False,
+           )
+       )
        return Agent(
            ...
-           context_providers=agent.context_providers,
+           context_providers=filtered_providers,

Contribution Checklist

The code builds clean without any errors or warnings
The PR follows the Contribution Guidelines
Is this a breaking change? No

microsoft#4695)

Copilot

Pull request overview

This PR aims to fix duplicate chat history in Python handoff workflows by changing how agents are cloned for HandoffAgentExecutor, specifically preventing cloned agents from re-loading/storing prior messages via history providers.

Changes:

Filter BaseHistoryProvider instances out of cloned agents’ context_providers and add a no-op InMemoryHistoryProvider.
Refactor handoff executor conversation tracking/broadcasting and adjust handoff detection to return additional context.
Add a new build-time validation intended to enforce a per-service-call history persistence requirement.

Comments suppressed due to low confidence (1)

python/packages/orchestrations/agent_framework_orchestrations/_handoff.py:428

On handoff, the code appends handoff_message (a function_result-bearing message) to self._cache and then returns without clearing it. That leaves stale tool-result content in this executor’s cache, so the next time this agent is asked to respond it may replay tool artifacts and potentially trigger tool call/result mismatches. Either avoid caching this message, ensure it’s cleaned, or clear the cache before returning.

            # Add the handoff message to the cache so that the next invocation of the agent includes
            # the tool call result. This is necessary because each tool call must have a corresponding
            # tool result.
            self._cache.append(handoff_message)

            await ctx.send_message(
                AgentExecutorRequest(messages=[], should_respond=True),
                target_id=handoff_target,
            )
            await ctx.add_event(
                WorkflowEvent("handoff_sent", data=HandoffSentEvent(source=self.id, target=handoff_target))
            )
            self._autonomous_mode_turns = 0  # Reset autonomous mode turn counter on handoff
            return

Copilot · 2026-04-10T21:58:32Z