Skip to content

Does BIDI streaming support image output? #3963

@manjulayalavappara-insmed

Description

I am replicating bidirectional streaming for my multi-agent system. When a user asks for an image, the root agent should route the query to one of the sub-agents, which successfully returns the image as part of its response. However, this image output is not being surfaced in the final UI response.

I have tested this multiple times. Text and audio responses work perfectly, but when the user requests an image, there is no response. While checking the run_config, I noticed that response_modalities only lists text and audio; there is no mention of image support.

Expected behavior:
Input: text or audio
Output: text, audio, and image

Metadata

Metadata

Labels

answered[Status] This issue has been answered by the maintainerlive[Component] This issue is related to live, voice and video chatrequest clarification[Status] The maintainer need clarification or more information from the author

Type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions