Pull Request

Summary

Introduces:

  1. Multimodal trace support for the OpenAI Chat Completions and Responses APIs (images, audio, and files).
  2. "Attachment" support for every trace step.

Changes

Attachments

  • Introduces the Attachment abstraction, which represents media data uploaded to a blob store. The Attachment object stores metadata and the storageUri that the Openlayer platform can use to fetch the media.
  • Every step can have an attachments field, an array of Attachment objects, which lets users log arbitrary media to a step. For example:
from openlayer.lib import trace
from openlayer.lib.tracing import log_attachment

@trace()
def my_function():
    # Do something

    # Log attachment to the `my_function` step
    log_attachment("/path/to/file")
    return
  • When streaming the trace to the Openlayer platform, we now scan the trace for attachments. If there are any, they are uploaded first, via the typical presigned URL flow, and then the trace is streamed. A sketch of this flow follows.
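For illustration, here is a minimal sketch of what the scan-and-upload step could look like. The Attachment shape and the helper names (collect_attachments, upload_attachment) are assumptions for this example, not the SDK's actual internals:

import mimetypes
from dataclasses import dataclass
from typing import Any, List, Optional

import requests


@dataclass
class Attachment:
    """Hypothetical shape: metadata plus the blob-store location."""
    id: str
    file_path: str
    storage_uri: Optional[str] = None  # Set once the upload completes


def collect_attachments(steps: List[Any]) -> List[Attachment]:
    """Walk every step in the trace and gather its attachments."""
    found: List[Attachment] = []
    for step in steps:
        found.extend(getattr(step, "attachments", None) or [])
    return found


def upload_attachment(attachment: Attachment, presigned_url: str) -> None:
    """Upload the file via a presigned URL, then record where it lives."""
    mime, _ = mimetypes.guess_type(attachment.file_path)
    with open(attachment.file_path, "rb") as f:
        response = requests.put(
            presigned_url,
            data=f,
            headers={"Content-Type": mime or "application/octet-stream"},
        )
    response.raise_for_status()
    # The Openlayer platform can later fetch the media from this URI.
    attachment.storage_uri = presigned_url.split("?")[0]

Only once every attachment has a storage URI is the trace itself streamed.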

OpenAI multimodal

  • In addition to attachments, this PR instruments the trace_openai wrapper to parse images, audio, and files in the inputs and outputs of OpenAI LLM calls.
  • To support media in OpenAI traces, we extend how inputs and outputs are represented. In summary, the schema is:
# -------------- Inputs --------------------
{
    "prompt": [
        # Old format. We still use it when the prompt only has strings.
        {
            "role": "user",
            "content": "Simple text message"  # String for text-only (backwards compatibility)
        },
        # New format. Content as a list of objects with `type` (one of `text`, `image`, `audio`, or `file`).
        {
            "role": "user",
            "content": [  # List for multimodal
                {"type": "text", "text": "What's in this image?"},
                {"type": "image", "attachment": {"id": "...", "storageUri": "...", ...}}
            ]
        }
    ]
}

# -------------- Outputs --------------------
# Old format. We still use it when the output is a simple string.
"Simple text response"  # String for text-only (backwards compatibility)

# or
{"type": "text", "text": "Text response"}

# or
{"type": "audio", "attachment": {"id": "...", "storageUri": "...", ...}}

# or mixed
[
    {"type": "text", "text": "Here's the image you requested:"},
    {"type": "image", "attachment": {...}}
]

Note that when the type is one of image, audio, or file, the object's other field is attachment, which holds a serialized Attachment object.
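To make the mapping concrete, here is a hedged sketch of how an OpenAI chat message's content could be converted into this schema. The store_media helper is a hypothetical stand-in for the SDK's upload logic, and the handling shown covers only the common OpenAI part types (text, image_url, input_audio):

from typing import Any, Dict, List, Union


def store_media(source: str) -> Dict[str, Any]:
    """Hypothetical stand-in: upload the media and return a serialized Attachment."""
    return {"id": "att_...", "storageUri": "s3://bucket/..."}


def parse_message_content(
    content: Union[str, List[Dict[str, Any]]]
) -> Union[str, List[Dict[str, Any]]]:
    """Map OpenAI message content into the trace schema above."""
    # Backwards-compatible path: plain strings stay plain strings.
    if isinstance(content, str):
        return content

    parts: List[Dict[str, Any]] = []
    for part in content:
        part_type = part.get("type")
        if part_type == "text":
            parts.append({"type": "text", "text": part["text"]})
        elif part_type == "image_url":
            parts.append(
                {"type": "image", "attachment": store_media(part["image_url"]["url"])}
            )
        elif part_type == "input_audio":
            parts.append(
                {"type": "audio", "attachment": store_media(part["input_audio"]["data"])}
            )
        else:
            parts.append(part)  # Pass unrecognized part types through unchanged
    return parts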

Context

  • OPEN-8683: Multimodal attachment support for the Python SDK
  • OPEN-8684: Enhance the OpenAI tracer to support multimodal inputs/outputs

Testing

  • Manual testing
