Skip to content

Parse agent traces messages for SFT using teich#8232

Merged
lhoestq merged 1 commit into
mainfrom
parse-agent-traces-messages-for-sft
Jun 3, 2026
Merged

Parse agent traces messages for SFT using teich#8232
lhoestq merged 1 commit into
mainfrom
parse-agent-traces-messages-for-sft

Conversation

@lhoestq
Copy link
Copy Markdown
Member

@lhoestq lhoestq commented Jun 2, 2026

small breaking change (mostly for the dataset viewer on HF that has to take this into account): the field "traces" is renamed "trace" cc @cfahlgren1

This PR adds the teich extracted traces data, i.e. the fields messages, prompt, tools and metadata

In [1]: from datasets import load_dataset

In [2]: ds = load_dataset("lhoestq/agent-traces-example", split="train")

In [3]: ds[0]["messages"]
[{'role': 'user', 'content': 'Download a random dataset from Hugging Face, use DuckDB to inspect it, and come back with a short report about it. Be concise and include: dataset name, what files/format you found, row count or rough size if you can determine it,...'
 ...]

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@lhoestq lhoestq merged commit b7c064d into main Jun 3, 2026
13 of 15 checks passed
@lhoestq lhoestq deleted the parse-agent-traces-messages-for-sft branch June 3, 2026 14:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants