env server #799

mikasenghaas · 2026-01-28T17:14:19Z

Description

This PR introduces the EnvClient and EnvServer which expose the run_rollout and run_group methods of an environment from a separate process (pool). This is especially useful for multi-env training (e.g. in prime-rl) and multi-env evals (e.g. in vf-eval or online evals).

Example

Running vf-eval spawns environments in env-server mode by default:

uv run vf-eval gsm8k -n5 -r3

Design

Env Server Mode

You can put an environment into "env server mode" by calling

env = vf.load_environment(env_id, **env_args)
await env.start_server()

This will implicitly start an env server as a sidecar (in a subprocess) and try to route all calls to run_rollout and run_group to the env server.
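As a rough illustration of the routing described above, here is a minimal sketch, assuming the environment simply holds an optional client and delegates to it once server mode is on. SketchEnv and FakeClient are hypothetical stand-ins, not the classes added in this PR, and no subprocess is actually spawned.

```python
import asyncio


class SketchEnv:
    """Hypothetical stand-in for an environment; not the verifiers class."""

    def __init__(self):
        self.client = None  # set once "server mode" is enabled

    async def start_server(self, client):
        # Assumption: the real method spawns a sidecar process and builds a
        # client for it. Here a ready-made client object is simply attached.
        self.client = client

    async def run_rollout(self, prompt):
        if self.client is not None:
            # Server mode: delegate the call to the client.
            return await self.client.run_rollout(prompt)
        # No server: run in the current process.
        return f"local:{prompt}"


class FakeClient:
    """Pretends to forward the call to a remote env server."""

    async def run_rollout(self, prompt):
        return f"remote:{prompt}"


async def demo():
    env = SketchEnv()
    before = await env.run_rollout("2+2")
    await env.start_server(FakeClient())
    after = await env.run_rollout("2+2")
    return before, after


result = asyncio.run(demo())
```

The caller keeps using the same run_rollout call in both modes; only the place where the work happens changes.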

EnvServer

An EnvServer is initialized like a regular environment, with an env_id and env_args:

server = ZMQEnvServer(
    env_id=args.env_id,
    env_args=args.env_args,
    address=address
)

try:
    await server.run()
finally:
    await server.close()

EnvClient

An EnvClient communicates with an env server over the configured address:

env = ZMQEnvClient(address=address)

await env.run_rollout(...) # same as Environment.run_rollout
await env.run_group(...) # same as Environment.run_group
await env.evaluate(...) # same as Environment.evaluate

Sidecar Pattern

To run an env server as a sidecar (e.g. from vf-eval), wrap the run_server class method in a Process and connect the client to the same address:

from multiprocessing import Process

env_server = Process(
    target=ZMQEnvServer.run_server,
    args=(config.env_id, config.env_args),
    kwargs=dict(address=address),
)
env_server.start()
env = ZMQEnvClient(address=address)

try:
    results = await env.evaluate(...)
finally:
    # Ask the server to exit; force-kill it if it is still alive after 5 s.
    env_server.terminate()
    env_server.join(timeout=5)
    if env_server.is_alive():
        env_server.kill()
        env_server.join()
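The terminate-then-kill escalation in the cleanup step can be tried on its own. The sketch below is a standalone illustration that uses subprocess.Popen as a stand-in for the multiprocessing Process, since its terminate, wait and kill calls follow the same pattern. stop_process is a hypothetical helper name, not part of this PR.

```python
import subprocess
import sys


def stop_process(proc, timeout=5.0):
    """Ask the process to exit, then force-kill it if it does not comply."""
    proc.terminate()  # polite request (SIGTERM on POSIX)
    try:
        proc.wait(timeout=timeout)
    except subprocess.TimeoutExpired:
        proc.kill()  # forceful stop (SIGKILL on POSIX)
        proc.wait()
    return proc.returncode


# A child that would otherwise sleep for a minute.
child = subprocess.Popen([sys.executable, "-c", "import time; time.sleep(60)"])
exit_code = stop_process(child, timeout=5.0)
```

Waiting with a timeout before escalating gives the server a chance to close its sockets cleanly, while the kill fallback keeps a hung process from blocking the caller.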

Misc Changes

vf.setup_logging(...) now also supports logging to a file
Error info is now stored in the serializable RolloutOutput so that error chains can be displayed as before
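For the logging change, here is a minimal sketch of optional file output built on the standard logging module. It assumes a single optional path argument. setup_logging_sketch, its parameters and the logger name are hypothetical and do not reflect the actual vf.setup_logging signature.

```python
import logging
import os
import tempfile


def setup_logging_sketch(level="INFO", log_file=None):
    """Hypothetical helper: log to the console and, optionally, to a file."""
    logger = logging.getLogger("env_server_sketch")
    logger.setLevel(level)
    logger.handlers.clear()  # avoid duplicate handlers on repeated calls
    logger.propagate = False
    handlers = [logging.StreamHandler()]
    if log_file is not None:
        handlers.append(logging.FileHandler(log_file))
    formatter = logging.Formatter("%(asctime)s %(levelname)s %(message)s")
    for handler in handlers:
        handler.setFormatter(formatter)
        logger.addHandler(handler)
    return logger


log_path = os.path.join(tempfile.mkdtemp(), "env_server.log")
logger = setup_logging_sketch(log_file=log_path)
logger.info("server started")
for handler in logger.handlers:
    handler.flush()

with open(log_path) as f:
    log_text = f.read()
```

A file target is handy for a sidecar process, whose console output may be hidden or interleaved with the parent's.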

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Test improvement

Testing

All existing tests pass when running uv run pytest locally.
New tests have been added to cover the changes

Checklist

My code follows the style guidelines of this project as outlined in AGENTS.md
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Additional Notes

Note

High Risk
Introduces a new multiprocess, networked execution path (ZMQ/msgpack) and refactors core rollout scheduling/serialization, which can affect correctness, performance, and cleanup behavior across evaluation runs.

Overview
Adds an environment “server mode” for evaluation/training. Environment can now spawn a sidecar ZMQEnvServer process and route run_rollout/run_group over a new EnvClient/ZMQEnvClient using ZMQ + msgpack, and vf-eval is updated to start/stop the server around each run.

Refactors rollout execution and serialization. Generation/scoring no longer use separate generation vs scoring semaphores; a single concurrency limit is applied via with_sem, tasks are always cleaned up on exit, and run_rollout/run_group now return pre-serialized RolloutOutput objects (builder now accumulates outputs, not states).
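To illustrate the single concurrency limit, here is a minimal sketch of a semaphore wrapper around a coroutine. The name with_sem comes from the summary above, but the signature used here is an assumption, and fake_rollout is a hypothetical placeholder for generation plus scoring.

```python
import asyncio


async def with_sem(sem, coro):
    """Assumed shape: await `coro` while holding `sem`."""
    async with sem:
        return await coro


async def demo(limit=3, n_tasks=10):
    sem = asyncio.Semaphore(limit)
    running = 0
    peak = 0

    async def fake_rollout(i):
        nonlocal running, peak
        running += 1
        peak = max(peak, running)
        await asyncio.sleep(0.01)  # stands in for generation + scoring
        running -= 1
        return i

    # Each coroutine body only starts once its wrapper holds the semaphore.
    results = await asyncio.gather(
        *(with_sem(sem, fake_rollout(i)) for i in range(n_tasks))
    )
    return results, peak


results, peak = asyncio.run(demo())
```

With one semaphore covering the whole rollout, the limit bounds generation and scoring together instead of each phase separately.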

Changes error and logging surfaces. Rollout error is now a structured ErrorInfo (type + chain strings) instead of a repr string, ErrorChain string/repr semantics are swapped to preserve prior displays, and logging supports optional file output; tests/docs/CLI config are updated accordingly. Dependencies add pyzmq and msgpack.
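One way to picture a structured error made of a type plus chain strings is the sketch below, which flattens an exception and its causes into plain strings that survive serialization. ErrorInfoSketch, its field names and to_error_info are hypothetical. The actual ErrorInfo in this PR may be shaped differently.

```python
from dataclasses import asdict, dataclass


@dataclass
class ErrorInfoSketch:
    """Hypothetical shape; the field names are assumptions."""

    error_type: str
    chain: list


def to_error_info(exc):
    """Walk the __cause__/__context__ links and keep only strings."""
    chain = []
    current = exc
    while current is not None:
        chain.append(f"{type(current).__name__}: {current}")
        current = current.__cause__ or current.__context__
    return ErrorInfoSketch(error_type=type(exc).__name__, chain=chain)


try:
    try:
        {}["answer"]
    except KeyError as inner:
        raise ValueError("could not parse rollout") from inner
except ValueError as outer:
    info = asdict(to_error_info(outer))
```

Because the result contains only strings and lists, it can cross a process boundary without pickling the exception objects themselves.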

Written by Cursor Bugbot for commit ed2b7d9.

pyproject.toml

verifiers/workers/client/env_client.py

verifiers/utils/save_utils.py

verifiers/utils/logging_utils.py

verifiers/utils/env_utils.py

verifiers/envs/environment.py

mikasenghaas · 2026-01-28T18:10:27Z

still missing some unit tests, which i will add (those will replace the __name__ == __main__ blocks in the env server/client impls, which i used for debugging)

willccbb · 2026-01-28T22:30:09Z

I think removing the scoring concurrency is fine. Users can make this part of their rubrics if they want (via class_objects or globals), have used this for multi-part judge rubrics + works well.

verifiers/envs/environment.py

verifiers/utils/async_utils.py

verifiers/envs/environment.py

pyproject.toml

verifiers/workers/types.py

docs/reference.md

verifiers/utils/logging_utils.py

verifiers/envs/environment.py

verifiers/workers/server/zmq_env_server.py

verifiers/workers/server/env_server.py

verifiers/utils/save_utils.py

verifiers/envs/environment.py

verifiers/workers/client/zmq_env_client.py

verifiers/envs/environment.py

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

verifiers/utils/logging_utils.py

mikasenghaas added 8 commits January 28, 2026 17:16

pick relevant changes from mika/env-worker

ba59032

runnable env server/client

0223d62

aligned interface

84433e2

integrate into vf-eval

ddc6b2f

minor

dd262de

do not double serialize

02715b3

fix retries

86f9276

pass state cols

a38022a

mikasenghaas force-pushed the env-server branch from c8bb765 to a38022a on January 28, 2026 17:16

mikasenghaas changed the base branch from overhaul-results-saving to main January 28, 2026 17:16

mikasenghaas commented Jan 28, 2026

willccbb reviewed Jan 28, 2026

verifiers/envs/environment.py (outdated, resolved)

willccbb reviewed Jan 28, 2026

verifiers/envs/environment.py (outdated, resolved)

willccbb reviewed Jan 28, 2026

verifiers/utils/async_utils.py (resolved)

mikasenghaas added 14 commits January 29, 2026 10:37

update pyrproject

377867e

update logging_utils

3cbdea8

change signatures from state -> output

9330c81

move extra env kwargs out of load_environment signatures

8584482

do not change signature

8b3dbc0

mini

0c8badd

name inner funcs

f5a5ef7

deprecate gen/score sem and move global sem into generate()

d6edb95

remove unnecesary module inti

4e11a27

fix error info in rollout output

846bcc0

run as daemon process

0eb5ed7

robustify task cleanup in env

409f580

graceful shutdowns

67c25d2

informative error

dc43e9a

mikasenghaas added 4 commits January 29, 2026 12:54

update docs

3d76f72

handle retries and state cols on server as well

eb9c1af

fix sampling args handling

775d0c0

use kill on second attempt

2f0000d

cursor bot reviewed Jan 29, 2026

address bugbot

a612201

cursor bot reviewed Jan 29, 2026

verifiers/envs/environment.py (outdated, resolved)

fix

0b8f156

cursor bot reviewed Jan 29, 2026

mikasenghaas mentioned this pull request Jan 29, 2026

resume evals #803

Draft

13 tasks

mikasenghaas added 8 commits January 29, 2026 17:33

address bugbot

ed266db

asserts

afa81d3

add client idx

42b90c3

quiet server-side env loading

521e130

minor

7681d92

dont assert nvm

27c2f69

fix eval display

b52228d

do not error pydantic on error response

e9a6611

cursor bot reviewed Jan 29, 2026

verifiers/workers/client/zmq_env_client.py (outdated, resolved)

verifiers/envs/environment.py (resolved)

verifiers/envs/environment.py (resolved)

verifiers/envs/environment.py (resolved)

mikasenghaas added 2 commits January 29, 2026 19:32

skip validation for rollout inputs (needed for multi-modal to work)

ec5387e

fix mutable args

09a6cb0

cursor bot reviewed Jan 29, 2026

verifiers/utils/logging_utils.py (resolved)

mikasenghaas added 4 commits January 29, 2026 19:51

wait for server health

a334e9c

lock to prevent race condition

3e0051b

fix logging

6efaa7a

Merge branch 'main' into env-server

85988ae

mikasenghaas changed the title ~~env server v2~~ env server Jan 29, 2026

docstring tweak

ed2b7d9

willccbb approved these changes Jan 30, 2026

willccbb merged commit 53e50f7 into main Jan 30, 2026
6 checks passed

