feat: upgrade vLLM to 0.16.0 by vivekkalyan · Pull Request #584 · OpenPipe/ART

vivekkalyan · 2026-02-27T02:35:46Z

Summary

Upgrades vllm from 0.15.1 to 0.16.0

Testing Scope

Models evaluated:
- OpenPipe/Qwen3-14B-Instruct
- Qwen/Qwen3-30B-A3B-Instruct-2507
Modes evaluated:
- inference-only
- strict replay
- ART-E
Experiment discipline:
- H200 single-GPU
- one fresh VM/cluster per run

Inference-only (single GPU, c=8)

14B:
- throughput: 621.88 -> 620.69 tok/s (-0.19%)
- latency avg: 1.2766s -> 1.2775s (+0.07%)
30B-A3B:
- 0.16.0: 620.38 tok/s, 1.1545s latency avg
- 0.15.1: 618.06 tok/s, 1.1585s latency avg
- +0.38% throughput, -0.34% latency for 0.16.0

ART-E

Task-quality metrics stayed stable across both models; 30B-A3B ART-E showed slight latency/throughput improvement on 0.16.0.

14B:

latency_mean: 0.384353 -> 0.384383 (+0.0076%)
latency_p95: 0.877676 -> 0.877771 (+0.0108%)
completion_tokens_per_sec: 335.3655 -> 335.0151 (-0.10%)

30B-A3B:

latency_mean: 0.306657 -> 0.300918 (-1.87%)
latency_p95: 0.740726 -> 0.733300 (-1.00%)
completion_tokens_per_sec: 428.8369 -> 442.5670 (+3.20%)

Compatibility notes

Upstream protocol paths changed around reasoning_content; thinking-model flows should be exercised for ART paths that still emit it.
Tinker renderer paths are Tinker API-specific and not the primary local vLLM path.

vivekkalyan · 2026-03-11T05:03:11Z

closed by #610

vivekkalyan mentioned this pull request Feb 27, 2026

feat(vllm): upgrade to 0.16.0 with single-GPU validation #583

Closed

vivekkalyan changed the title ~~feat(vllm): upgrade to 0.16.0 with single-GPU validation~~ feat: upgrade vLLM to 0.16.0 Feb 27, 2026

build: Upgrade vLLM to 0.16.0

44cf4ce

vivekkalyan force-pushed the feat/vllm-0.16.0 branch from 6b0ce35 to 44cf4ce Compare February 27, 2026 02:41

vivekkalyan mentioned this pull request Mar 11, 2026

build: Upgrade vLLM to 0.17.0 #610

Open

vivekkalyan closed this Mar 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: upgrade vLLM to 0.16.0#584

feat: upgrade vLLM to 0.16.0#584
vivekkalyan wants to merge 1 commit intomainfrom
feat/vllm-0.16.0

vivekkalyan commented Feb 27, 2026 •

edited

Loading

Uh oh!

vivekkalyan commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

vivekkalyan commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing Scope

Inference-only (single GPU, c=8)

ART-E

Compatibility notes

Uh oh!

vivekkalyan commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vivekkalyan commented Feb 27, 2026 •

edited

Loading