Skip to content

chore(deps-dev): update vllm requirement from >=0.22.0 to >=0.22.1#440

Merged
xhd0728 merged 1 commit into
mainfrom
dependabot/pip/vllm-gte-0.22.1
Jun 12, 2026
Merged

chore(deps-dev): update vllm requirement from >=0.22.0 to >=0.22.1#440
xhd0728 merged 1 commit into
mainfrom
dependabot/pip/vllm-gte-0.22.1

Conversation

@dependabot

@dependabot dependabot Bot commented on behalf of github Jun 10, 2026

Copy link
Copy Markdown
Contributor

Updates the requirements on vllm to permit the latest version.

Release notes

Sourced from vllm's releases.

v0.22.1

Highlights

This release features 8 commits from 6 contributors (1 new)!

v0.22.1 is a patch release on top of v0.22.0 with targeted bug fixes plus a couple of additions: new model support for JetBrains' Mellum v2, zentorch-accelerated quantized linear inference on AMD Zen CPUs, and fixes for multi-node Ray data-parallel serving, DeepSeek-V4 initialization, and a few model-loading regressions.

Model Support

  • New model: JetBrains' Mellum v2, an open-weights Mixture-of-Experts code-generation model (#43992).
  • DeepSeek-V4: resolve a CUTLASS fmin compatibility issue that broke initialization (0decac0d).
  • Fix OlmoHybridForCausalLM failing to initialise after the checkpoint changed rope_parameters from None to {"rope_type": None} (#43846).
  • Fix HyperCLOVAX loading after the upstream HuggingFace repo removed its remote code (now native in transformers >= 5.9.0): register the hyperclovax model_type so vLLM uses its vendored config instead of the stale auto_map (#43860).

Hardware & Performance

  • AMD Zen CPUs: route W8A8 (int8 dynamic-symmetric) and W4A16 (GPTQ) linear inference through zentorch kernels, registered ahead of the generic oneDNN CPU kernels, with transparent fallback on non-Zen CPUs, GPUs, and XPU (#41813).

Large Scale Serving

  • Fix a deterministic hang in multi-node Ray data-parallel serving with num_api_servers > 1 by excluding the Ray DP backend from the deferred (kernel-assigned) port allocation introduced in #42585 (#43864).

Build & CI

  • Docker: stop installing flashinfer-jit-cache via --extra-index-url while it is quarantined on PyPI, fixing image builds (#44366).
  • Normalize NIXL KV-connector wheel installs so only the wheel matching the image's CUDA major is kept, fixing ImportError: libcudart.so.12 when importing nixl_ep on CUDA 13 images (#44266).

Contributors

@​khluu, @​vadiklyutiy, @​aadwived, @​shadeMe, @​alec-flowers, @​hmellor

New Contributors

Commits
  • 0decac0 fix: resolve CUTLASS fmin compatibility for DeepSeek-V4 init
  • fd56c57 Fix OlmoHybridForCausalLM not initialising (#43846)
  • 7285178 [Bugfix] Fix HyperCLOVAX CI failure after upstream removed remote code (#43860)
  • 27509c8 [Bugfix][CI] Normalize NIXL connector CUDA wheel installs (#44266)
  • b284862 [docker] Stop using extra-index-url for flashinfer-jit-cache (#44366)
  • 932dfd5 [Feature] Add support for JetBrains' Mellum v2 code generation model (#43992)
  • 682ffeb [CPU][Zen] Route W8A8 and W4A16 linear inference through zentorch on AMD Zen ...
  • 1be7a57 [Bugfix] Exclude Ray DP from #42585's deferred port allocation (#43864)
  • See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Updates the requirements on [vllm](https://github.com/vllm-project/vllm) to permit the latest version.
- [Release notes](https://github.com/vllm-project/vllm/releases)
- [Changelog](https://github.com/vllm-project/vllm/blob/main/RELEASE.md)
- [Commits](vllm-project/vllm@v0.22.0...v0.22.1)

---
updated-dependencies:
- dependency-name: vllm
  dependency-version: 0.22.1
  dependency-type: direct:development
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot Bot added dependencies Pull requests that update a dependency file python Pull requests that update python code labels Jun 10, 2026
@xhd0728 xhd0728 merged commit f334fa7 into main Jun 12, 2026
3 checks passed
@dependabot dependabot Bot deleted the dependabot/pip/vllm-gte-0.22.1 branch June 12, 2026 05:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file python Pull requests that update python code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant