chore(deps): update loader dependencies major (major) by dreadnode-renovate-bot[bot] · Pull Request #194 · dreadnode/dyana

dreadnode-renovate-bot · 2026-02-24T20:12:14Z

ℹ️ Note

This PR body was truncated due to platform limits.

This PR contains the following updates:

Package	Change	Age	Confidence
psutil	`==6.1.1` → `==7.2.2`
transformers	`==4.57.6` → `==5.11.0`

Warning

Some dependencies could not be looked up. Check the Dependency Dashboard for more information.

Release Notes

giampaolo/psutil (psutil)

huggingface/transformers (transformers)

`v5.11.0`

Compare Source

Release v5.11.0

New Model additions

DiffusionGemma

DiffusionGemma is engineered to reduce the sequential bottlenecks of standard causal language models by employing an encoder-decoder architecture specifically optimized for inference speed. During inference, DiffusionGemma leverages multi-canvas sampling, where rather than generating one token at a time, the model iteratively denoises a full block of tokens using a diffusion sampler. This block-autoregressive approach facilitates text generation at higher speeds compared to traditional sequential generation methods.

Links: Documentation

GPU go brr (#46540) by @gante in #46540

DeepSeek-V3.2

DeepSeek-V3.2-Exp is an experimental model from DeepSeek-AI that introduces DeepSeek Sparse Attention (DSA), a trainable, fine-grained sparse attention mechanism designed to improve training and inference efficiency in long-context scenarios. Built on top of DeepSeek-V3.1-Terminus with a 685B-parameter Mixture-of-Experts backbone, it reduces the quadratic cost of attention over long sequences by attending only to a selected subset of past tokens while maintaining virtually identical benchmark performance. The work was extended in DeepSeek-V3.2 which pairs DSA with scalable reinforcement learning and achieves gold-medal level results on competition math and competitive programming benchmarks.

Links: Documentation | Paper

Add deepseek 3.2 exp (#41251) by @ArthurZucker in #41251

Kernels

The KernelConfig API was extended to support n-to-1 module fusion and parameter transformation, simplifying how custom kernels are integrated with Transformers modules. Additional fixes include resolving a dtype mismatch in the Mamba2 CUDA kernel path for NemotronH/Zamba2, adding fine-grained fp8/fp4 Triton kernel support, and correcting the FalconMamba fast-path warning to recommend pip install kernels instead of mamba-ssm.

Extended & simplified n-to-1 kernel fusion via KernelConfig (#46339) by @michaelbenayoun in [#46339]
Triton finegrained fp8/fp4 (#46407) by @IlyasMoutawwakil in [#46407]
Fix dtype mismatch in NemotronH/Zamba2 Mamba2 CUDA-kernel path (out_proj) (#46487) by @yuekaizhang in [#46487]
fix(falcon_mamba): recommend pip install kernels in fast-path warning (#46343) by @Anai-Guo in [#46343]

Parallelization

Fixed model parallel beam search bugs in the Qwen2-VL, Qwen2.5-VL, and Qwen3-VL MoE model families, and added documentation for tensor parallelism support with continuous batching.

[docs] tp for continuous batching (#46019) by @stevhliu in [#46019]
revisit history parallel beam search tests to avoid unnecessary fix (#46495) by @kaixuanliu in [#46495]
fix qwen series VL model's model parallel bug (#46316) by @kaixuanliu in [#46316]

Bugfixes and improvements

Fix the offsets in processing (#46525) by @zucchini-nlp in [#46525]
Fix buggy action sha pin (#46534) by @ydshieh in [#46534]
Fix trailing comma bug in DataCollatorForLanguageModeling example (#46527) by @JemmaUZH in [#46527]
Fix missing Gemma4Processor._compute_audio_num_tokens (#46416) by @csantosbh in [#46416]
Fix InternVL models (#46524) by @hmellor in [#46524]
fix(afmoe): reduce tokens in test_compile_static_cache to avoid flaky bfloat16 drift (#46521) by @ydshieh in [#46521]
[CB] Add a "max_requests_per_batch" parameter (#46434) by @remi-or in [#46434]
revamp cv docs and fix rf-detr (#46219) by @merveenoyan in [#46219]
Update hub metadata (#46379) by @zucchini-nlp in [#46379]
extend DeepseekV4FlashIntegrationTest to non-cuda device (#46517) by @sywangyi in [#46517]
[docs] deepgemm (#46361) by @stevhliu in [#46361]
[fix] regression introduced by #45534 (#46456) by @eustlb in [#46456]
Use torchvision's native LANCZOS interpolation instead of PIL fallback (#46496) by @NicolasHug in [#46496]
Add debugging info in pr-ci-caller.yml (#46505) by @ydshieh in [#46505]
Fix tests: 'Cohere2MoeModel' object has no attribute 'hf_device_map' (#46337) by @kaixuanliu in [#46337]
Bump the actions group across 1 directory with 19 updates (#46414) by @dependabot[bot] in [#46414]
Log some information in .github/workflows/pr-ci-post-dashboard-link.yml (#46499) by @ydshieh in [#46499]
feat(quantizers): support non-weight param names in TorchAo safetensors loading (#46325) by @agesf in [#46325]
docs: fix typo in make_list_of_images docstring (#46469) by @ramkumar27072006 in [#46469]
add XPU expectation for deepseek_ocr2 model tests (#46492) by @kaixuanliu in [#46492]
Fix sapiens2 tests: add XPU device expectations (#46488) by @kaixuanliu in [#46488]
Add vLLM smoke test to CI (#46383) by @hmellor in [#46383]
extend deepseek v4 test to xpu (#46366) by @sywangyi in [#46366]
Added cosmos3 model (#46146) by @MaciejBalaNV in [#46146]
fbgemm_fp8:Keep the current device aligned with the input tensor (#46403) by @kaixuanliu in [#46403]
[Modular] Add no_inherit_decorators and fixup wrong RoPE related inheritances (#46440) by @Bissmella in [#46440]
skip deepgemm test except cuda (#46090) by @jiqing-feng in [#46090]
Fix/video classification pipeline video processor (#46256) by @J3r3myPerera in [#46256]
ci: less flaky test_assisted_decoding_matches_greedy_search_1_same (#46445) by @ydshieh in [#46445]
Fix flip_back graph break (#46344) by @guarin in [#46344]
Add the other processors to auto-mappings (#46046) by @zucchini-nlp in [#46046]
fix: compatibility with torch<=2.7 (#46393) by @andylin-hao in [#46393]
fix: remove dynamic per-actor Slack ID lookup in ssh-runner workflow (#46327) by @ydshieh in [#46327]
[docs] Romanian translation of pipeline_tutorial.md, pipeline_gradio.md, pipeline_webserver.md and add_new_pipeline.md. (#46388) by @filipinescu in [#46388]
[docs] gemma4 typos (#46351) by @stevhliu in [#46351]
[docs] padding-free training (#46333) by @stevhliu in [#46333]
fix[vLLM x v5]: Default untied embeddings in AudioFlamingo3 and VibeVoice (#46400) by @harshaljanjani in [#46400]
Fix deepspeed docker (#46108) by @SunMarc in [#46108]
Fix conversion for clip models (#46406) by @zucchini-nlp in [#46406]
ci: mention code quality failure in CI dashboard comment (#46415) by @ydshieh in [#46415]
Fix noisy logging from image_processing module aliases issue - 46298 (#46350) by @skshmjn in [#46350]
Raise tqdm minimum to 4.60 to match tqdm.contrib.logging import (#46397) by @n0gu-furiosa in [#46397]
fix(gemma4_unified): conversion script and config bugs (#46398) by @douglas-reid in [#46398]
[docs] remove sparsity from compressed-tensors (#46387) by @stevhliu in [#46387]
[CB] Fix crashes when fork is not possible (#46251) by @remi-or in [#46251]
Improve CI dashboard comment: rename and deduplicate (#46412) by @ydshieh in [#46412]
Fix missing f-string prefixes in error messages (#46354) by @joaopedroassad in [#46354]
Add workflow to post CI Grafana dashboard link to PR (#46410) by @ydshieh in [#46410]
[docs] Romanian translation of fast_tokenizers.md, custom_tokenizers.md, tokenizer_summary.md, image_processors.md and video_processors.md. (#46356) by @filipinescu in [#46356]
Clean up new models after release (#46092) by @zucchini-nlp in [#46092]

Significant community contributions

The following contributors have made significant changes to the library over the last release:

@ArthurZucker
- Add deepseek 3.2 exp (#41251)
@gante
- GPU go brr (#46540)
@merveenoyan
- revamp cv docs and fix rf-detr (#46219)
@sgerrard
- Quantization for small models (#46449)
@MaciejBalaNV
- Added cosmos3 model (#46146)
@J3r3myPerera
- Fix/video classification pipeline video processor (#46256)
@filipinescu
- [docs] Romanian translation of pipeline_tutorial.md, pipeline_gradio.md, pipeline_webserver.md and add_new_pipeline.md. (#46388)
- [docs] Romanian translation of fast_tokenizers.md, custom_tokenizers.md, tokenizer_summary.md, image_processors.md and video_processors.md. (#46356)

`v5.10.2`: Patch release v5.10.2

Compare Source

Patch release v5.10.2

There was a big bug in the model conversion of models related to clip, this affected models like sam3 and others. Please make sure to update 🙏

Fix conversion for clip models by @zucchini-nlp (#46406)

Full Changelog: huggingface/transformers@v5.10.1...v5.10.2

`v5.10.1`

Compare Source

Release v5.10.1

v5.10.0 was yanked as we publish on a corrupted branch. Sorry everyone, this happens when we rush a release!!!

New Model additions

Gemma4 unified+ Gemma4 MTP

Gemma 4 12B Unified is an encoder-free multimodal model with pretrained and instruction-tuned variants. Unlike standard Gemma 4, which uses dedicated encoder towers, Gemma 4 12B Unified projects raw inputs directly into the language model's embedding space through lightweight linear pipelines. This results in a simpler architecture while maintaining strong multimodal performance.

Key differences from standard Gemma 4:

No Vision Tower: Raw pixel patches are projected directly into LM space via a Dense + LayerNorm pipeline with factorized 2D positional embeddings, replacing the vision encoder.
No Audio Tower: Raw 16 kHz waveform samples are chunked into fixed-length frames and projected through a simple RMSNorm → Linear pipeline, replacing the mel spectrogram + Conformer encoder.
Shared Multimodal Pipeline: Both vision and audio use the same Gemma4UnifiedMultimodalEmbedder (RMSNorm → Linear) for the final projection to text hidden space.

You can find the original Gemma 4 12B Unified checkpoints under the Gemma 4 release.

who needs encoders? (#46385) by @douglas-reid @sgerrard @vasqu @molbap

Sapiens2

Sapiens2 is a family of high-resolution vision transformers pretrained on ~1 billion curated human images, designed for human-centric computer vision tasks including pose estimation, body-part segmentation, surface normal estimation, and pointmap estimation. The models scale from 0.4B to 5B parameters and train at native 1K resolution, with hierarchical 4K variants for extended spatial reasoning. Sapiens2 achieves substantial improvements over its predecessor with +4 mAP in pose estimation, +24.3 mIoU in body-part segmentation, and 45.6% error reduction in normal estimation.

Links: Documentation | Paper

Add Sapiens2 Model (#45919) by @guarin in #45919

DeepSeek-OCR-2

DeepSeek-OCR-2 is an OCR-specialized vision-language model built on a distinctive architecture that combines a SAM ViT-B vision encoder with a Qwen2 hybrid attention encoder, connected through an MLP projector to a DeepSeek-V2 Mixture-of-Experts (MoE) language model. The model features a hybrid attention mechanism that applies bidirectional attention over image tokens and causal attention over query tokens, enabling efficient and accurate document understanding. It supports both plain OCR tasks and grounding capabilities with coordinate-aware output for document conversion to markdown format.

Links: Documentation

Add Deepseek-OCR-2 model (#45075) by @thisisiron in #45075

Mellum

Mellum is a code-focused Mixture-of-Experts language model developed by JetBrains. It is derived from the Qwen3-MoE architecture with per-layer-type RoPE and interleaved sliding window attention. The model has 12B total parameters with 2.5B active parameters per token, using 64 routed experts with 8 activated per token across 28 layers.

Links: Documentation

feat: Add support for JetBrains' Mellum v2 code generation model (#46112) by @shadeMe in #46112

Breaking changes

The Gemma4 vision pooler now casts inputs to float32 before scaling to prevent float16 overflow (inf saturation) with large checkpoints, which may cause minor numerical differences in outputs for users running Gemma-4 vision models in float16.

🚨 Fix float16 overflow in Gemma4 vision pooler (#46277) by @Bluear7878

Audio Language Models (ALMs) now have a dedicated base model class without a language modeling head, aligning them with the design of Vision Language Models (VLMs); users relying on the previous model class structure should update their code to use the new base model class where appropriate.

🚨 [ALM] Add base model without head (#45534) by @eustlb

Parallelization

This release includes numerous bug fixes for model parallelism across multiple models (Gemma4, AltCLIP, ChineseClip, Blip-2, Whisper, Ovis2, Moshi) and parallel execution strategies, including fixes for tensor parallelism (TP), expert parallelism (EP), beam search under model parallel settings, and loss over-counting under TP/EP configurations. The continuous batching manager was also reworked for clearer control flow and improved TP race condition handling, and FSDP initialization via from_pretrained was introduced.

Fix dsv4 dequant + tp/ep (#46378) by @IlyasMoutawwakil in [#46378]
[CB] [Major] Rework manager to have clearer control flow + handle TP (#46070) by @remi-or in [#46070]
fix series of bugs for model parallel beam search (#46280) by @kaixuanliu in [#46280]
Fix model parallel issue for altclip model and ChineseClip model (#45487) by @kaixuanliu in [#45487]
Model parallel fix (#46230) by @kaixuanliu in [#46230]
[Revert] FSDP+Dtensor refactor related changes (#46246) by @vasqu in [#46246]
Fix model parallel bugs for Gemma4 (#45817) by @kaixuanliu in [#45817]
init FSDP through from_pretrained (#46102) by @3outeille in [#46102]
fix model parallel device mismatch issue in create_bidirectional_mask (#46221) by @kaixuanliu in [#46221]
Trainer.compute_loss: fix loss over-counting under TP and EP-as-TP (#45994) by @AmineDiro in [#45994]
Fix caching allocator warmup byte estimation for EP model loading (#46149) by @sywangyi in [#46149]

Cache

Fixed a regression in encoder-decoder cache initialization where the decoder config was incorrectly applied to the cross-attention cache, and resolved a RuntimeError caused by buffer size limits when warming up the cache on MPS devices. Additional test infrastructure improvements were made to support read-only cache environments used in CI.

fix: cache warmup RuntimeError on mps (#46239) by @McPatate in [#46239]
Make more tests work with read-only cache (#46299) by @ydshieh in [#46299]
Update a test to avoid writing to the default xet cache (#46250) by @ydshieh in [#46250]
Fix a regression in encoder-decoder generation cache initialization (#46111) by @kaixuanliu in [#46111]

Quantization

Added support for DeepGEMM BF16, mixed FP8/FP4, and MegaMoE quantization via a grouped linear refactor, while fixing two bugs: an FP8 MoE reverse substring issue affecting DSv4 initialization, and a BitsAndBytes 4-bit/8-bit quantization bug that silently dropped chunked tensors from one-to-many weight converters.

DeepGEMM BF16 + mixed FP8/FP4 + MegaMoE + refactor (#45634) by @IlyasMoutawwakil in [#45634]
Fix fp8 moe reverse substring (#46265) by @ArthurZucker in [#46265]
Fix bnb 4bit/8bit quantization drop chunked tensors bug (#46210) by @kaixuanliu in [#46210]

Bugfixes and improvements

Fix wrong changes produced by style/repo. check bot (#46371) by @ydshieh in [#46371]
Fix path traversal when saving Bark voice preset embeddings (#46237) by @LinZiyuu in [#46237]
Pass library_name/version to Hub calls via a shared HfApi (#46318) by @Wauplin in [#46318]
docs: update ACL Anthology URL in CITATION.cff (#46352) by @irfaan101 in [#46352]
[docs] contributing (#45465) by @stevhliu in [#45465]
[docs] Romanian translation of contributing.md, modular_transformers.md, multimodal_processing.md, add_vision_processing_components.md, add_audio_processing_components.md, modeling_rules.md, model_output_tracing.md, auto_docstring.md, testing.md, pr_checks.md and add_new_model.md . (#46345) by @filipinescu in [#46345]
[docs] xpu continuous batching (#46334) by @stevhliu in [#46334]
Fix incorrect attribute mapping relationships in GLM MoE DSA Config (#46338) by @Dovis01 in [#46338]
Fix grammar typos in Whisper documentation (#46336) by @calliec-1223 in [#46336]
[docs] update num_items_in_batch for causal LMs (#46335) by @stevhliu in [#46335]
Update compressed tensors minimum version (#46342) by @SunMarc in [#46342]
Fix _is_package_available reporting available without a version (#46125) by @blipbyte in [#46125]
remove sec (#46346) by @ydshieh in [#46346]
fix: include transitive relative imports when loading from local directory (#46022) by @trducng in [#46022]
perf(feature_extraction_sequence): skip re-splitting already-batched numpy arrays in pad() (#46329) by @Anai-Guo in [#46329]
[Zamba] Support attn_implementation dispatch (#46317) by @YangKai0616 in [#46317]
Fix TestAppRoutes test failures caused by deprecated asyncio.get_event_loop() on Python 3.10+ (#46340) by @ydshieh in [#46340]
[Qwen3VL] Fix video token placeholder: use self.video_token instead of hardcoded "<|placeholder|>" (#46296) by @kpal002 in [#46296]
chore(linter): fixes for rule 16 (#46023) by @tarekziade in [#46023]
[docs] Romanian translation of weightconverter.md, models.md, custom_models.md, monkey_patching.md, fusion_mapping.md, how_to_hack_models.md, model_sharing.md and serialization.md. (#46309) by @filipinescu in [#46309]
Normalize CUDA OOM errors when comparing commit failures in check_bad_commit (#46322) by @ydshieh in [#46322]
Fix unhandled exception noise from background safetensors conversion thread (#45752) by @dhruv7477 in [#45752]
Add Expectations for pipeline token classification tests (#46151) by @kaixuanliu in [#46151]
[docs] fix auto-add release dates (#46283) by @zucchini-nlp in [#46283]
Separate pip command syntax for notebook and CLI tabs in Quickstart (#46243) by @pvelayudhan in [#46243]
Romanian translation of README.md, index.md, installation.md, _config.py and quicktour.md. (#46166) by @filipinescu in [#46166]
Fall back to flat kwarg when modality dict is passed without it (#46195) by @Ace3Z in [#46195]
Fix load_adapter OOM caused by full-model warmup sizing (#46145) by @Yooniel in [#46145]
Replace assert with raise ImportError for optuna/ray dependency checks (#46263) by @SebTardif in [#46263]
chore(linter): respect TRF017 modeling rule (#46260) by @tarekziade in [#46260]
Delete dead code in qwen-vl series (#45827) by @zucchini-nlp in [#45827]
qa: fix ty caching and align CI with local run (#46278) by @tarekziade in [#46278]
Guard DeviceMesh import in continuous batching (#46205) by @danyalahmed1995 in [#46205]
Processor compatibility with vLLM (#46258) by @zucchini-nlp in [#46258]
Fix PR CI workflow cancellation condition (#46276) by @ydshieh in [#46276]
[fix] toctree (#46106) by @stevhliu in [#46106]
add more generic support for distributed trainer tests (#46109) by @kaixuanliu in [#46109]
add XPU Expectations for florence2 and lfm2_vl model test (#46275) by @kaixuanliu in [#46275]
Fix StaticCache building an empty layer list when num_kv_shared_layers == 0 (#46235) by @tengomucho in [#46235]
Fix inverted assertion in remove_handler (#46227) by @SebTardif in [#46227]
[ShieldGemma2] Support attn_implementation dispatch (#46069) by @YangKai0616 in [#46069]
[Gemma4] Replace one-hot matmul with F.embedding in position embeddings (#46176) by @Sriniketh24 in [#46176]
fix: kosmos2.5: properly expand embeddings table (#45835) by @nunq in [#45835]
find pytest launch error in torch 2.13.0.dev20260526 (#46252) by @sywangyi in [#46252]
[Test][Kosmos2.5] Add XPU expectations for integration tests (#46135) by @YangKai0616 in [#46135]
Support FA2 flash_attn_with_kvcache for XPU continuous batching (#46028) by @YangKai0616 in [#46028]
[Configs] Fix layer type validation to include its mlp counterpart (#46220) by @vasqu in [#46220]
Fix num_items_in_batch over-counting for causal LM losses (#46204) by @qgallouedec in [#46204]
RF-DETR doc fixes (#46244) by @merveenoyan in [#46244]
Use main instead of commit SHA for now (#46241) by @ydshieh in [#46241]
Enable push event (to main) for PR CI workflow (#46240) by @ydshieh in [#46240]
fix(hrm_text): Add XPU Expectations for tests (#46214) by @kaixuanliu in [#46214]
[deepseek_v4] keep hc_head / sinks / position_bias in fp32 (#46198) by @ArthurZucker in [#46198]
Fix FSDP2 and distributed checkpointing imports for older PyTorch versions (#46141) by @ryota-komatsu in [#46141]
Fix Gemma4 Array Mask Indexing (#46203) by @petecao in [#46203]
utils: handle flash_attn missing from importlib packages_distributions without crashing (#45524) by @SAY-5 in [#45524]
[AMD CI] revert AMD mi325 hf-workflows ref from SHA back to @main (#46213) by @Abdennacer-Badaoui in [#46213]
[GLM-4.6V] Update with GLM-GA Processor (#46184) by @zRzRzRzRzRzRzR in [#46184]
update xpu expectation for falcon mamba (#46086) by @sywangyi in [#46086]
chore: enable Dependabot weekly GitHub Actions bumps (#46157) by @hf-dependantbot-rollout[bot] in [#46157]
Fix Gemma4 use_bidirectional_attention="all" mask behavior (#46079) by @oliverholworthy in [#46079]
Fix loading with only 1 device or distributed config (#46197) by @Cyrilvallez in [#46197]
Fix TypeError on list-typed ignore_keys_at_rope_validation in RoPE config (#46142) by @Charly21r in [#46142]
Support XPU autocast dtype fallback for FlashAttention (#46199) by @YangKai0616 in [#46199]
Fix path traversal when saving named chat templates (#46191) by @LinZiyuu in [#46191]
Fix is_last off-by-one in MaskGenerationPipeline for partial batches (#46136) by @J3r3myPerera in [#46136]
Fix wrong variable in chec

✂ Note

PR body was truncated to here.

Configuration

📅 Schedule: (UTC)

Branch creation
- At any time (no schedule defined)
Automerge
- At any time (no schedule defined)

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

👻 Immortal: This PR will be recreated if closed unmerged. Get config help if that's undesired.

If you want to rebase/retry this PR, check this box

This PR has been generated by Mend Renovate.

| datasource | package | from | to | | ---------- | ------------ | ------ | ------ | | pypi | psutil | 6.1.1 | 7.2.2 | | pypi | transformers | 4.57.6 | 5.11.0 |

dreadnode-renovate-bot Bot added the type/digest Dependency digest updates label Feb 24, 2026

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch 3 times, most recently from 07525d6 to 3ac3e72 Compare March 1, 2026 00:53

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from 3ac3e72 to 4daa5d1 Compare March 8, 2026 00:48

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch 2 times, most recently from 3e0d62f to 4b95150 Compare April 1, 2026 00:57

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from 4b95150 to 40a28f1 Compare April 8, 2026 00:52

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch 2 times, most recently from 85f7052 to c4f4579 Compare April 19, 2026 00:59

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from c4f4579 to 37b26b9 Compare April 26, 2026 01:01

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from 37b26b9 to ca4e25e Compare May 3, 2026 01:07

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from ca4e25e to b5496fe Compare May 10, 2026 01:09

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from b5496fe to a845574 Compare May 17, 2026 01:11

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from a845574 to f7682ea Compare May 24, 2026 01:12

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from f7682ea to cfc6d09 Compare June 7, 2026 01:19

chore(deps): update loader dependencies major

e691441

| datasource | package | from | to | | ---------- | ------------ | ------ | ------ | | pypi | psutil | 6.1.1 | 7.2.2 | | pypi | transformers | 4.57.6 | 5.11.0 |

dreadnode-renovate-bot Bot force-pushed the renovate/major-loader-deps-major branch from cfc6d09 to e691441 Compare June 14, 2026 01:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(deps): update loader dependencies major (major)#194

chore(deps): update loader dependencies major (major)#194
dreadnode-renovate-bot[bot] wants to merge 1 commit into
mainfrom
renovate/major-loader-deps-major

dreadnode-renovate-bot Bot commented Feb 24, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

Conversation

dreadnode-renovate-bot Bot commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Release Notes

Release v5.11.0

New Model additions

DiffusionGemma

DeepSeek-V3.2

Kernels

Parallelization

Bugfixes and improvements

Significant community contributions

v5.10.2: Patch release v5.10.2

Patch release v5.10.2

Release v5.10.1

New Model additions

Gemma4 unified+ Gemma4 MTP

Sapiens2

DeepSeek-OCR-2

Mellum

Breaking changes

Parallelization

Cache

Quantization

Bugfixes and improvements

Configuration

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

dreadnode-renovate-bot Bot commented Feb 24, 2026 •

edited

Loading

`v5.10.2`: Patch release v5.10.2