feat(auto): add Kilo Auto Balanced model by lambertjosh · Pull Request #1031 · Kilo-Org/cloud

lambertjosh · 2026-03-11T15:52:23Z

Summary

Adds a new kilo-auto/balanced auto model that mirrors Frontier's mode-based routing structure but uses cheaper underlying models:

Kimi K2.5 (moonshotai/kimi-k2.5) for heavy modes: plan, general, architect, orchestrator, ask, debug (where Frontier uses Opus)
Minimax M2.5 (minimax/minimax-m2.5) for implementation modes: build, explore, code (where Frontier uses Sonnet)

Context length and max completion tokens are derived from the minimum of both models (204,800 / 65,536). Pricing set at $2/$8 per M tokens. Added to preferredModels between Frontier and Free.

Verification

pnpm typecheck — passes (no new errors introduced; all existing errors are in unrelated files)

Visual Changes

N/A

Reviewer Notes

Pricing (prompt_price / completion_price) is placeholder — may need adjustment based on actual upstream costs.
supports_images is false since Minimax M2.5 lacks vision support, even though Kimi K2.5 does support it.
opencode_settings is undefined since neither underlying model fits the existing families (claude, gpt, gemini, llama, mistral).
Renamed internal constants (CODE_MODEL → FRONTIER_CODE_MODEL, MODE_TO_MODEL → FRONTIER_MODE_TO_MODEL) for clarity now that there are two routing tables.

Routes to Kimi K2.5 for heavy modes (plan, general, architect, orchestrator, ask, debug) and Minimax M2.5 for implementation modes (build, explore, code), offering a lower-cost alternative to Frontier.

src/lib/kilo-auto-model.ts

kilo-code-bot · 2026-03-11T15:55:47Z

Code Review Summary

Status: 2 Issues Found | Recommendation: Address before merge

Overview

Severity	Count
CRITICAL	0
WARNING	2
SUGGESTION	0

Fix these issues in Kilo Cloud

Issue Details (click to expand)

WARNING

File	Line	Issue
`src/lib/kilo-auto-model.ts`	59	Advertised `max_completion_tokens` exceeds the `moonshotai/kimi-k2.5` limit used by several Balanced modes, which can lead to provider-side 400s for otherwise valid requests.
`src/lib/kilo-auto-model.ts`	149	Balanced routes to `minimax/minimax-m2.5:free` when the free variant is enabled, so auth, rate limiting, and billing all treat this paid auto model as free.

Other Observations (not in diff)

N/A

Files Reviewed (3 files)

src/lib/kilo-auto-model.ts - 2 issues
src/lib/models.ts - 0 issues
src/tests/openrouter-models-sorting.approved.json - 0 issues

_{Reviewed by gpt-5.4-20260305 · 448,089 tokens}

The deprecatedAutoModels function was producing entries with undefined IDs for models without legacy mappings (like the new balanced model), causing the /api/openrouter/models endpoint to return 500.

lambertjosh · 2026-03-11T22:07:50Z

Demo videos:

Legacy extension: https://www.loom.com/share/de082b1128e84382b1a6e8a3d6f9838e
New CLI: https://www.loom.com/share/2666512d76404225afc2beb6be2c8f8d

src/tests/openrouter-models-sorting.approved.json

src/lib/kilo-auto-model.ts

lambertjosh · 2026-03-11T22:12:53Z

Also validated that the modes do hit the correct models on the backend and used both. Flipping between modes in the same session also appears to work correctly.

Minimax M2.5 needs search_and_replace only (no apply_diff/edit_file). Apply this restriction globally since the extension can't know which underlying model handles each request.

lambertjosh · 2026-03-11T22:25:54Z

@chrarnoldus - one item I wasn't sure about, there are different roo settings for minimax:m2.5 and Kimi:k2.5. I applied the minimax ones as I wanted to be restrictive, but would love your guidance here.

The nvidia entry was accidentally dropped during the merge. Restoring it fixes the preferredIndex values in the approval test.

* Update Kimi prices * Use edit_file instead of apply_diff * Use free MiniMax instead of paid * Remove unsupported parameters * Simplify deprecated mapping without flatMap

chrarnoldus · 2026-03-12T08:40:48Z

I applied the minimax ones as I wanted to be restrictive, but would love your guidance here.

edit_file is simpler, so I changed it to that one.

chrarnoldus · 2026-03-12T08:41:46Z

The PR originally used paid MiniMax, was that intentional? I changed it to free, but we can always change it back.

edit: roadmap says free

src/lib/kilo-auto-model.ts

feat(auto): add Kilo Auto Balanced model

f0f9afd

Routes to Kimi K2.5 for heavy modes (plan, general, architect, orchestrator, ask, debug) and Minimax M2.5 for implementation modes (build, explore, code), offering a lower-cost alternative to Frontier.

kilo-code-bot bot reviewed Mar 11, 2026

View reviewed changes

src/lib/kilo-auto-model.ts Show resolved Hide resolved

fix(auto): filter models without legacy mapping and update test snapshot

2ea7079

The deprecatedAutoModels function was producing entries with undefined IDs for models without legacy mappings (like the new balanced model), causing the /api/openrouter/models endpoint to return 500.

lambertjosh requested a review from chrarnoldus March 11, 2026 22:07

lambertjosh commented Mar 11, 2026

View reviewed changes

src/tests/openrouter-models-sorting.approved.json Outdated Show resolved Hide resolved

Apply suggestion from @lambertjosh

1a11521

lambertjosh commented Mar 11, 2026

View reviewed changes

src/lib/kilo-auto-model.ts Outdated Show resolved Hide resolved

Apply suggestion from @lambertjosh

18cfc8a

fix(auto): add tool restrictions to balanced model

3baf980

Minimax M2.5 needs search_and_replace only (no apply_diff/edit_file). Apply this restriction globally since the extension can't know which underlying model handles each request.

lambertjosh and others added 4 commits March 11, 2026 18:29

fix: restore nvidia nemotron in preferredModels and update snapshot

5ee2cbc

The nvidia entry was accidentally dropped during the merge. Restoring it fixes the preferredIndex values in the approval test.

merge: resolve conflict with main, keep bumped preferredIndex values

4f8149e

* Remove reference to disabled free model

cf60cff

* Update Kimi prices * Use edit_file instead of apply_diff * Use free MiniMax instead of paid * Remove unsupported parameters * Simplify deprecated mapping without flatMap

Approvals

e33985d

chrarnoldus approved these changes Mar 12, 2026

View reviewed changes

kilo-code-bot bot reviewed Mar 12, 2026

View reviewed changes

src/lib/kilo-auto-model.ts Show resolved Hide resolved

src/lib/kilo-auto-model.ts Show resolved Hide resolved

chrarnoldus merged commit 65074d9 into main Mar 12, 2026
18 checks passed

chrarnoldus deleted the feat/kilo-auto-balanced branch March 12, 2026 09:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(auto): add Kilo Auto Balanced model#1031

feat(auto): add Kilo Auto Balanced model#1031
chrarnoldus merged 9 commits intomainfrom
feat/kilo-auto-balanced

lambertjosh commented Mar 11, 2026

Uh oh!

Uh oh!

kilo-code-bot bot commented Mar 11, 2026 •

edited

Loading

WARNING

Uh oh!

lambertjosh commented Mar 11, 2026

Uh oh!

Uh oh!

Uh oh!

lambertjosh commented Mar 11, 2026

Uh oh!

lambertjosh commented Mar 11, 2026

Uh oh!

chrarnoldus commented Mar 12, 2026

Uh oh!

chrarnoldus commented Mar 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

lambertjosh commented Mar 11, 2026

Summary

Verification

Visual Changes

Reviewer Notes

Uh oh!

Uh oh!

kilo-code-bot bot commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review Summary

Overview

WARNING

Uh oh!

lambertjosh commented Mar 11, 2026

Uh oh!

Uh oh!

Uh oh!

lambertjosh commented Mar 11, 2026

Uh oh!

lambertjosh commented Mar 11, 2026

Uh oh!

chrarnoldus commented Mar 12, 2026

Uh oh!

chrarnoldus commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kilo-code-bot bot commented Mar 11, 2026 •

edited

Loading

chrarnoldus commented Mar 12, 2026 •

edited

Loading