From 406242eec8221448dceef0c627c5c7a0eb50b28d Mon Sep 17 00:00:00 2001
From: Simon Iribarren <simon.ig13@gmail.com>
Date: Mon, 8 Jun 2026 22:56:01 +0200
Subject: [PATCH 1/3] docs: add QVAC provider setup guide

Add a setup page for QVAC, a local-first, peer-to-peer AI runtime that
exposes an OpenAI-compatible server via `qvac serve openai`, and connects
to Roo Code through the OpenAI Compatible provider. Register it in the
providers index so it appears in the sidebar and provider table.
---
 docs/providers/index.json |  6 +++
 docs/providers/qvac.md    | 91 +++++++++++++++++++++++++++++++++++++++
 2 files changed, 97 insertions(+)
 create mode 100644 docs/providers/qvac.md
diff --git a/docs/providers/index.json b/docs/providers/index.json
index e2fee275..c3a581b3 100644
--- a/docs/providers/index.json
+++ b/docs/providers/index.json
@@ -84,6 +84,12 @@
       "extension": true,
       "cloud": true
     },
+    {
+      "id": "providers/qvac",
+      "title": "QVAC",
+      "extension": true,
+      "cloud": false
+    },
     {
       "id": "providers/qwen-code",
       "title": "Qwen Code CLI",
diff --git a/docs/providers/qvac.md b/docs/providers/qvac.md
new file mode 100644
index 00000000..01da347a
--- /dev/null
+++ b/docs/providers/qvac.md
@@ -0,0 +1,91 @@
+---
+sidebar_label: QVAC
+description: Run local-first, peer-to-peer AI models with QVAC and connect them to Roo Code through its OpenAI-compatible server.
+keywords:
+  - QVAC
+  - local models
+  - Roo Code
+  - OpenAI compatible
+  - peer-to-peer AI
+  - local-first
+  - offline AI
+  - gpt-oss
+  - tool calling
+---
+
+# Using QVAC With Roo Code
+
+[QVAC](https://qvac.com) is an open-source runtime for local-first, peer-to-peer AI. It can expose your local models through an OpenAI-compatible HTTP server, letting you connect them to Roo Code using the **OpenAI Compatible** provider.
+
+**Website:** [https://qvac.com](https://qvac.com)
+
+---
+
+## Setting Up QVAC
+
+1.  **Install the QVAC CLI:**
+
+    ```bash
+    npm i -g @qvac/cli
+    ```
+
+2.  **Define a model alias:** Create a `qvac.config.json` that maps a serve alias to a model. The alias you choose here is the model id you will enter in Roo Code.
+
+    ```json
+    {
+      "serve": {
+        "models": {
+          "gpt-oss-20b": {
+            "model": "GPT_OSS_20B_INST_Q4_K_M",
+            "preload": true,
+            "config": {
+              "ctx_size": 32768,
+              "tools": true
+            }
+          }
+        }
+      }
+    }
+    ```
+
+    Two settings matter when using QVAC as a coding agent:
+    *   **`ctx_size`** defaults to `1024`, which is far too small for agent prompts. Set it explicitly (e.g. `32768`).
+    *   **`tools: true`** enables function/tool calling. Roo Code relies on native tool calling, so without this the model returns plain text instead of tool calls.
+
+3.  **Start the server:**
+
+    ```bash
+    qvac serve openai
+    ```
+
+    This starts an OpenAI-compatible REST API on port `11434` by default (use `--port` to change it). Your base URL is `http://127.0.0.1:11434/v1`.
+
+---
+
+## Configuration in Roo Code
+
+1.  **Open Roo Code Settings:** Click the gear icon (<Codicon name="gear" />) in the Roo Code panel.
+2.  **Select Provider:** Choose "OpenAI Compatible" from the "API Provider" dropdown.
+3.  **Enter Base URL:** Use `http://127.0.0.1:11434/v1` (or the port you set with `--port`).
+4.  **Enter API Key:** QVAC's server does not validate the key, but the field is required—enter any non-empty string (e.g. `qvac`).
+5.  **Enter Model ID:** Use the serve alias from your `qvac.config.json` (e.g. `gpt-oss-20b`).
+
+---
+
+## Tips and Notes
+
+*   **Use a capable, agent-tuned model.** Tool-calling quality is bounded by the model you run. Small models often fail to invoke tools reliably; a larger agent-tuned model such as `gpt-oss-20b` is a good local default.
+*   **Set the context window explicitly.** The QVAC LLM `ctx_size` default of `1024` is too small for Roo Code's prompts. Set it to something like `32768` in `qvac.config.json`.
+*   **Enable tools.** Roo Code uses native tool calling exclusively. Set `"tools": true` in the model config or the model will respond with text instead of tool calls.
+*   **Reasoning models.** For reasoning-tuned models such as Qwen3, set `"reasoning_budget": 0` in the model config unless you specifically want extended reasoning.
+*   **Preload for a faster first response.** Setting `"preload": true` loads the model when the server starts, avoiding a cold start on your first request.
+*   **Resource requirements.** Running large language models locally is resource-intensive. Make sure your machine can handle the model and context size you choose.
+
+---
+
+## Troubleshooting
+
+*   **"Model Not Found":** The model id in Roo Code must exactly match a serve alias defined in `qvac.config.json`.
+*   **Model replies with text instead of using tools:** Add `"tools": true` to the model's `config` in `qvac.config.json` and restart the server.
+*   **Context overflow or truncated prompts:** Increase `ctx_size` (the default `1024` is too small for agent prompts).
+*   **Connection errors:** Confirm `qvac serve openai` is running and that the Base URL and port match (`http://127.0.0.1:11434/v1` by default).

From c162b4b5da80e53425ae98466eac8374e2f2cbe2 Mon Sep 17 00:00:00 2001
From: Simon Iribarren <simon.ig13@gmail.com>
Date: Tue, 9 Jun 2026 08:53:27 +0200
Subject: [PATCH 2/3] docs: use canonical qvac.tether.io URL

---
 docs/providers/qvac.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/providers/qvac.md b/docs/providers/qvac.md
index 01da347a..32058c8b 100644
--- a/docs/providers/qvac.md
+++ b/docs/providers/qvac.md
@@ -15,9 +15,9 @@ keywords:
 
 # Using QVAC With Roo Code
 
-[QVAC](https://qvac.com) is an open-source runtime for local-first, peer-to-peer AI. It can expose your local models through an OpenAI-compatible HTTP server, letting you connect them to Roo Code using the **OpenAI Compatible** provider.
+[QVAC](https://qvac.tether.io) is an open-source runtime for local-first, peer-to-peer AI. It can expose your local models through an OpenAI-compatible HTTP server, letting you connect them to Roo Code using the **OpenAI Compatible** provider.
 
-**Website:** [https://qvac.com](https://qvac.com)
+**Website:** [https://qvac.tether.io](https://qvac.tether.io)
 
 ---
 

From 803b6c8be30f1cdc7987af55624317b17b1c6de4 Mon Sep 17 00:00:00 2001
From: Simon Iribarren <simon.ig13@gmail.com>
Date: Tue, 9 Jun 2026 11:17:07 +0200
Subject: [PATCH 3/3] docs(qvac): reference Qwen3.5 in reasoning-model note

---
 docs/providers/qvac.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/providers/qvac.md b/docs/providers/qvac.md
index 32058c8b..b95f2d37 100644
--- a/docs/providers/qvac.md
+++ b/docs/providers/qvac.md
@@ -77,7 +77,7 @@ keywords:
 *   **Use a capable, agent-tuned model.** Tool-calling quality is bounded by the model you run. Small models often fail to invoke tools reliably; a larger agent-tuned model such as `gpt-oss-20b` is a good local default.
 *   **Set the context window explicitly.** The QVAC LLM `ctx_size` default of `1024` is too small for Roo Code's prompts. Set it to something like `32768` in `qvac.config.json`.
 *   **Enable tools.** Roo Code uses native tool calling exclusively. Set `"tools": true` in the model config or the model will respond with text instead of tool calls.
-*   **Reasoning models.** For reasoning-tuned models such as Qwen3, set `"reasoning_budget": 0` in the model config unless you specifically want extended reasoning.
+*   **Reasoning models.** For reasoning-tuned models such as Qwen3.5, set `"reasoning_budget": 0` in the model config unless you specifically want extended reasoning.
 *   **Preload for a faster first response.** Setting `"preload": true` loads the model when the server starts, avoiding a cold start on your first request.
 *   **Resource requirements.** Running large language models locally is resource-intensive. Make sure your machine can handle the model and context size you choose.