
Conversation

@evanpelle
Collaborator

If this PR fixes an issue, link it below. If not, delete these two lines.
Resolves #(issue number)

Description:

Describe the PR.

Please complete the following:

  • I have added screenshots for all UI updates
  • I process any text displayed to the user through translateText() and I've added it to the en.json file
  • I have added relevant tests to the test directory
  • I confirm I have thoroughly tested these changes and take full responsibility for any bugs introduced

Please put your Discord username so you can be contacted if a bug or regression is found:

DISCORD_USERNAME

@coderabbitai
Contributor

coderabbitai bot commented Jan 20, 2026

Walkthrough

Moves public-lobby handling to a distributed worker model: Master coordinates workers and schedules games; each worker runs a WorkerLobbyService that serves /wN-prefixed lobby WebSockets. Client selects a worker path and syncs times via serverTime in lobby broadcasts.
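As a rough illustration of that client flow, here is a minimal TypeScript sketch of picking a worker path and computing a serverTime offset; the names (pickWorkerPath, the PublicGames fields) are illustrative, not the PR's actual API.

// Hypothetical payload shape based on the summary above.
type PublicGames = {
  serverTime: number;
  games: { gameID: string; startsAt: number }[];
};

function pickWorkerPath(numWorkers: number): string {
  // Assumes numWorkers >= 1; see the review comments below about guarding 0/NaN.
  return `/w${Math.floor(Math.random() * numWorkers)}`;
}

function connectToLobbies(
  numWorkers: number,
  onUpdate: (games: PublicGames, offsetMs: number) => void,
): WebSocket {
  const path = pickWorkerPath(numWorkers);
  const proto = location.protocol === "https:" ? "wss:" : "ws:";
  const ws = new WebSocket(`${proto}//${location.host}${path}/lobbies`);
  ws.onmessage = (event) => {
    const payload = JSON.parse(event.data) as PublicGames;
    // serverTime lets the client estimate clock skew so startsAt can be rendered correctly.
    const offsetMs = payload.serverTime - Date.now();
    onUpdate(payload, offsetMs);
  };
  return ws;
}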

Changes

Cohort / File(s) Summary
Client lobby socket & UI
src/client/LobbySocket.ts, src/client/PublicLobby.ts, tests/client/LobbySocket.test.ts
Client now fetches server config, picks a worker path (/wN), and connects via WebSocket using that prefix. PublicLobby consumes PublicGames (includes serverTime and startsAt) and computes a client time offset. Tests updated to the new payload shape.
Worker-side lobby service & routing
src/server/Worker.ts, src/server/WorkerLobbyService.ts
Adds WorkerLobbyService that handles /wN/lobbies upgrades, maintains connected lobby clients, broadcasts lobby lists, and IPCs with Master. Worker routing strips/validates /wN prefixes and delegates other upgrades to the game WebSocket server.
Master coordination & scheduling
src/server/Master.ts, src/server/MasterLobbyService.ts
Replaces in-process lobby broadcast with MasterLobbyService. Tracks worker readiness, aggregates per-worker lobbies, periodically schedules new games when lobby count is low, and sends IPC createGame/lobbiesBroadcast messages.
IPC schemas
src/server/IPCBridge.ts
Introduces typed IPC message schemas and TypeScript types for worker↔master messages (workerReady, lobbyList, lobbiesBroadcast, createGame); a rough sketch of these message shapes appears just after this table.
Schemas & game info
src/core/Schemas.ts, src/server/GameServer.ts, src/server/GameManager.ts
Public lobby type changed: msUntilStart → startsAt, plus serverTime. GameManager exposes publicLobbies(); GameServer.gameInfo() returns startsAt and serverTime.
Privilege & logging
src/server/PrivilegeRefresher.ts
Privilege refresher now accepts env and suppresses certain error logs in Dev.
HTTP fallback & nginx
src/client/LobbySocket.ts, nginx.conf
Removed HTTP polling fallback and server endpoint caching for /api/public_lobbies; nginx location block for that endpoint removed. Client no longer falls back to HTTP polling on repeated WS failures.
Misc small changes
src/server/GamePreviewBuilder.ts, other minor edits
Minor data-source change for active player count and small refactors across lobby handling code.
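The IPC schemas row above is the glue between these pieces; a rough sketch of what those worker↔master message shapes might look like (only the message names come from the summary, the field shapes are assumptions):

// Hypothetical message shapes; only the type names are taken from the summary above.
interface LobbyEntry {
  gameID: string;
  numClients: number;
  startsAt: number;
}

type WorkerToMaster =
  | { type: "workerReady"; workerId: number }
  | { type: "lobbyList"; workerId: number; lobbies: LobbyEntry[] };

type MasterToWorker =
  | { type: "createGame"; gameID: string; gameConfig: unknown }
  | { type: "lobbiesBroadcast"; serverTime: number; lobbies: LobbyEntry[] };

// A worker can narrow incoming messages on the `type` tag:
function handleMasterMessage(msg: MasterToWorker): void {
  switch (msg.type) {
    case "createGame":
      // create the game locally via the worker's GameManager
      break;
    case "lobbiesBroadcast":
      // forward the aggregated lobby list to connected lobby sockets
      break;
  }
}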

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant Master
    participant Worker
    participant GameManager

    Client->>Master: GET /config (worker count)
    Master-->>Client: { numWorkers }
    Client->>Client: pick random worker path /wN
    Client->>Worker: WS connect to /wN/lobbies
    Worker->>Worker: validate /wN, route upgrade to lobby WSS
    Worker->>GameManager: publicLobbies()
    GameManager-->>Worker: [GameServer...]
    Worker-->>Client: send PublicGames { serverTime, games[{startsAt,...}] }
    Client->>Client: compute serverTimeOffset and display startsAt adjusted
sequenceDiagram
    participant Worker1
    participant Worker2
    participant Master
    participant GameManager

    Worker1->>Master: IPC workerReady {workerId}
    Worker2->>Master: IPC workerReady {workerId}
    Master->>Master: start polling tick
    loop periodic tick
        Master->>Master: aggregate all worker lobby lists
        Master-->>Master: if total < threshold -> create game
        Master->>Worker2: IPC createGame {gameID, gameConfig}
        Worker2->>GameManager: create game
        GameManager-->>Worker2: new GameServer
        Master->>Worker1: IPC lobbiesBroadcast {PublicGames}
        Master->>Worker2: IPC lobbiesBroadcast {PublicGames}
    end
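To make the loop above concrete, here is a hedged sketch of what the master's periodic tick could look like; WorkerHandle, MIN_PUBLIC_LOBBIES, and the message shapes are illustrative assumptions, not the PR's code.

interface WorkerHandle {
  id: number;
  lobbies: { gameID: string }[];
  send(msg: unknown): void;
}

const MIN_PUBLIC_LOBBIES = 1;

function schedulingTick(workers: WorkerHandle[], makeGameID: () => string): void {
  if (workers.length === 0) return;

  // Aggregate the lobby lists reported by every worker.
  const allLobbies = workers.flatMap((w) => w.lobbies);

  // If too few joinable public lobbies exist, ask one worker to create a game.
  if (allLobbies.length < MIN_PUBLIC_LOBBIES) {
    const target = workers[Math.floor(Math.random() * workers.length)];
    target.send({ type: "createGame", gameID: makeGameID(), gameConfig: {} });
  }

  // Broadcast the aggregated list (plus the master clock) to every worker.
  const broadcast = {
    type: "lobbiesBroadcast",
    serverTime: Date.now(),
    lobbies: allLobbies,
  };
  for (const w of workers) {
    w.send(broadcast);
  }
}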

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Poem

Workers hum in lines of code and light ✨
Master counts lobbies through day and night 🌙
Clients pick a path and sync their time ⏰
Games are born where schedules rhyme 🎲
Distributed halls now sing in flight 🚀

🚥 Pre-merge checks | ✅ 1 | ❌ 2
❌ Failed checks (2 warnings)
Description check: ⚠️ Warning
Explanation: The description contains only the repository's placeholder template with no implementation details, objectives, or context about the changeset. It does not describe any part of the actual changes made.
Resolution: Replace the placeholder template with a meaningful description of the WebSocket relocation changes, including the motivation, architectural changes, and any testing or migration notes.

Docstring Coverage: ⚠️ Warning
Explanation: Docstring coverage is 20.00% which is insufficient. The required threshold is 80.00%.
Resolution: Write docstrings for the functions missing them to satisfy the coverage threshold.
✅ Passed checks (1 passed)
Title check: ✅ Passed
Explanation: The title 'move websockets to worker' directly describes the main architectural change: WebSocket handling is relocated from the master process to worker processes, which aligns with the significant refactoring shown in the changeset.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.


Comment @coderabbitai help to get the list of available commands and usage tips.

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 7

🤖 Fix all issues with AI agents
In `@src/client/LobbySocket.ts`:
- Around line 12-15: The getRandomWorkerPath function can produce `/wNaN` or
out-of-range indices when numWorkers is 0 or invalid; update getRandomWorkerPath
to validate numWorkers (ensure it's a finite integer >= 1), default to 1 (or a
safe fallback like 1) when invalid, and clamp/select an index in the range [0,
numWorkers-1] so the returned path is always `/w0`..`/wN`. Locate and update
getRandomWorkerPath and any other place that calls it (the other occurrence
referenced at line 23) to use the same validation/guard logic or call a shared
validated helper.
- Line 1: The call to getServerConfigFromClient inside LobbySocket.start() can
throw and currently aborts before connectWebSocket and polling begin; wrap calls
to getServerConfigFromClient (used in start() and any other locations noted) in
a try/catch, assign a safe default server config or signal fallback mode when an
exception occurs, and ensure connectWebSocket() and the HTTP polling startup
logic still run (or start fallback polling) even on error; update the error
handling in the start() method and the other occurrences (around lines where
getServerConfigFromClient is called) to catch exceptions, log the error, and
proceed with defaultConfig/fallbackPolling so the lobby updates still start.

In `@src/server/RedisClient.ts`:
- Around line 148-169: In getAllLobbyInfo, corrupted JSON values for lobby
entries are left in Redis and repeatedly logged; update the catch block for
JSON.parse failures to remove the bad record by calling await
redis.hdel(LOBBY_INFO_KEY, gameID) (and then log the error with context), so
replace the current log.error-only behavior with deletion of the key using
redis.hdel before or after logging.
- Around line 83-101: In getPendingGamesForWorker, when JSON.parse fails for a
pending-game entry you should delete the corrupted record from Redis to stop
repeated errors; inside the catch in getPendingGamesForWorker (function name)
call redis.hdel(PENDING_GAMES_KEY, gameID) (or the equivalent async delete
method) after logging the parse failure, and handle/delete any error from hdel
gracefully so the function still returns the remaining valid PendingGame[].
- Around line 130-136: Replace the loose unknown type on LobbyInfo.gameConfig
with the concrete GameConfig | undefined union: update the LobbyInfo interface
(symbol: LobbyInfo) to declare gameConfig: GameConfig | undefined and add an
import for GameConfig from src/core/Schemas.ts (or the module that exports it)
so call sites no longer need unsafe assertions and TypeScript can type-check
game configuration fields.

In `@src/server/Worker.ts`:
- Around line 60-75: In server.on("upgrade") add a guard that parses the
incoming pathname for a worker prefix like "/wX" (extract the first path segment
from pathname) and compare it to the current process/instance worker identifier
(e.g., this.workerPrefix / workerId); if the prefix is present and does not
match the local worker, immediately reject the upgrade and destroy the socket
(do not call wss.handleUpgrade or lobbiesWss.handleUpgrade); otherwise continue
routing to lobbiesWss or wss as before. Ensure the check runs before the
existing pathname === "/lobbies" / endsWith("/lobbies") logic so mismatched
worker-prefixed requests can’t be accepted by the wrong WebSocketServer.
- Around line 172-206: startGameInfoPolling is not removing full lobbies from
Redis, causing the master to keep stale public IDs; after obtaining info =
game.gameInfo() check whether the lobby is full (e.g., info.clients?.length >=
info.gameConfig?.maxPlayers or other capacity field on gameConfig) and if so
call removePublicLobbyID(gameID) and removeLobbyInfo(gameID) (same as the
existing removal for started lobbies). Update the polling loop in
startGameInfoPolling / startPolling right after setLobbyInfo to detect full
lobbies and remove their IDs so Redis no longer advertises joinable lobbies from
this worker.
🧹 Nitpick comments (2)
src/server/Master.ts (1)

176-186: Make Redis writes for new lobbies atomic, or roll back on failure.

If addPendingGame succeeds but addPublicLobbyID fails, a pending game is left behind and never advertised. Use Redis MULTI or roll back with removePendingGame.

🛠️ Simple rollback approach
import {
  addPendingGame,
  addPublicLobbyID,
+  removePendingGame,
  getPublicLobbyIDs,
} from "./RedisClient";

async function schedulePublicGame() {
  const gameID = generateID();

  try {
    const gameConfig = await playlist.gameConfig();
    await addPendingGame(gameID, gameConfig);
-    await addPublicLobbyID(gameID);
+    try {
+      await addPublicLobbyID(gameID);
+    } catch (error) {
+      await removePendingGame(gameID);
+      throw error;
+    }
    log.info(`Scheduled public game ${gameID} via Redis`);
  } catch (error) {
    log.error(`Failed to schedule public game ${gameID}:`, error);
    throw error;
  }
}
src/server/RedisClient.ts (1)

7-59: Reduce duplicate Redis config and allow env overrides.

Both branches repeat the same options and hardcode prod host/port. A small shared config keeps this clean and more deploy‑friendly.

♻️ Proposed refactor
-const isDevMode = process.env.GAME_ENV === "dev";
+const isDevMode = process.env.GAME_ENV === "dev";
+const redisOptions = {
+  maxRetriesPerRequest: 3,
+  retryStrategy(times: number) {
+    const delay = Math.min(times * 50, 2000);
+    return delay;
+  },
+};
+const prodRedisHost = process.env.REDIS_HOST ?? "127.0.0.1";
+const prodRedisPort = Number(process.env.REDIS_PORT ?? 6379);

 ...
-    redis = new Redis({
-      host,
-      port,
-      maxRetriesPerRequest: 3,
-      retryStrategy(times) {
-        const delay = Math.min(times * 50, 2000);
-        return delay;
-      },
-    });
+    redis = new Redis({ host, port, ...redisOptions });

 ...
-  redis = new Redis({
-    host: "127.0.0.1",
-    port: 6379,
-    maxRetriesPerRequest: 3,
-    retryStrategy(times) {
-      const delay = Math.min(times * 50, 2000);
-      return delay;
-    },
-  });
+  redis = new Redis({ host: prodRedisHost, port: prodRedisPort, ...redisOptions });

Comment on lines +12 to +13
function getRandomWorkerPath(numWorkers: number): string {
  const workerIndex = Math.floor(Math.random() * numWorkers);
  return `/w${workerIndex}`;
}

⚠️ Potential issue | 🟡 Minor

Guard against invalid worker count.

If numWorkers is 0/NaN, the path can become /wNaN or point to a non‑existent worker. Add a small guard and fallback.

🛠️ Suggested guard
function getRandomWorkerPath(numWorkers: number): string {
-  const workerIndex = Math.floor(Math.random() * numWorkers);
-  return `/w${workerIndex}`;
+  if (!Number.isFinite(numWorkers) || numWorkers < 1) {
+    console.warn("Invalid numWorkers, defaulting to /w0");
+    return "/w0";
+  }
+  const workerIndex = Math.floor(Math.random() * numWorkers);
+  return `/w${workerIndex}`;
}

Also applies to: 23-23

🤖 Prompt for AI Agents
In `@src/client/LobbySocket.ts` around lines 12 - 15, The getRandomWorkerPath
function can produce `/wNaN` or out-of-range indices when numWorkers is 0 or
invalid; update getRandomWorkerPath to validate numWorkers (ensure it's a finite
integer >= 1), default to 1 (or a safe fallback like 1) when invalid, and
clamp/select an index in the range [0, numWorkers-1] so the returned path is
always `/w0`..`/wN`. Locate and update getRandomWorkerPath and any other place
that calls it (the other occurrence referenced at line 23) to use the same
validation/guard logic or call a shared validated helper.

Comment on lines 83 to 133
export async function getPendingGamesForWorker(
  workerIndex: (gameID: string) => number,
  workerId: number,
): Promise<PendingGame[]> {
  await redisReady;
  const allGames = await redis.hgetall(PENDING_GAMES_KEY);
  const games: PendingGame[] = [];

  for (const [gameID, value] of Object.entries(allGames)) {
    if (workerIndex(gameID) === workerId) {
      try {
        games.push(JSON.parse(value) as PendingGame);
      } catch (e) {
        log.error(`Failed to parse pending game ${gameID}:`, e);
      }
    }
  }

  return games;

⚠️ Potential issue | 🟡 Minor

Clean up corrupted pending-game records.

If JSON parsing fails, the bad entry stays forever and keeps logging. Remove it on parse failure.

🐛 Suggested fix
     if (workerIndex(gameID) === workerId) {
       try {
         games.push(JSON.parse(value) as PendingGame);
       } catch (e) {
         log.error(`Failed to parse pending game ${gameID}:`, e);
+        await redis.hdel(PENDING_GAMES_KEY, gameID);
       }
     }
🤖 Prompt for AI Agents
In `@src/server/RedisClient.ts` around lines 83 - 101, In
getPendingGamesForWorker, when JSON.parse fails for a pending-game entry you
should delete the corrupted record from Redis to stop repeated errors; inside
the catch in getPendingGamesForWorker (function name) call
redis.hdel(PENDING_GAMES_KEY, gameID) (or the equivalent async delete method)
after logging the parse failure, and handle/delete any error from hdel
gracefully so the function still returns the remaining valid PendingGame[].

Comment on lines 130 to 168
export interface LobbyInfo {
  gameID: string;
  numClients: number;
  msUntilStart?: number;
  gameConfig?: unknown;
  updatedAt: number;
}

⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

# First, let's find the GameConfig type
rg -n --type=ts "type GameConfig|interface GameConfig"

Repository: openfrontio/OpenFrontIO

Length of output: 148


🏁 Script executed:

# Let's look at the RedisClient.ts file around those lines
cat -n src/server/RedisClient.ts | head -160 | tail -50

Repository: openfrontio/OpenFrontIO

Length of output: 1862


🏁 Script executed:

# Search for LobbyInfo usage
rg -n --type=ts "LobbyInfo" -B 2 -A 2

Repository: openfrontio/OpenFrontIO

Length of output: 3494


🏁 Script executed:

# Search for gameConfig usage
rg -n --type=ts "gameConfig" -B 2 -A 2

Repository: openfrontio/OpenFrontIO

Length of output: 50379


Use GameConfig | undefined instead of unknown for gameConfig.

The GameConfig type is already well-defined in the codebase (from src/core/Schemas.ts). Using unknown forces type assertions at call sites and hides errors. Replace it with the proper typed union to keep the code safe and clear.

♻️ Possible change
 export interface LobbyInfo {
   gameID: string;
   numClients: number;
   msUntilStart?: number;
-  gameConfig?: unknown;
+  gameConfig?: GameConfig;
   updatedAt: number;
 }

Remember to import GameConfig from src/core/Schemas.ts if not already present.

🤖 Prompt for AI Agents
In `@src/server/RedisClient.ts` around lines 130 - 136, Replace the loose unknown
type on LobbyInfo.gameConfig with the concrete GameConfig | undefined union:
update the LobbyInfo interface (symbol: LobbyInfo) to declare gameConfig:
GameConfig | undefined and add an import for GameConfig from src/core/Schemas.ts
(or the module that exports it) so call sites no longer need unsafe assertions
and TypeScript can type-check game configuration fields.

Comment on lines 148 to 201
export async function getAllLobbyInfo(): Promise<LobbyInfo[]> {
  await redisReady;
  const allInfo = await redis.hgetall(LOBBY_INFO_KEY);
  const lobbies: LobbyInfo[] = [];
  const now = Date.now();

  for (const [gameID, value] of Object.entries(allInfo)) {
    try {
      const info = JSON.parse(value) as LobbyInfo;
      // Skip stale entries (older than 5 seconds)
      if (now - info.updatedAt > 5000) {
        await redis.hdel(LOBBY_INFO_KEY, gameID);
        continue;
      }
      lobbies.push(info);
    } catch (e) {
      log.error(`Failed to parse lobby info ${gameID}:`, e);
    }
  }

  return lobbies;
}

⚠️ Potential issue | 🟡 Minor

Clean up corrupted lobby-info records on parse failure.

Right now bad JSON stays and will keep logging. Delete it on failure.

🐛 Suggested fix
     try {
       const info = JSON.parse(value) as LobbyInfo;
       // Skip stale entries (older than 5 seconds)
       if (now - info.updatedAt > 5000) {
         await redis.hdel(LOBBY_INFO_KEY, gameID);
         continue;
       }
       lobbies.push(info);
     } catch (e) {
       log.error(`Failed to parse lobby info ${gameID}:`, e);
+      await redis.hdel(LOBBY_INFO_KEY, gameID);
     }
📝 Committable suggestion

export async function getAllLobbyInfo(): Promise<LobbyInfo[]> {
  await redisReady;
  const allInfo = await redis.hgetall(LOBBY_INFO_KEY);
  const lobbies: LobbyInfo[] = [];
  const now = Date.now();

  for (const [gameID, value] of Object.entries(allInfo)) {
    try {
      const info = JSON.parse(value) as LobbyInfo;
      // Skip stale entries (older than 5 seconds)
      if (now - info.updatedAt > 5000) {
        await redis.hdel(LOBBY_INFO_KEY, gameID);
        continue;
      }
      lobbies.push(info);
    } catch (e) {
      log.error(`Failed to parse lobby info ${gameID}:`, e);
      await redis.hdel(LOBBY_INFO_KEY, gameID);
    }
  }

  return lobbies;
}
🤖 Prompt for AI Agents
In `@src/server/RedisClient.ts` around lines 148 - 169, In getAllLobbyInfo,
corrupted JSON values for lobby entries are left in Redis and repeatedly logged;
update the catch block for JSON.parse failures to remove the bad record by
calling await redis.hdel(LOBBY_INFO_KEY, gameID) (and then log the error with
context), so replace the current log.error-only behavior with deletion of the
key using redis.hdel before or after logging.

Comment on lines 60 to 75
const wss = new WebSocketServer({ noServer: true });
const lobbiesWss = new WebSocketServer({ noServer: true });

// Handle upgrade requests to route to correct WebSocket server
server.on("upgrade", (request, socket, head) => {
  const pathname = request.url ?? "";
  if (pathname === "/lobbies" || pathname.endsWith("/lobbies")) {
    lobbiesWss.handleUpgrade(request, socket, head, (ws) => {
      lobbiesWss.emit("connection", ws, request);
    });
  } else {
    wss.handleUpgrade(request, socket, head, (ws) => {
      wss.emit("connection", ws, request);
    });
  }
});

⚠️ Potential issue | 🟡 Minor

Validate worker prefix on WebSocket upgrades.

Upgrade handling bypasses the /wX middleware, so a mismatched path can be accepted by the wrong worker. Parse /wX and reject mismatches before routing.

🛠️ Suggested guard
server.on("upgrade", (request, socket, head) => {
-  const pathname = request.url ?? "";
+  const pathname = request.url ?? "";
+  const match = pathname.match(/^\/w(\d+)(\/.*)?$/);
+  if (match && parseInt(match[1]) !== workerId) {
+    socket.destroy();
+    return;
+  }
  if (pathname === "/lobbies" || pathname.endsWith("/lobbies")) {
    lobbiesWss.handleUpgrade(request, socket, head, (ws) => {
      lobbiesWss.emit("connection", ws, request);
    });
  } else {
    wss.handleUpgrade(request, socket, head, (ws) => {
      wss.emit("connection", ws, request);
    });
  }
});
📝 Committable suggestion

const wss = new WebSocketServer({ noServer: true });
const lobbiesWss = new WebSocketServer({ noServer: true });

// Handle upgrade requests to route to correct WebSocket server
server.on("upgrade", (request, socket, head) => {
  const pathname = request.url ?? "";
  const match = pathname.match(/^\/w(\d+)(\/.*)?$/);
  if (match && parseInt(match[1]) !== workerId) {
    socket.destroy();
    return;
  }
  if (pathname === "/lobbies" || pathname.endsWith("/lobbies")) {
    lobbiesWss.handleUpgrade(request, socket, head, (ws) => {
      lobbiesWss.emit("connection", ws, request);
    });
  } else {
    wss.handleUpgrade(request, socket, head, (ws) => {
      wss.emit("connection", ws, request);
    });
  }
});
🤖 Prompt for AI Agents
In `@src/server/Worker.ts` around lines 60 - 75, In server.on("upgrade") add a
guard that parses the incoming pathname for a worker prefix like "/wX" (extract
the first path segment from pathname) and compare it to the current
process/instance worker identifier (e.g., this.workerPrefix / workerId); if the
prefix is present and does not match the local worker, immediately reject the
upgrade and destroy the socket (do not call wss.handleUpgrade or
lobbiesWss.handleUpgrade); otherwise continue routing to lobbiesWss or wss as
before. Ensure the check runs before the existing pathname === "/lobbies" /
endsWith("/lobbies") logic so mismatched worker-prefixed requests can’t be
accepted by the wrong WebSocketServer.

Comment on lines 172 to 206
// Periodically write this worker's public games to Redis
startGameInfoPolling(gm);

async function startGameInfoPolling(gameManager: GameManager) {
  startPolling(async () => {
    try {
      const publicLobbyIDs = await getPublicLobbyIDs();

      for (const gameID of publicLobbyIDs) {
        // Only update games that belong to this worker
        if (config.workerIndex(gameID) !== workerId) continue;

        const game = gameManager.game(gameID);
        if (game === null) {
          // Game no longer exists, clean up
          await removeLobbyInfo(gameID);
          await removePublicLobbyID(gameID);
          continue;
        }

        const info = game.gameInfo();
        await setLobbyInfo({
          gameID: info.gameID,
          numClients: info.clients?.length ?? 0,
          msUntilStart: info.msUntilStart,
          gameConfig: info.gameConfig,
          updatedAt: Date.now(),
        });

        // Remove from public lobbies if game started or is full
        if (info.msUntilStart !== undefined && info.msUntilStart <= 250) {
          await removePublicLobbyID(gameID);
          await removeLobbyInfo(gameID);
        }
      }

⚠️ Potential issue | 🟠 Major

Remove full lobbies from Redis set to avoid scheduler stall.

Full lobbies are filtered out for clients, but the public lobby ID remains in Redis, so the master may stop scheduling even when no joinable lobbies exist. Remove IDs when the lobby is full.

🛠️ Suggested fix
const info = game.gameInfo();
-          await setLobbyInfo({
-            gameID: info.gameID,
-            numClients: info.clients?.length ?? 0,
-            msUntilStart: info.msUntilStart,
-            gameConfig: info.gameConfig,
-            updatedAt: Date.now(),
-          });
+          const numClients = info.clients?.length ?? 0;
+          const maxPlayers = info.gameConfig?.maxPlayers;
+          const isFull =
+            maxPlayers !== undefined && numClients >= maxPlayers;
+
+          await setLobbyInfo({
+            gameID: info.gameID,
+            numClients,
+            msUntilStart: info.msUntilStart,
+            gameConfig: info.gameConfig,
+            updatedAt: Date.now(),
+          });

          // Remove from public lobbies if game started or is full
-          if (info.msUntilStart !== undefined && info.msUntilStart <= 250) {
+          if (
+            (info.msUntilStart !== undefined && info.msUntilStart <= 250) ||
+            isFull
+          ) {
            await removePublicLobbyID(gameID);
            await removeLobbyInfo(gameID);
          }
📝 Committable suggestion

// Periodically write this worker's public games to Redis
startGameInfoPolling(gm);

async function startGameInfoPolling(gameManager: GameManager) {
  startPolling(async () => {
    try {
      const publicLobbyIDs = await getPublicLobbyIDs();

      for (const gameID of publicLobbyIDs) {
        // Only update games that belong to this worker
        if (config.workerIndex(gameID) !== workerId) continue;

        const game = gameManager.game(gameID);
        if (game === null) {
          // Game no longer exists, clean up
          await removeLobbyInfo(gameID);
          await removePublicLobbyID(gameID);
          continue;
        }

        const info = game.gameInfo();
        const numClients = info.clients?.length ?? 0;
        const maxPlayers = info.gameConfig?.maxPlayers;
        const isFull = maxPlayers !== undefined && numClients >= maxPlayers;

        await setLobbyInfo({
          gameID: info.gameID,
          numClients,
          msUntilStart: info.msUntilStart,
          gameConfig: info.gameConfig,
          updatedAt: Date.now(),
        });

        // Remove from public lobbies if game started or is full
        if (
          (info.msUntilStart !== undefined && info.msUntilStart <= 250) ||
          isFull
        ) {
          await removePublicLobbyID(gameID);
          await removeLobbyInfo(gameID);
        }
      }
🤖 Prompt for AI Agents
In `@src/server/Worker.ts` around lines 172 - 206, startGameInfoPolling is not
removing full lobbies from Redis, causing the master to keep stale public IDs;
after obtaining info = game.gameInfo() check whether the lobby is full (e.g.,
info.clients?.length >= info.gameConfig?.maxPlayers or other capacity field on
gameConfig) and if so call removePublicLobbyID(gameID) and
removeLobbyInfo(gameID) (same as the existing removal for started lobbies).
Update the polling loop in startGameInfoPolling / startPolling right after
setLobbyInfo to detect full lobbies and remove their IDs so Redis no longer
advertises joinable lobbies from this worker.

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 5

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
Dockerfile (1)

10-11: Pipeline failure: missing build tools for redis-memory-server

The npm ci step fails because redis-memory-server (devDependency) compiles Redis from source during postinstall. The build stage lacks make and other build tools.

Add build-essential to the build stage before running npm ci:

🔧 Proposed fix
 # Build stage - install ALL dependencies and build
 FROM base AS build
 ENV HUSKY=0
+# Install build tools needed for native dependencies
+RUN apt-get update && apt-get install -y build-essential && rm -rf /var/lib/apt/lists/*
 # Copy package files first for better caching
 COPY package*.json ./
 RUN --mount=type=cache,target=/root/.npm \
     npm ci
🤖 Fix all issues with AI agents
In `@package.json`:
- Line 78: The package.json dependency for "redis-memory-server" is set to an
invalid version ^0.15.0; update the entry for "redis-memory-server" to the
latest available 0.14.1 (e.g., change "redis-memory-server": "^0.15.0" to
"redis-memory-server": "^0.14.1") so npm/yarn can resolve it while keeping
"ioredis" ^5.9.2 unchanged.

In `@src/client/LobbySocket.ts`:
- Line 1: The start() method in LobbySocket currently calls
getServerConfigFromClient and lets errors propagate, preventing lobby
connections when /api/env fails; update start() (and similar calls around the
40-45 region) to catch errors from getServerConfigFromClient, log the error via
the existing logger, and fall back to a sensible default config/path (e.g.,
default server path or empty config) so lobby updates still start; ensure the
catch only supplies the fallback and does not suppress other unrelated errors.
- Around line 12-15: getRandomWorkerPath can produce "/wNaN" when numWorkers is
0 or invalid; update the function (getRandomWorkerPath) to validate numWorkers
(e.g., if not a finite positive integer) and use a safe fallback index (such as
0) or normalize with Math.max(1, Math.floor(...)) so the returned path is always
a valid "/w{n}" string; ensure the same guard/normalization is applied to any
other callers that compute workerIndex in this module.

In `@src/server/RedisClient.ts`:
- Around line 39-60: The current production branch sets redisReady =
Promise.resolve() immediately, which can let callers proceed before the Redis
client actually connects; change this to set redisReady to a Promise that
resolves when the Redis client's "connect" event fires and rejects (or rejects
and logs) on an "error" (and optionally "end") event so callers await a real
connection; update the block where redis is created (the Redis(...) constructor
and the redis.on handlers) to create this promise, attach resolve in
redis.on("connect") and reject in redis.on("error"), and ensure existing
log.info/log.error calls remain but no longer prematurely mark readiness.

In `@src/server/Worker.ts`:
- Line 33: The WebSocket upgrade handler currently only checks
pathname.endsWith("/lobbies") and thus skips validating the /w{workerId} prefix
that the HTTP middleware enforces; update the upgrade handling logic (where
pathname is checked, near the code that uses WorkerLobbyService) to parse and
strip a leading /w{workerId} segment and verify it equals this.id (e.g., compare
`/w${this.id}`), returning/ignoring the upgrade if it mismatches, so WebSocket
routing matches the HTTP middleware behavior.
🧹 Nitpick comments (7)
src/server/RedisClient.ts (3)

154-161: Side effect in read operation: getAllLobbyInfo deletes stale entries

This function both reads and deletes data. This can cause issues:

  • Unexpected behavior for callers expecting a pure read
  • Multiple concurrent calls may cause race conditions
  • Harder to reason about and test

Consider separating cleanup into a dedicated function or polling loop.

♻️ Suggested approach
 export async function getAllLobbyInfo(): Promise<LobbyInfo[]> {
   await redisReady;
   const allInfo = await redis.hgetall(LOBBY_INFO_KEY);
   const lobbies: LobbyInfo[] = [];
   const now = Date.now();
+  const staleKeys: string[] = [];

   for (const [gameID, value] of Object.entries(allInfo)) {
     try {
       const info = JSON.parse(value) as LobbyInfo;
       // Skip stale entries (older than 5 seconds)
       if (now - info.updatedAt > 5000) {
-        await redis.hdel(LOBBY_INFO_KEY, gameID);
+        staleKeys.push(gameID);
         continue;
       }
       lobbies.push(info);
     } catch (e) {
       log.error(`Failed to parse lobby info ${gameID}:`, e);
+      staleKeys.push(gameID);
     }
   }

+  // Clean up stale entries in background (fire-and-forget)
+  if (staleKeys.length > 0) {
+    redis.hdel(LOBBY_INFO_KEY, ...staleKeys).catch((err) => {
+      log.error("Failed to clean up stale lobby entries:", err);
+    });
+  }
+
   return lobbies;
 }

12-38: Consider extracting shared Redis configuration to reduce duplication

The Redis client options and event handlers are duplicated between dev and prod modes. Extract common logic:

♻️ Example refactor
function createRedisClient(host: string, port: number): Redis {
  const client = new Redis({
    host,
    port,
    maxRetriesPerRequest: 3,
    retryStrategy(times) {
      return Math.min(times * 50, 2000);
    },
  });

  client.on("error", (err) => {
    log.error("Redis connection error:", err);
  });

  client.on("connect", () => {
    log.info(`Connected to Redis at ${host}:${port}`);
  });

  return client;
}

130-136: Use typed GameConfig instead of unknown for gameConfig

The LobbyInfo interface uses gameConfig?: unknown, but GameConfig is already imported and used elsewhere. For type safety:

 export interface LobbyInfo {
   gameID: string;
   numClients: number;
   msUntilStart?: number;
-  gameConfig?: unknown;
+  gameConfig?: GameConfig;
   updatedAt: number;
 }
src/server/Master.ts (1)

176-188: Consider cleanup on schedule failure

If addPendingGame succeeds but addPublicLobbyID fails, you'll have an orphan pending game in Redis. Consider cleanup:

♻️ Suggested fix
 async function schedulePublicGame() {
   const gameID = generateID();

   try {
     const gameConfig = await playlist.gameConfig();
     await addPendingGame(gameID, gameConfig);
-    await addPublicLobbyID(gameID);
-    log.info(`Scheduled public game ${gameID} via Redis`);
+    try {
+      await addPublicLobbyID(gameID);
+      log.info(`Scheduled public game ${gameID} via Redis`);
+    } catch (error) {
+      // Rollback pending game if public lobby ID fails
+      await removePendingGame(gameID).catch(() => {});
+      throw error;
+    }
   } catch (error) {
     log.error(`Failed to schedule public game ${gameID}:`, error);
     throw error;
   }
 }

You'll need to import removePendingGame from ./RedisClient.

src/server/WorkerLobbyService.ts (3)

52-65: URL path matching may be too broad

The check pathname.endsWith("/lobbies") would match paths like /foo/lobbies or /api/lobbies. If those paths should go to gameWss, this could cause issues.

♻️ Suggested fix for stricter matching
   private setupUpgradeHandler() {
     this.server.on("upgrade", (request, socket, head) => {
       const pathname = request.url ?? "";
-      if (pathname === "/lobbies" || pathname.endsWith("/lobbies")) {
+      // Match exactly "/lobbies" or "/<worker>/lobbies" pattern
+      if (pathname === "/lobbies" || /^\/\d+\/lobbies$/.test(pathname)) {
         this.lobbiesWss.handleUpgrade(request, socket, head, (ws) => {
           this.lobbiesWss.emit("connection", ws, request);
         });
       } else {

144-151: Extract magic number and improve type handling

The 250 ms threshold appears in multiple places. Also, casting info.gameConfig to a partial object is fragile.

♻️ Suggested improvements
// At module level or in a constants file
const GAME_START_THRESHOLD_MS = 250;

// In the filter:
if (info.msUntilStart !== undefined && info.msUntilStart <= GAME_START_THRESHOLD_MS)
  return false;

// For gameConfig, consider using the GameConfig type with optional fields
// or use a type guard function

182-186: Duplicate cleanup logic for lobby removal

Lobby cleanup happens in two places:

  1. Lines 182-186: When game === null
  2. Lines 199-202: When msUntilStart <= 250

Consider extracting to a helper:

♻️ Example helper
private async cleanupLobby(gameID: string): Promise<void> {
  await removeLobbyInfo(gameID);
  await removePublicLobbyID(gameID);
}

Also applies to: 199-202

package.json Outdated
"prettier-plugin-organize-imports": "^4.1.0",
"prettier-plugin-sh": "^0.17.4",
"protobufjs": "^7.5.3",
"redis-memory-server": "^0.15.0",

⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🏁 Script executed:

# First, let's check the package.json file around the lines mentioned
head -120 package.json | tail -50

Repository: openfrontio/OpenFrontIO

Length of output: 1694


🌐 Web query:

latest version of ioredis npm package 2025

💡 Result:

The latest ioredis release is v5.9.2 (released January 15, 2026). [1]

Sources:
[1] GitHub releases — redis/ioredis (v5.9.2).


🌐 Web query:

latest version of redis-memory-server npm package 2025

💡 Result:

The latest redis-memory-server npm version is 0.14.1 (as of January 20, 2026). [1][2]

Sources:

  • Socket (package info) — shows v0.14.1. [1]
  • jsDelivr package page — shows v0.14.1. [2]

Update redis-memory-server version—0.15.0 does not exist.

ioredis ^5.9.2 is correct (latest version). However, redis-memory-server ^0.15.0 is invalid. The latest available version is 0.14.1 (as of January 20, 2026). Change to ^0.14.1 to fix the dependency resolution failure.

Both packages are good choices: ioredis is the standard Node.js Redis client, and redis-memory-server provides isolated Redis instances for development and testing.

🤖 Prompt for AI Agents
In `@package.json` at line 78, The package.json dependency for
"redis-memory-server" is set to an invalid version ^0.15.0; update the entry for
"redis-memory-server" to the latest available 0.14.1 (e.g., change
"redis-memory-server": "^0.15.0" to "redis-memory-server": "^0.14.1") so
npm/yarn can resolve it while keeping "ioredis" ^5.9.2 unchanged.

Comment on lines +12 to +13
function getRandomWorkerPath(numWorkers: number): string {
  const workerIndex = Math.floor(Math.random() * numWorkers);
  return `/w${workerIndex}`;
}

⚠️ Potential issue | 🟠 Major

Guard against numWorkers <= 0 to avoid /wNaN.

If numWorkers() is 0 or invalid, the path becomes /wNaN and both WS + HTTP fail. Add a safe fallback.

✅ Suggested fix
 function getRandomWorkerPath(numWorkers: number): string {
-  const workerIndex = Math.floor(Math.random() * numWorkers);
+  const safeCount =
+    Number.isFinite(numWorkers) && numWorkers > 0 ? Math.floor(numWorkers) : 1;
+  const workerIndex = Math.floor(Math.random() * safeCount);
   return `/w${workerIndex}`;
 }

Also applies to: 23-23

🤖 Prompt for AI Agents
In `@src/client/LobbySocket.ts` around lines 12 - 15, getRandomWorkerPath can
produce "/wNaN" when numWorkers is 0 or invalid; update the function
(getRandomWorkerPath) to validate numWorkers (e.g., if not a finite positive
integer) and use a safe fallback index (such as 0) or normalize with Math.max(1,
Math.floor(...)) so the returned path is always a valid "/w{n}" string; ensure
the same guard/normalization is applied to any other callers that compute
workerIndex in this module.

Comment on lines 39 to 92
} else {
  // In production, connect to real Redis
  redis = new Redis({
    host: "127.0.0.1",
    port: 6379,
    maxRetriesPerRequest: 3,
    retryStrategy(times) {
      const delay = Math.min(times * 50, 2000);
      return delay;
    },
  });

  redis.on("error", (err) => {
    log.error("Redis connection error:", err);
  });

  redis.on("connect", () => {
    log.info("Connected to Redis");
  });

  redisReady = Promise.resolve();
}

⚠️ Potential issue | 🟠 Major

Production Redis: redisReady resolves before connection is established

In production mode, redisReady = Promise.resolve() is set immediately, but the Redis connection happens asynchronously. If any Redis operation is called before the connect event fires, it may fail or behave unexpectedly.

Consider waiting for the actual connection:

🔧 Proposed fix
 } else {
   // In production, connect to real Redis
-  redis = new Redis({
+  const prodRedis = new Redis({
     host: "127.0.0.1",
     port: 6379,
     maxRetriesPerRequest: 3,
     retryStrategy(times) {
       const delay = Math.min(times * 50, 2000);
       return delay;
     },
   });
+  redis = prodRedis;

-  redis.on("error", (err) => {
+  prodRedis.on("error", (err) => {
     log.error("Redis connection error:", err);
   });

-  redis.on("connect", () => {
+  prodRedis.on("connect", () => {
     log.info("Connected to Redis");
   });

-  redisReady = Promise.resolve();
+  redisReady = new Promise<void>((resolve) => {
+    prodRedis.once("ready", () => {
+      resolve();
+    });
+  });
 }
🤖 Prompt for AI Agents
In `@src/server/RedisClient.ts` around lines 39 - 60, The current production
branch sets redisReady = Promise.resolve() immediately, which can let callers
proceed before the Redis client actually connects; change this to set redisReady
to a Promise that resolves when the Redis client's "connect" event fires and
rejects (or rejects and logs) on an "error" (and optionally "end") event so
callers await a real connection; update the block where redis is created (the
Redis(...) constructor and the redis.on handlers) to create this promise, attach
resolve in redis.on("connect") and reject in redis.on("error"), and ensure
existing log.info/log.error calls remain but no longer prematurely mark
readiness.

import { startPolling } from "./PollingLoop";
import { PrivilegeRefresher } from "./PrivilegeRefresher";
import { verifyTurnstileToken } from "./Turnstile";
import { WorkerLobbyService } from "./WorkerLobbyService";

⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# First, let's see the structure and locate the relevant files
git ls-files | grep -E "(Worker|Lobby)" | head -20

Repository: openfrontio/OpenFrontIO

Length of output: 538


🏁 Script executed:

#!/bin/bash
# Read Worker.ts to see context around lines 33 and 55-71
wc -l src/server/Worker.ts
head -80 src/server/Worker.ts | cat -n

Repository: openfrontio/OpenFrontIO

Length of output: 3108


🏁 Script executed:

#!/bin/bash
# Search for upgrade handlers and WebSocket routing
rg -n "upgrade|wss|WebSocket" src/server/ -A 3

Repository: openfrontio/OpenFrontIO

Length of output: 9052


🏁 Script executed:

#!/bin/bash
# Get the full setupUpgradeHandler method and surrounding context
sed -n '52,65p' src/server/WorkerLobbyService.ts | cat -n

Repository: openfrontio/OpenFrontIO

Length of output: 682


🏁 Script executed:

#!/bin/bash
# Look for Express middleware that might handle /w{workerId} paths
rg -n "app\.use|router\.|/w" src/server/Worker.ts -B 2 -A 2

Repository: openfrontio/OpenFrontIO

Length of output: 1536


🏁 Script executed:

#!/bin/bash
# Check if there's any middleware setup in Worker.ts
rg -n "middleware|use\(" src/server/Worker.ts -A 3

Repository: openfrontio/OpenFrontIO

Length of output: 986


🏁 Script executed:

#!/bin/bash
# Get the complete middleware implementation to understand what it does
sed -n '83,106p' src/server/Worker.ts | cat -n

Repository: openfrontio/OpenFrontIO

Length of output: 950


WebSocket upgrade handler does not validate worker ID—inconsistent with HTTP middleware.

The Express middleware validates /w{workerId} for HTTP requests (line 12: returns 404 on mismatch), but the WebSocket upgrade handler only checks pathname.endsWith("/lobbies") without validating the worker ID prefix. This means requests like /w2/lobbies will be routed correctly on the current worker even though HTTP would reject them.

Either:

  • Strip and validate the /w{workerId} prefix in the upgrade handler (matching HTTP behavior), or
  • Document that WebSocket upgrades intentionally accept any worker prefix for load balancing.
🤖 Prompt for AI Agents
In `@src/server/Worker.ts` at line 33, The WebSocket upgrade handler currently
only checks pathname.endsWith("/lobbies") and thus skips validating the
/w{workerId} prefix that the HTTP middleware enforces; update the upgrade
handling logic (where pathname is checked, near the code that uses
WorkerLobbyService) to parse and strip a leading /w{workerId} segment and verify
it equals this.id (e.g., compare `/w${this.id}`), returning/ignoring the upgrade
if it mismatches, so WebSocket routing matches the HTTP middleware behavior.

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
src/server/Master.ts (1)

144-148: Missing Redis connection info in worker restart.

Restarted workers don't receive REDIS_HOST and REDIS_PORT environment variables. In dev mode with redis-memory-server (dynamic port), restarted workers will fail to connect to Redis.

🐛 Suggested fix
     // Restart the worker with the same ID
     const newWorker = cluster.fork({
       WORKER_ID: workerId,
       ADMIN_TOKEN,
       INSTANCE_ID,
+      REDIS_HOST: process.env.REDIS_HOST,
+      REDIS_PORT: process.env.REDIS_PORT,
     });
🤖 Fix all issues with AI agents
In `@src/server/RedisClient.ts`:
- Around line 46-70: redisReady is being resolved immediately for workers
instead of waiting for the actual Redis connection; change the worker branch in
RedisClient.ts to create redisReady as a Promise that resolves when the Redis
client emits a successful connection/ready event (use redis.once("connect" or
"ready") to resolve) and rejects on "error" so callers wait for a real
connection; ensure you remove the immediate Promise.resolve() and tie resolution
to the same event handlers used for logging (redis.on/once) to avoid race
conditions.

In `@src/server/WorkerLobbyService.ts`:
- Around line 52-65: The upgrade handler in setupUpgradeHandler currently routes
solely on the "/lobbies" suffix and ignores a leading worker prefix (e.g.
"/w2"), allowing cross-worker connections; update the server.on("upgrade") logic
in setupUpgradeHandler to parse and validate an optional "/w<N>" prefix from
request.url (extract N with a regex like ^/w(\d+)(/|$)), compare it to this
worker's id (use the existing worker id field, e.g. this.workerId), and if the
prefix exists and does not match return an HTTP 404/close the socket
(destroy/end) instead of proceeding; if it matches or no prefix is present,
strip the prefix before checking for "/lobbies" and continue using
lobbiesWss.handleUpgrade or gameWss.handleUpgrade as before.
♻️ Duplicate comments (7)
src/server/RedisClient.ts (4)

162-168: Use GameConfig instead of unknown for type safety.

GameConfig is already imported at line 3. Using unknown forces unsafe type assertions at call sites.

♻️ Suggested fix
 export interface LobbyInfo {
   gameID: string;
   numClients: number;
   msUntilStart?: number;
-  gameConfig?: unknown;
+  gameConfig?: GameConfig;
   updatedAt: number;
 }

71-92: Production: redisReady resolves before connection is established.

In production mode, redisReady = Promise.resolve() completes immediately, but Redis connection is async. Operations called before the connect event may fail.

🛠️ Suggested fix
   redis.on("connect", () => {
     log.info("Connected to Redis");
   });

-  redisReady = Promise.resolve();
+  redisReady = new Promise<void>((resolve) => {
+    redis.once("ready", () => {
+      resolve();
+    });
+  });
 }

123-130: Clean up corrupted pending-game records on parse failure.

If JSON parsing fails, the bad entry stays in Redis forever and keeps logging errors. Delete it after logging.

🛠️ Suggested fix
       try {
         games.push(JSON.parse(value) as PendingGame);
       } catch (e) {
         log.error(`Failed to parse pending game ${gameID}:`, e);
+        await redis.hdel(PENDING_GAMES_KEY, gameID);
       }

195-197: Clean up corrupted lobby-info records on parse failure.

Stale entries get cleaned up (lines 190-192), but parse failures leave bad JSON in Redis forever.

🛠️ Suggested fix
     } catch (e) {
       log.error(`Failed to parse lobby info ${gameID}:`, e);
+      await redis.hdel(LOBBY_INFO_KEY, gameID);
     }
src/client/LobbySocket.ts (2)

12-15: Guard against invalid numWorkers to avoid /wNaN paths.

If numWorkers() returns 0, NaN, or a negative number, Math.floor(Math.random() * numWorkers) produces NaN or invalid indices. Add a fallback.

🛠️ Suggested fix
 function getRandomWorkerPath(numWorkers: number): string {
+  if (!Number.isFinite(numWorkers) || numWorkers < 1) {
+    console.warn("Invalid numWorkers, defaulting to /w0");
+    return "/w0";
+  }
   const workerIndex = Math.floor(Math.random() * numWorkers);
   return `/w${workerIndex}`;
 }

40-45: Handle config fetch errors so lobby updates still start.

If /api/env fails, start() throws before connectWebSocket() runs, leaving the lobby completely dead. Wrap in try/catch and fall back to a default worker path.

🛠️ Suggested fix
   async start() {
     this.wsConnectionAttempts = 0;
-    // Get config to determine number of workers, then pick a random one
-    const config = await getServerConfigFromClient();
-    this.workerPath = getRandomWorkerPath(config.numWorkers());
+    try {
+      // Get config to determine number of workers, then pick a random one
+      const config = await getServerConfigFromClient();
+      this.workerPath = getRandomWorkerPath(config.numWorkers());
+    } catch (error) {
+      console.error("Failed to load server config, defaulting to /w0:", error);
+      this.workerPath = "/w0";
+    }
     this.connectWebSocket();
   }
src/server/WorkerLobbyService.ts (1)

172-208: Full lobbies are not removed from the public lobby set.

The filter in startLobbyPolling hides full lobbies from clients, but startGameInfoPolling only removes lobbies when msUntilStart <= 250. Full lobbies remain in Redis, which can cause the master scheduler to think public lobbies exist when none are joinable.

This was flagged in a previous review and appears unaddressed.

Suggested fix
         const info = game.gameInfo();
+        const numClients = info.clients?.length ?? 0;
+        const maxPlayers = (info.gameConfig as { maxPlayers?: number })
+          ?.maxPlayers;
+        const isFull =
+          maxPlayers !== undefined && numClients >= maxPlayers;
+
         await setLobbyInfo({
           gameID: info.gameID,
-          numClients: info.clients?.length ?? 0,
+          numClients,
           msUntilStart: info.msUntilStart,
           gameConfig: info.gameConfig,
           updatedAt: Date.now(),
         });

         // Remove from public lobbies if game started or is full
-        if (info.msUntilStart !== undefined && info.msUntilStart <= 250) {
+        if (
+          (info.msUntilStart !== undefined && info.msUntilStart <= 250) ||
+          isFull
+        ) {
           await removePublicLobbyID(gameID);
           await removeLobbyInfo(gameID);
         }
🧹 Nitpick comments (3)
src/server/WorkerLobbyService.ts (1)

136-170: Type assertion may hide mismatches.

Line 161 uses as GameInfo which bypasses type checking. If GameInfo has required fields not present in the mapped object, runtime errors may occur silently.

Safer approach
         .map(
           (info) =>
-            ({
+            {
               gameID: info.gameID,
               numClients: info.numClients,
               gameConfig: info.gameConfig,
               msUntilStart: info.msUntilStart,
-            }) as GameInfo,
+            } satisfies Partial<GameInfo>,
         );

Or define a proper mapping function that returns GameInfo with all required fields.

src/server/Worker.ts (2)

77-82: Positional arguments with undefined placeholder reduce clarity.

Using undefined as a placeholder for the refreshInterval parameter makes the call harder to read. Named parameters via an options object would be clearer, but this is minor.

Consider an options object (future improvement)

If PrivilegeRefresher is refactored later, using an options object would improve readability:

// Future pattern:
new PrivilegeRefresher({
  endpoint: config.jwtIssuer() + "/cosmetics.json",
  log,
  env: config.env(),
});

556-558: Suppressing errors in Dev may hide issues during development.

Silencing matchmaking poll errors in Dev environment means developers won't see network failures or misconfigurations. Consider logging at a lower level (debug/warn) instead of suppressing entirely.

Alternative: log at debug level in Dev
       } catch (error) {
-        if (config.env() !== GameEnv.Dev) {
-          log.error(`Error polling matchmaking:`, error);
-        }
+        const level = config.env() === GameEnv.Dev ? "debug" : "error";
+        log[level](`Error polling matchmaking:`, error);
       }

Comment on lines 46 to 70
} else {
  // Worker: connect to Redis using env vars set by master
  const host = process.env.REDIS_HOST ?? "127.0.0.1";
  const port = parseInt(process.env.REDIS_PORT ?? "6379");

  redis = new Redis({
    host,
    port,
    maxRetriesPerRequest: 3,
    retryStrategy(times) {
      const delay = Math.min(times * 50, 2000);
      return delay;
    },
  });

  redis.on("error", (err) => {
    log.error("Redis connection error:", err);
  });

  redis.on("connect", () => {
    log.info(`Worker connected to Redis memory server at ${host}:${port}`);
  });

  redisReady = Promise.resolve();
}

⚠️ Potential issue | 🟠 Major

Dev worker: redisReady also resolves before connection.

Same issue as production - the worker's redisReady resolves immediately at line 69 while the actual Redis connection happens asynchronously.

🛠️ Suggested fix
     redis.on("connect", () => {
       log.info(`Worker connected to Redis memory server at ${host}:${port}`);
     });

-    redisReady = Promise.resolve();
+    redisReady = new Promise<void>((resolve) => {
+      redis.once("ready", () => {
+        resolve();
+      });
+    });
   }
🤖 Prompt for AI Agents
In `@src/server/RedisClient.ts` around lines 46 - 70, redisReady is being resolved
immediately for workers instead of waiting for the actual Redis connection;
change the worker branch in RedisClient.ts to create redisReady as a Promise
that resolves when the Redis client emits a successful connection/ready event
(use redis.once("connect" or "ready") to resolve) and rejects on "error" so
callers wait for a real connection; ensure you remove the immediate
Promise.resolve() and tie resolution to the same event handlers used for logging
(redis.on/once) to avoid race conditions.
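
For illustration, a small sketch of the pattern the prompt describes — resolve on the client's ready event and reject on the first error — assuming the ioredis client used elsewhere in this file; the host/port literals are placeholders for the env-var values:

import Redis from "ioredis";

const redis = new Redis({ host: "127.0.0.1", port: 6379 });

// Gate redisReady on the real connection lifecycle instead of resolving eagerly.
const redisReady = new Promise<void>((resolve, reject) => {
  // "ready" fires once the client can accept commands.
  redis.once("ready", () => resolve());
  // Rejecting on the first error lets callers awaiting redisReady fail fast.
  redis.once("error", (err) => reject(err));
});

redis.on("error", (err) => {
  console.error("Redis connection error:", err);
});

// Callers then wait for a real connection before issuing commands:
// await redisReady;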

Comment on lines +52 to +100
  private setupUpgradeHandler() {
    this.server.on("upgrade", (request, socket, head) => {
      const pathname = request.url ?? "";
      if (pathname === "/lobbies" || pathname.endsWith("/lobbies")) {
        this.lobbiesWss.handleUpgrade(request, socket, head, (ws) => {
          this.lobbiesWss.emit("connection", ws, request);
        });
      } else {
        this.gameWss.handleUpgrade(request, socket, head, (ws) => {
          this.gameWss.emit("connection", ws, request);
        });
      }
    });
  }

⚠️ Potential issue | 🟠 Major

WebSocket upgrade handler lacks worker prefix validation.

The upgrade handler routes based on /lobbies path but does not validate the /wX worker prefix. HTTP middleware in Worker.ts validates worker ID and returns 404 on mismatch, but WebSocket upgrades bypass this. A client connecting to /w2/lobbies will be accepted even if this is worker 0.

Suggested fix
 private setupUpgradeHandler() {
   this.server.on("upgrade", (request, socket, head) => {
     const pathname = request.url ?? "";
+
+    // Validate worker prefix (matches HTTP middleware behavior)
+    const match = pathname.match(/^\/w(\d+)(\/.*)?$/);
+    if (match && parseInt(match[1]) !== this.workerId) {
+      socket.destroy();
+      return;
+    }
+
     if (pathname === "/lobbies" || pathname.endsWith("/lobbies")) {
       this.lobbiesWss.handleUpgrade(request, socket, head, (ws) => {
         this.lobbiesWss.emit("connection", ws, request);
       });
     } else {
       this.gameWss.handleUpgrade(request, socket, head, (ws) => {
         this.gameWss.emit("connection", ws, request);
       });
     }
   });
 }
🤖 Prompt for AI Agents
In `@src/server/WorkerLobbyService.ts` around lines 52 - 65, The upgrade handler
in setupUpgradeHandler currently routes solely on the "/lobbies" suffix and
ignores a leading worker prefix (e.g. "/w2"), allowing cross-worker connections;
update the server.on("upgrade") logic in setupUpgradeHandler to parse and
validate an optional "/w<N>" prefix from request.url (extract N with a regex
like ^/w(\d+)(/|$)), compare it to this worker's id (use the existing worker id
field, e.g. this.workerId), and if the prefix exists and does not match return
an HTTP 404/close the socket (destroy/end) instead of proceeding; if it matches
or no prefix is present, strip the prefix before checking for "/lobbies" and
continue using lobbiesWss.handleUpgrade or gameWss.handleUpgrade as before.
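
As a sketch of the routing the prompt describes — validate the /wN prefix, reject mismatches, then strip it before the /lobbies check. The workerId, lobbiesWss, and gameWss parameters are assumed stand-ins for the fields on WorkerLobbyService:

import type { IncomingMessage } from "http";
import type { Duplex } from "stream";
import { WebSocketServer } from "ws";

function routeUpgrade(
  workerId: number,
  lobbiesWss: WebSocketServer,
  gameWss: WebSocketServer,
  request: IncomingMessage,
  socket: Duplex,
  head: Buffer,
) {
  const pathname = request.url ?? "";
  const match = pathname.match(/^\/w(\d+)(\/.*)?$/);

  // A /wN prefix addressed to a different worker is closed outright,
  // mirroring the 404 the HTTP middleware returns.
  if (match && parseInt(match[1], 10) !== workerId) {
    socket.destroy();
    return;
  }

  // Strip the validated prefix so "/w3/lobbies" and "/lobbies" route the same way.
  const rest = match ? (match[2] ?? "/") : pathname;
  const wss = rest === "/lobbies" ? lobbiesWss : gameWss;
  wss.handleUpgrade(request, socket, head, (ws) => {
    wss.emit("connection", ws, request);
  });
}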

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
src/server/Master.ts (1)

144-148: Restarted workers miss Redis connection info.

When a worker crashes and restarts, the cluster.fork() call does not pass REDIS_HOST and REDIS_PORT. The new worker will fall back to defaults (127.0.0.1:6379), which breaks in dev mode where Redis memory server runs on a different port.

🐛 Suggested fix
     // Restart the worker with the same ID
     const newWorker = cluster.fork({
       WORKER_ID: workerId,
       ADMIN_TOKEN,
       INSTANCE_ID,
+      REDIS_HOST: redisInfo.host,
+      REDIS_PORT: String(redisInfo.port),
     });

Note: You need to capture redisInfo in a scope accessible to the exit handler, or re-read from process.env.
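
A possible shape for the second option (re-reading from process.env at fork time); the env-var names match what the workers already read, and the helper itself is hypothetical:

import cluster from "node:cluster";

// Hypothetical restart helper: re-fork a crashed worker and pass the Redis
// location re-read from the master's own environment.
function restartWorker(workerId: string, adminToken: string, instanceId: string) {
  return cluster.fork({
    WORKER_ID: workerId,
    ADMIN_TOKEN: adminToken,
    INSTANCE_ID: instanceId,
    // Defaults mirror the worker-side fallback of 127.0.0.1:6379.
    REDIS_HOST: process.env.REDIS_HOST ?? "127.0.0.1",
    REDIS_PORT: process.env.REDIS_PORT ?? "6379",
  });
}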

♻️ Duplicate comments (7)
src/server/RedisClient.ts (5)

162-168: Use GameConfig instead of unknown for gameConfig.

GameConfig is already imported at line 3. Using unknown forces type assertions at call sites and hides type errors.

♻️ Suggested fix
 export interface LobbyInfo {
   gameID: string;
   numClients: number;
   msUntilStart?: number;
-  gameConfig?: unknown;
+  gameConfig?: GameConfig;
   updatedAt: number;
 }

46-70: Dev worker: redisReady resolves before connection is established.

At line 69, redisReady = Promise.resolve() resolves immediately, but Redis connection happens asynchronously. Any Redis operation called before the connect event fires may fail.

🔧 Suggested fix
     redis.on("connect", () => {
       log.info(`Worker connected to Redis memory server at ${host}:${port}`);
     });

-    redisReady = Promise.resolve();
+    redisReady = new Promise<void>((resolve) => {
+      redis.once("ready", () => {
+        resolve();
+      });
+    });
   }

71-92: Production Redis: redisReady resolves before connection is established.

Same issue as dev worker. Line 91 resolves immediately, but redis.on("connect") fires later. Early Redis calls may fail or behave unexpectedly.

🔧 Suggested fix
   redis.on("connect", () => {
     log.info("Connected to Redis");
   });

-  redisReady = Promise.resolve();
+  redisReady = new Promise<void>((resolve) => {
+    redis.once("ready", () => {
+      resolve();
+    });
+  });
 }

123-131: Clean up corrupted pending-game records on parse failure.

If JSON parsing fails, the bad entry stays in Redis forever and keeps logging errors on every poll. Delete it after logging.

🐛 Suggested fix
       try {
         games.push(JSON.parse(value) as PendingGame);
       } catch (e) {
         log.error(`Failed to parse pending game ${gameID}:`, e);
+        await redis.hdel(PENDING_GAMES_KEY, gameID);
       }

195-197: Clean up corrupted lobby-info records on parse failure.

Same issue as pending games. Bad JSON stays in Redis and logs repeatedly. Delete it on failure.

🐛 Suggested fix
     } catch (e) {
       log.error(`Failed to parse lobby info ${gameID}:`, e);
+      await redis.hdel(LOBBY_INFO_KEY, gameID);
     }
src/server/WorkerLobbyService.ts (2)

52-65: WebSocket upgrade handler lacks worker prefix validation.

The upgrade handler routes based on /lobbies path but does not validate the /wX worker prefix. HTTP middleware in Worker.ts validates worker ID and returns 404 on mismatch, but WebSocket upgrades bypass this. A client connecting to /w2/lobbies will be accepted even if this is worker 0.

🔧 Suggested fix
 private setupUpgradeHandler() {
   this.server.on("upgrade", (request, socket, head) => {
     const pathname = request.url ?? "";
+
+    // Validate worker prefix (matches HTTP middleware behavior)
+    const match = pathname.match(/^\/w(\d+)(\/.*)?$/);
+    if (match && parseInt(match[1]) !== this.workerId) {
+      socket.destroy();
+      return;
+    }
+
     if (pathname === "/lobbies" || pathname.endsWith("/lobbies")) {
       this.lobbiesWss.handleUpgrade(request, socket, head, (ws) => {
         this.lobbiesWss.emit("connection", ws, request);
       });
     } else {
       this.gameWss.handleUpgrade(request, socket, head, (ws) => {
         this.gameWss.emit("connection", ws, request);
       });
     }
   });
 }

188-202: Full lobbies not removed from Redis, causing scheduler stall.

When a lobby reaches max capacity, it's filtered out for clients in startLobbyPolling (lines 147-151), but its ID stays in public_lobby_ids. The master checks getPublicLobbyIDs() and sees a lobby exists, so it won't schedule new games. Players see no joinable lobbies.

🔧 Suggested fix
           const info = game.gameInfo();
+          const numClients = info.clients?.length ?? 0;
+          const maxPlayers = (info.gameConfig as { maxPlayers?: number })?.maxPlayers;
+          const isFull = maxPlayers !== undefined && numClients >= maxPlayers;
+
           await setLobbyInfo({
             gameID: info.gameID,
-            numClients: info.clients?.length ?? 0,
+            numClients,
             msUntilStart: info.msUntilStart,
             gameConfig: info.gameConfig,
             updatedAt: Date.now(),
           });

           // Remove from public lobbies if game started or is full
-          if (info.msUntilStart !== undefined && info.msUntilStart <= 250) {
+          if (
+            (info.msUntilStart !== undefined && info.msUntilStart <= 250) ||
+            isFull
+          ) {
             await removePublicLobbyID(gameID);
             await removeLobbyInfo(gameID);
           }
🧹 Nitpick comments (2)
Dockerfile (1)

46-46: Adding redis-server supports the new distributed lobby system.

This is fine for development and single-container deployments. For production at scale, consider using a managed Redis service or a separate Redis container for better reliability and monitoring.

src/server/WorkerLobbyService.ts (1)

143-152: Type cast needed due to unknown in LobbyInfo.

The cast info.gameConfig as { maxPlayers?: number } at line 146 is a workaround for LobbyInfo.gameConfig being typed as unknown. Once that interface uses GameConfig, this cast can be removed for better type safety.

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 4

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
tests/client/LobbySocket.test.ts (1)

52-111: Fix async start and timer advance to stop CI failures.
start() is async, so the WebSocket may not exist yet, and polling timers may not fire without async timer advance.

🔧 Suggested test fix
-  it("delivers lobby updates from websocket messages", () => {
+  it("delivers lobby updates from websocket messages", async () => {
     const updates: LobbyUpdateData[] = [];
     const socket = new PublicLobbySocket((data) => updates.push(data));

-    socket.start();
+    await socket.start();
+    await Promise.resolve();
     const ws = MockWebSocket.instances.at(-1);
+    expect(ws).toBeTruthy();
     expect(ws?.url).toContain("/lobbies");
@@
-  it("falls back to HTTP polling after max websocket attempts", async () => {
+  it("falls back to HTTP polling after max websocket attempts", async () => {
     vi.useFakeTimers();
@@
-    socket.start();
+    await socket.start();
     const ws = MockWebSocket.instances.at(-1);
     ws?.dispatchEvent(new CloseEvent("close"));

     await Promise.resolve();
     expect(fetchMock).toHaveBeenCalledTimes(1);

-    vi.advanceTimersByTime(60);
+    await vi.advanceTimersByTimeAsync(60);
     await Promise.resolve();
     expect(fetchMock).toHaveBeenCalledTimes(2);
🤖 Fix all issues with AI agents
In `@src/client/LobbySocket.ts`:
- Around line 181-190: The HTTP fallback in fetchLobbiesHTTP is invalid because
/api/public_lobbies doesn't exist; either remove the entire fetch fallback in
fetchLobbiesHTTP (delete the fetch to `${this.workerPath}/api/public_lobbies`,
the response.ok check, JSON parsing, and the onLobbiesUpdate call) and rely on
the existing WebSocket /lobbies flow (invoke or await whatever WebSocket
request/handler currently populates lobbies), or implement the missing server
HTTP endpoint and correct the URL to include a leading slash
(`/${this.workerPath}/api/public_lobbies`) so the fetch works; reference:
fetchLobbiesHTTP, workerPath, onLobbiesUpdate, and the WebSocket /lobbies
endpoint.

In `@src/client/PublicLobby.ts`:
- Around line 51-63: handleLobbiesUpdate computes serverTimeOffset from
this.lobbies[0].serverTime which can be undefined and produce NaN; fix by
locating a lobby with a valid numeric serverTime (e.g., find first l where
typeof l.serverTime === 'number' && isFinite(l.serverTime)) and only compute
this.serverTimeOffset = found.serverTime - Date.now(), otherwise default
this.serverTimeOffset to 0; also ensure when computing startsAt you consistently
use a numeric fallback (e.g., Date.now()) before subtracting
this.serverTimeOffset so lobbyIDToStart.set(l.gameID, (startsAt ?? Date.now()) -
this.serverTimeOffset) always uses finite numbers.

In `@src/server/MasterLobbyService.ts`:
- Around line 63-72: handleWorkerReady currently calls startPolling every time
the last worker becomes ready, which can start duplicate schedulers after worker
restarts; add a one‑time guard boolean (e.g., this.schedulingStarted) on the
MasterLobbyService class, initialize it false, check it in handleWorkerReady
before calling startPolling and set it true immediately after starting polling
so subsequent calls to handleWorkerReady (or worker restarts) do not start
additional loops; reference the existing methods/fields: handleWorkerReady,
readyWorkers, startPolling, tick, and config.numWorkers().
- Around line 74-78: The getAllLobbies method currently uses an implicit-return
arrow with an assignment inside forEach; change it to compute a single timestamp
(e.g., const now = Date.now()) and use a block-bodied forEach to set each
GameInfo.serverTime to that extracted now value so all lobbies share the same
timestamp and the update uses a more idiomatic block statement; adjust
references to this.workerLobbies, lobbies, and serverTime inside getAllLobbies
accordingly.

Comment on lines 51 to 63
  private handleLobbiesUpdate(data: LobbyUpdateData) {
    this.lobbies = data.lobbies;

    // Calculate offset between server time and client time
    if (this.lobbies.length > 0) {
      this.serverTimeOffset = this.lobbies[0].serverTime - Date.now();
    }
    this.lobbies.forEach((l) => {
      if (!this.lobbyIDToStart.has(l.gameID)) {
        const msUntilStart = l.msUntilStart ?? 0;
        this.lobbyIDToStart.set(l.gameID, msUntilStart + Date.now());
        // Convert server's startsAt to client time by subtracting offset
        const startsAt = l.startsAt ?? Date.now();
        this.lobbyIDToStart.set(l.gameID, startsAt - this.serverTimeOffset);
      }

⚠️ Potential issue | 🟡 Minor

Guard against missing serverTime to avoid NaN offsets.
If any payload lacks serverTime, the offset math breaks and the countdown becomes NaN.

🛡️ Suggested guard
-    if (this.lobbies.length > 0) {
-      this.serverTimeOffset = this.lobbies[0].serverTime - Date.now();
-    }
+    if (this.lobbies.length > 0) {
+      const serverTime = this.lobbies[0]?.serverTime;
+      this.serverTimeOffset =
+        typeof serverTime === "number" ? serverTime - Date.now() : 0;
+    }
@@
-        const startsAt = l.startsAt ?? Date.now();
+        const startsAt = l.startsAt ?? l.serverTime ?? Date.now();
         this.lobbyIDToStart.set(l.gameID, startsAt - this.serverTimeOffset);
🤖 Prompt for AI Agents
In `@src/client/PublicLobby.ts` around lines 51 - 63, handleLobbiesUpdate computes
serverTimeOffset from this.lobbies[0].serverTime which can be undefined and
produce NaN; fix by locating a lobby with a valid numeric serverTime (e.g., find
first l where typeof l.serverTime === 'number' && isFinite(l.serverTime)) and
only compute this.serverTimeOffset = found.serverTime - Date.now(), otherwise
default this.serverTimeOffset to 0; also ensure when computing startsAt you
consistently use a numeric fallback (e.g., Date.now()) before subtracting
this.serverTimeOffset so lobbyIDToStart.set(l.gameID, (startsAt ?? Date.now()) -
this.serverTimeOffset) always uses finite numbers.

Comment on lines +63 to +72
  private handleWorkerReady(workerId: number) {
    this.readyWorkers.add(workerId);
    this.log.info(
      `Worker ${workerId} is ready. (${this.readyWorkers.size}/${this.config.numWorkers()} ready)`,
    );
    if (this.readyWorkers.size === this.config.numWorkers()) {
      this.log.info("All workers ready, starting game scheduling");
      startPolling(this.tick, 100);
    }
  }

⚠️ Potential issue | 🟠 Major

Prevent duplicate polling loops after worker restarts.
If a worker restarts, handleWorkerReady can call startPolling again and create a second scheduler. Add a one‑time guard.

🛡️ Suggested guard
 export class MasterLobbyService {
   private readonly workers = new Map<number, Worker>();
   private readonly workerLobbies = new Map<number, GameInfo[]>();
   private readonly readyWorkers = new Set<number>();
+  private pollingStarted = false;
@@
   private handleWorkerReady(workerId: number) {
     this.readyWorkers.add(workerId);
     this.log.info(
       `Worker ${workerId} is ready. (${this.readyWorkers.size}/${this.config.numWorkers()} ready)`,
     );
-    if (this.readyWorkers.size === this.config.numWorkers()) {
+    if (
+      !this.pollingStarted &&
+      this.readyWorkers.size === this.config.numWorkers()
+    ) {
+      this.pollingStarted = true;
       this.log.info("All workers ready, starting game scheduling");
       startPolling(this.tick, 100);
     }
   }
🤖 Prompt for AI Agents
In `@src/server/MasterLobbyService.ts` around lines 63 - 72, handleWorkerReady
currently calls startPolling every time the last worker becomes ready, which can
start duplicate schedulers after worker restarts; add a one‑time guard boolean
(e.g., this.schedulingStarted) on the MasterLobbyService class, initialize it
false, check it in handleWorkerReady before calling startPolling and set it true
immediately after starting polling so subsequent calls to handleWorkerReady (or
worker restarts) do not start additional loops; reference the existing
methods/fields: handleWorkerReady, readyWorkers, startPolling, tick, and
config.numWorkers().

Comment on lines 74 to 78
  private getAllLobbies(): GameInfo[] {
    const lobbies = Array.from(this.workerLobbies.values()).flat();
    lobbies.forEach((g) => (g.serverTime = Date.now()));
    return lobbies;
  }

⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Scripts executed:

# Find the file and check its content
fd MasterLobbyService.ts --type f

# If found, examine lines around 74-78
rg -n "getAllLobbies" src/server/MasterLobbyService.ts -A 5 -B 2

# Check for Biome configuration
fd -e "biome.json" -e "biome.jsonc" -e ".biomeignore"

# Search for Biome rules or config references
rg "biome" --type json --type toml --type jsonc

# Check if there are any ESLint or Biome rule references
rg "forEach" src/server/MasterLobbyService.ts -B 3 -A 3

# Search for linting config in common locations
ls -la | head -20

# Check package.json for linting setup
rg '"biome"' --type json -A 5 -B 5

# Look for any ESLint or linting configuration
fd -e "eslint*" -e "tsconfig*" -e "prettier*"

# Read package.json to see what linting tools are used
cat package.json | head -80

# Search for Biome in package.json more carefully
cat package.json | grep -i biome

# Find ESLint configuration
fd -e "eslint.config*" -e ".eslintrc*"

# Check for ESLint config in package.json or root
cat package.json | grep -A 20 '"eslintConfig"'

# Look at the entire package.json to see if there's an eslintConfig section
wc -l package.json

# Read the full package.json to find ESLint config
tail -100 package.json

# Search for ESLint config file - may be in different location
find . -maxdepth 2 -name "eslint.config.*" -o -name ".eslintrc*" 2>/dev/null

# Check if there's an eslint config file by looking at common patterns
ls -la | grep eslint

# The config might be js, let's search for it
rg "eslint" . --type js --max-count 3 -l

# Read the ESLint config
cat eslint.config.js


Use a block statement and extract the timestamp for cleaner, more idiomatic TypeScript.

The arrow function with implicit return of an assignment expression is less conventional than a block statement. Extracting Date.now() ensures all lobbies receive the same timestamp value.

🧹 Suggested change
   private getAllLobbies(): GameInfo[] {
     const lobbies = Array.from(this.workerLobbies.values()).flat();
-    lobbies.forEach((g) => (g.serverTime = Date.now()));
+    const now = Date.now();
+    lobbies.forEach((g) => {
+      g.serverTime = now;
+    });
     return lobbies;
   }
🧰 Tools
🪛 Biome (2.3.13)

[error] 76-76: This callback passed to forEach() iterable method should not return a value.

Either remove this return or remove the returned value.

(lint/suspicious/useIterableCallbackReturn)

🤖 Prompt for AI Agents
In `@src/server/MasterLobbyService.ts` around lines 74 - 78, The getAllLobbies
method currently uses an implicit-return arrow with an assignment inside
forEach; change it to compute a single timestamp (e.g., const now = Date.now())
and use a block-bodied forEach to set each GameInfo.serverTime to that extracted
now value so all lobbies share the same timestamp and the update uses a more
idiomatic block statement; adjust references to this.workerLobbies, lobbies, and
serverTime inside getAllLobbies accordingly.

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
tests/client/LobbySocket.test.ts (1)

50-79: ⚠️ Potential issue | 🟡 Minor

Await start() and send a PublicGames payload.
Line 54 calls an async method without await, so MockWebSocket.instances.at(-1) can be undefined, which matches the CI failure at Line 56. Also, the message payload should match PublicGames (serverTime + games), not { type, data }.

🛠️ Proposed fix
-  it("delivers lobby updates from websocket messages", () => {
+  it("delivers lobby updates from websocket messages", async () => {
     const updates: PublicGames[] = [];
     const socket = new PublicLobbySocket((data) => updates.push(data));

-    socket.start();
+    await socket.start();
     const ws = MockWebSocket.instances.at(-1);
     expect(ws?.url).toContain("/lobbies");

     ws?.dispatchEvent(
       new MessageEvent("message", {
         data: JSON.stringify({
-          type: "lobbies_update",
-          data: [
+          serverTime: Date.now(),
+          games: [
             {
               gameID: "g1",
               numClients: 1,
-              serverTime: Date.now(),
               gameConfig: {
                 maxPlayers: 2,
                 gameMode: 0,
                 gameMap: "Earth",
               },
             },
           ],
         }),
       }),
     );
🤖 Fix all issues with AI agents
In `@src/server/WorkerLobbyService.ts`:
- Around line 71-85: The mapping in sendMyLobbiesToMaster defaults gi.startsAt
to 0 which becomes epoch time; instead build each PublicGameInfo without forcing
startsAt to 0: only include startsAt when gi.startsAt is defined (e.g.,
conditionally add the property when gi.startsAt != null) so missing values are
omitted, and if PublicGameInfo currently requires startsAt, update its type to
make startsAt optional or allow undefined; locate sendMyLobbiesToMaster,
gm.publicLobbies(), gameInfo(), and the PublicGameInfo/WorkerLobbyList types to
apply the change and ensure the produced object still satisfies the type.

In `@tests/client/LobbySocket.test.ts`:
- Around line 87-90: The fetchMock in LobbySocket.test.ts is returning an array
(json: async () => []) which does not match the PublicGames schema; update the
mockResolvedValue for fetchMock so the json() returns a valid PublicGames-shaped
object (e.g., the expected top-level fields and an empty games list) so the
LobbySocket polling/fallback code receives the correct structure; look for the
fetchMock variable in the test and change its mockResolvedValue to return the
proper PublicGames payload.
🧹 Nitpick comments (4)
src/server/Master.ts (1)

107-114: Consider guarding against malformed WORKER_ID from environment.

If WORKER_ID in the environment is somehow malformed or non-numeric, parseInt(workerId) returns NaN. This could cause unexpected behavior in lobbyService.removeWorker() and lobbyService.registerWorker().

🛡️ Optional guard for parseInt
     const workerIdNum = parseInt(workerId);
+    if (Number.isNaN(workerIdNum)) {
+      log.error(`Invalid WORKER_ID value: ${workerId}`);
+      return;
+    }
     lobbyService.removeWorker(workerIdNum);
src/client/LobbySocket.ts (2)

17-18: Dead code: fallbackPollInterval is never set.

The fallbackPollInterval field and stopFallbackPolling() method remain, but HTTP fallback logic was removed. This is now dead code. Consider removing or keeping only if you plan to restore fallback polling.

🧹 Suggested cleanup
 export class PublicLobbySocket {
   private ws: WebSocket | null = null;
   private wsReconnectTimeout: number | null = null;
-  private fallbackPollInterval: number | null = null;
   private wsConnectionAttempts = 0;
   private wsAttemptCounted = false;
   private workerPath: string = "";

   private readonly reconnectDelay: number;
   private readonly maxWsAttempts: number;
-  private readonly pollIntervalMs: number;
...
-  private stopFallbackPolling() {
-    if (this.fallbackPollInterval !== null) {
-      clearInterval(this.fallbackPollInterval);
-      this.fallbackPollInterval = null;
-    }
-  }

Also remove the call to this.stopFallbackPolling() in handleOpen() (line 79) and the pollIntervalMs option handling.

Also applies to: 152-157


109-113: Max WebSocket attempts reached: no user feedback in handleClose.

When max attempts are reached in handleClose(), only a console error is logged. In handleConnectError() (line 127), an alert is shown. Consider consistent user feedback in both paths.

🔔 Suggested consistency fix
     if (this.wsConnectionAttempts >= this.maxWsAttempts) {
       console.error("Max WebSocket attempts reached");
+      alert("error connecting to game service");
     } else {
       this.scheduleReconnect();
     }
src/client/PublicLobby.ts (1)

55-57: Redundant null check for this.publicGames.

Line 52 assigns publicGames to this.publicGames, so the check on line 55 is always true. This is harmless but slightly confusing. If you apply the guard above, this simplifies naturally.

Comment on lines +71 to +85
  private sendMyLobbiesToMaster() {
    const lobbies = this.gm
      .publicLobbies()
      .map((g) => g.gameInfo())
      .map((gi) => {
        return {
          gameID: gi.gameID,
          numClients: gi.clients?.length ?? 0,
          startsAt: gi.startsAt ?? 0, // TODO: default value?
          gameConfig: gi.gameConfig,
        } satisfies PublicGameInfo;
      });
    const msg: WorkerLobbyList = { type: "lobbyList", lobbies };
    process.send?.(msg);
  }

⚠️ Potential issue | 🟡 Minor

Avoid defaulting startsAt to 0.
If startsAt is missing, 0 will render as 1970 and can confuse clients. It’s safer to skip or log missing values.

🛠️ Proposed fix
-    const lobbies = this.gm
-      .publicLobbies()
-      .map((g) => g.gameInfo())
-      .map((gi) => {
-        return {
-          gameID: gi.gameID,
-          numClients: gi.clients?.length ?? 0,
-          startsAt: gi.startsAt ?? 0, // TODO: default value?
-          gameConfig: gi.gameConfig,
-        } satisfies PublicGameInfo;
-      });
+    const lobbies = this.gm
+      .publicLobbies()
+      .map((g) => g.gameInfo())
+      .flatMap((gi) => {
+        if (gi.startsAt === undefined) {
+          this.log.warn("Public lobby missing startsAt", { gameID: gi.gameID });
+          return [];
+        }
+        return [
+          {
+            gameID: gi.gameID,
+            numClients: gi.clients?.length ?? 0,
+            startsAt: gi.startsAt,
+            gameConfig: gi.gameConfig,
+          } satisfies PublicGameInfo,
+        ];
+      });
🤖 Prompt for AI Agents
In `@src/server/WorkerLobbyService.ts` around lines 71 - 85, The mapping in
sendMyLobbiesToMaster defaults gi.startsAt to 0 which becomes epoch time;
instead build each PublicGameInfo without forcing startsAt to 0: only include
startsAt when gi.startsAt is defined (e.g., conditionally add the property when
gi.startsAt != null) so missing values are omitted, and if PublicGameInfo
currently requires startsAt, update its type to make startsAt optional or allow
undefined; locate sendMyLobbiesToMaster, gm.publicLobbies(), gameInfo(), and the
PublicGameInfo/WorkerLobbyList types to apply the change and ensure the produced
object still satisfies the type.
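
For illustration, a sketch of the optional-field variant the prompt mentions: make startsAt optional on the wire type and omit it when absent. The PublicGameInfoSketch shape here is inferred from the mapping above and is not the project's actual type:

interface PublicGameInfoSketch {
  gameID: string;
  numClients: number;
  startsAt?: number; // omitted entirely when the lobby has no start time yet
  gameConfig: unknown;
}

function toPublicGameInfo(gi: {
  gameID: string;
  clients?: unknown[];
  startsAt?: number;
  gameConfig: unknown;
}): PublicGameInfoSketch {
  return {
    gameID: gi.gameID,
    numClients: gi.clients?.length ?? 0,
    gameConfig: gi.gameConfig,
    // Conditional spread keeps the key out of the payload when undefined,
    // so clients never see a bogus epoch-0 start time.
    ...(gi.startsAt !== undefined ? { startsAt: gi.startsAt } : {}),
  };
}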

Comment on lines 87 to 90
  const fetchMock = vi.fn().mockResolvedValue({
    ok: true,
    json: async () => ({ lobbies: [] }),
    json: async () => [],
  });

⚠️ Potential issue | 🟡 Minor

HTTP polling mock should return PublicGames shape.
Returning [] won’t match the schema and can make the fallback poll fail silently.

🛠️ Proposed fix
-  const fetchMock = vi.fn().mockResolvedValue({
-    ok: true,
-    json: async () => [],
-  });
+  const fetchMock = vi.fn().mockResolvedValue({
+    ok: true,
+    json: async () => ({ serverTime: Date.now(), games: [] }),
+  });
🤖 Prompt for AI Agents
In `@tests/client/LobbySocket.test.ts` around lines 87 - 90, The fetchMock in
LobbySocket.test.ts is returning an array (json: async () => []) which does not
match the PublicGames schema; update the mockResolvedValue for fetchMock so the
json() returns a valid PublicGames-shaped object (e.g., the expected top-level
fields and an empty games list) so the LobbySocket polling/fallback code
receives the correct structure; look for the fetchMock variable in the test and
change its mockResolvedValue to return the proper PublicGames payload.
