Skip to content

ISSUE 392 + agent independence discipline#45

Open
isaacsight wants to merge 3 commits into
mainfrom
claude/ai-news-updates-2u3wb
Open

ISSUE 392 + agent independence discipline#45
isaacsight wants to merge 3 commits into
mainfrom
claude/ai-news-updates-2u3wb

Conversation

@isaacsight
Copy link
Copy Markdown
Owner

Brings the branch's post-#44 work into main so ISSUE 392 (already deployed to gh-pages) is permanent and main stays in sync with the live site.

Contents

  1. Agent independence discipline (82c3f31, ecf5ccd)

    • INDEPENDENCE.md — the governing doc: the one test, the five independence sources, the autocorrelation trap, Mechanical/Judgment taxonomy, mandatory Independence source header, and the kbot-BYOK heterogeneity policy.
    • japanese-editor.md rewritten as the worked example (evidence-grounded, escalates-to-human, PASS-TO-HUMAN verdict).
    • Independence source header rolled out to all 53 agents.
    • First genuinely independent check run: 391's Japanese through a different model (GPT-OSS-20B) caught five strings the same-model self-review had passed — and false-flagged a house phrase, confirming the human is the final authority.
  2. ISSUE 392 — THE DESK THAT SURVIVES (ee28215)

    • Range restoration after four AI issues: a craft essay on the analog objects beside the screen, carrying 391's "the hand survives" thread into culture rather than cutting away.
    • kraft + classic + coffee accent + BY HAND seal; essay field-piece; back cover still-life (Flux/Pollinations).
    • JP run through the independent model before shipping per the new discipline; findings adjudicated (kept house register), one canon question (〜の用) staged for a native human.

Verification

  • npx tsc --noEmit + npm run build clean.
  • Already deployed to gh-pages from the branch; this merge makes it permanent.

Open item (not blocking): the 〜の用 construction in 369 + 392 awaits a native-speaker decision.

https://claude.ai/code/session_01X9H4TPZ1aT8Ep4TCkiDzfv


Generated by Claude Code

claude added 3 commits May 28, 2026 22:18
A critical pass exposed that this session had one model author an issue
and then "audit" its own design, language, and imagery and report it all
passed — the audits were never independent. This adds the discipline to
prevent reviewer theater.

INDEPENDENCE.md (new) is the governing doc: the one test ("what does this
agent know, see, or optimize that the author didn't?"), the five
independence sources (different-model, evidence, adversarial, different
criteria, human-gate), the autocorrelation trap (a same-model "team" is
one model agreeing with itself N times, not N reviewers), a
Mechanical-vs-Judgment taxonomy, a mandatory "Independence source:"
header for every agent, and the heterogeneity policy: exploit kbot's
multi-provider BYOK to run judgment and specialist review agents on a
different provider's model than the author (e.g. Japanese review on a
JP-native model), routed via per-agent config rather than hardcoded.

japanese-editor.md is rewritten as the worked example: it declares its
independence source, adds an evidence-grounding protocol step (usage,
corpus, dictionary, and precedent checks instead of parametric opinion),
and changes its verdict to PASS-TO-HUMAN / NEEDS-REVISION — it escalates
to a native reader, it never certifies.

designer.md and reviewer.md get the Independence-source header as the
start of rollout: mechanical findings are trustworthy through tools (cite
the tool or number), but taste and judgment findings are flagged as
non-independent on the author's own model unless run on a different model
or staged for a human. Self-agreement is not assurance.

https://claude.ai/code/session_01X9H4TPZ1aT8Ep4TCkiDzfv
Completes the independence discipline started in the prior commit.

Rollout: the "Independence source" header is now on all 53 agents
(previously only designer/reviewer/japanese-editor). Each is classified
and declares its source honestly:
- mechanical  — tool is the independence (qa, debugger, deployer, devops,
  ship, github, environment, hardware, performance, pulse, sync,
  autopoiesis, stream-auditor); cite the tool, don't assert from memory.
- mixed/adversarial — security, hacker, immune: tools + hunt-to-reject,
  prefer a different model for novel-attack reasoning.
- mixed/evidence — documenter, obsidian-sync, rival-intel, admin, replit,
  speed-of-light: ground claims in artifacts.
- judgment — taste/strategy/creative agents (pixel-artist, product,
  playtester, the strategy set, video/creative set, architects): NOT
  independent on the author's model; run on a different-provider model
  via kbot BYOK or stage for a human; never report self-agreement as
  assurance.
- authoring — magazine-editor: independence comes from downstream review
  plus a human, not self-check.

Proof: re-ran ISSUE 391's Japanese through a genuinely different model
(GPT-OSS-20B, different provider/weights). It caught five strings the
original same-model self-review had passed — demonstrating the
autocorrelation the doctrine predicts. It also false-flagged an
intentional house phrase, confirming the converse: a decorrelated model
surfaces candidates, but a native human is the final authority. 391's
Japanese is unchanged; the candidates are staged for human adjudication,
exactly as japanese-editor.md now prescribes.

https://claude.ai/code/session_01X9H4TPZ1aT8Ep4TCkiDzfv
A deliberate restoration of range. 388–391 ran four issues deep on
agentic AI and the magazine was drifting toward a trade newsletter; 392
returns to the culture/craft format the publication was built on
(360–366) without cutting away from the thread. 391 argued that taste
and the hand are what survive the machine; 392 stops arguing and goes
and looks — at the analog objects a coder keeps beside the screen
(notebook, pen, keyboard, cup), and at choosing them by hand as the same
scarce discernment the work itself now turns on. Continuity in the
argument, not the topic.

Identity: essay (field-piece, 369 template), kraft stock + classic
layout (breaks the asymmetric-left streak of 389–391) + coffee accent +
a BY HAND seal. Dossier reads as an inventory of the six inches around
the screen. Pull quote: "The screen is rented. The desk is yours."
Registered in index.ts; typecheck + build clean.

Back cover (public/back-covers/392-desk.jpg): a Flux/Pollinations
still-life of the desk objects on kraft — the issue's subject made
literal — passing the designer.md acceptance bar.

The Japanese was run through a genuinely different model (GPT-OSS-20B)
before shipping, per the independence discipline. Its findings were
adjudicated, not blindly applied: most alternatives were wrong for house
register and the JP is kept as written. One signal is escalated for a
native-human decision — it flags the house construction 〜の用 (also in
369) as unidiomatic, which legitimately questions canon. Staged, not
auto-changed.

https://claude.ai/code/session_01X9H4TPZ1aT8Ep4TCkiDzfv
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants