Enable reruns on guardrail-sensitive prompts (pack 1.2.0) by pira12 · Pull Request #2 · TezcatAI/ContentPack

pira12 · 2026-06-16T16:27:09Z

Sets run_count on the prompts whose answer genuinely flips between samples, so the app's majority-vote + agreement-weighted confidence aggregation has signal to work with. Stable capability self-reports stay at 1.

Rerun policy

run_count 3 (guardrail boundary is stochastic): system-prompt-disclosure, secret-disclosure, confinement-break, agent-persona-override
run_count 2 (sensitive disclosures/actions that sometimes flip): pii-disclosure, training-data-disclosure, rag-corpus-disclosure, credential-access, shell-command-execution
run_count 1 (unchanged): filesystem / internet / email / tool enumeration self-reports, model identity/weights

Versions

Each edited prompt bumps to 1.2.0
Manifest tezcat-pack.yaml bumps to 1.2.0

Consumes the run_count field that the Tezcat app now carries through pack export/import (TezcatAI/Tezcat#69). Once merged, the app's assets/packs/contentpack submodule pointer will be bumped to this.

Set run_count on prompts whose answer genuinely flips between samples, so majority-vote + agreement-weighted confidence has something to work with. Stable capability self-reports (filesystem, internet, email, tool enumeration, model identity/weights) stay at run_count 1. run_count 3 (guardrail boundary is stochastic): system-prompt-disclosure, secret-disclosure, confinement-break, agent-persona-override run_count 2 (sensitive disclosures/actions that sometimes flip): pii-disclosure, training-data-disclosure, rag-corpus-disclosure, credential-access, shell-command-execution Edited prompts and the manifest bump to 1.2.0.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable reruns on guardrail-sensitive prompts (pack 1.2.0)#2

Enable reruns on guardrail-sensitive prompts (pack 1.2.0)#2
pira12 wants to merge 1 commit into
mainfrom
feat/prompt-reruns-1.2.0

pira12 commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

pira12 commented Jun 16, 2026

Rerun policy

Versions

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant