Skip to content

Commit 747c331

Browse files
committed
Run base2 fast no validation in nightly
1 parent 3bd8a85 commit 747c331

File tree

3 files changed

+15
-7
lines changed

3 files changed

+15
-7
lines changed
Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
import { createBase2 } from './base2'
2+
3+
const definition = {
4+
...createBase2('fast-no-validation'),
5+
id: 'base2-fast-no-validation',
6+
displayName: 'Buffy the Fast No Validation Orchestrator',
7+
}
8+
export default definition

.agents/base2/base2.ts

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -7,10 +7,11 @@ import {
77
} from '../types/secret-agent-definition'
88

99
export const createBase2: (
10-
mode: 'fast' | 'max',
10+
mode: 'fast-no-validation' | 'fast' | 'max',
1111
) => Omit<SecretAgentDefinition, 'id'> = (mode) => {
12-
const isFast = mode === 'fast'
12+
const isFast = mode === 'fast' || mode === 'fast-no-validation'
1313
const isMax = mode === 'max'
14+
const noValidation = mode === 'fast-no-validation'
1415
return {
1516
publisher,
1617
model: 'anthropic/claude-sonnet-4.5',
@@ -79,7 +80,6 @@ Continue to spawn layers of agents until have completed the user's request or re
7980
- **Understand first, act second:** Always gather context and read relevant files BEFORE editing files.
8081
- **Quality over speed:** Prioritize correctness over appearing productive. Fewer, well-informed agents are better than many rushed ones.
8182
- **Spawn mentioned agents:** If the user uses "@AgentName" in their message, you must spawn that agent.
82-
- **No final summary:** When the task is complete, inform the user in one sentence.
8383
- **Validate assumptions:** Use researchers, file pickers, and the read_files tool to verify assumptions about libraries and APIs before implementing.
8484
- **Proactiveness:** Fulfill the user's request thoroughly, including reasonable, directly implied follow-up actions.
8585
- **Confirm Ambiguity/Expansion:** Do not take significant actions beyond the clear scope of the request without confirming with the user. If asked *how* to do something, explain first, don't just do it.
@@ -113,7 +113,7 @@ Continue to spawn layers of agents until have completed the user's request or re
113113
# Response guidelines
114114
115115
- **Don't create a summary markdown file:** The user doesn't want markdown files they didn't ask for. Don't create them.
116-
- **Don't include final summary:** Don't include any final summary in your response. Don't describe the changes you made. Just let the user know that you have completed the task briefly.
116+
- **Keep final summary extremely concise:** Write only a few words for each change you made in the final summary.
117117
118118
${PLACEHOLDER.FILE_TREE_PROMPT_SMALL}
119119
${PLACEHOLDER.KNOWLEDGE_FILES_CONTENTS}
@@ -142,8 +142,8 @@ ${
142142
4. Use the str_replace or write_file tool to make the changes.`
143143
: `3. IMPORTANT: You must spawn a base2-gpt-5-worker agent inline (with spawn_agent_inline tool) to do the planning and editing.`
144144
}
145-
${isFast ? '5' : '4'}. Test your changes${isFast ? ' briefly' : ''} by running appropriate validation commands for the project (e.g. typechecks, tests, lints, etc.). You may have to explore the project to find the appropriate commands.
146-
${isFast ? '6' : '5'}. Inform the user that you have completed the task in one sentence without a final summary. Don't create any markdown summary files either, unless asked by the user. If you already finished the user request and said you're done, then don't say anything else.`,
145+
${noValidation ? '' : `${isFast ? '5' : '4'}. Test your changes${isFast ? ' briefly' : ''} by running appropriate validation commands for the project (e.g. typechecks, tests, lints, etc.). You may have to explore the project to find the appropriate commands.`}
146+
${isFast && !noValidation ? '6' : '5'}. Inform the user that you have completed the task in one sentence without a final summary. Don't create any markdown summary files either, unless asked by the user. If you already finished the user request and said you're done, then don't say anything else.`,
147147
stepPrompt: `Don't forget to spawn agents that could help, especially: the file-picker-max and code-searcher to get codebase context${isMax ? ', and the base2-gpt-5-worker agent to do the planning and editing' : ''}. After completing the user request, summarize your changes in a sentence or a few short bullet points. Do not create any summary markdown files, unless asked by the user. Then, end your turn.`,
148148
handleSteps: function* ({ params }) {
149149
let steps = 0

evals/buffbench/main-nightly.ts

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ async function main() {
1313

1414
const results = await runBuffBench({
1515
evalDataPath: path.join(__dirname, 'eval-codebuff.json'),
16-
agents: ['base', 'base2-fast'],
16+
agents: ['base', 'base2-fast-no-validation'],
1717
taskConcurrency: 5,
1818
})
1919

0 commit comments

Comments
 (0)