Benchmarks
Benchmarks live in the GitHub repo and are not bundled in the npm package. Copy a benchmark folder into your project before running shieldedshell loop --benchmark <name>.
Copy a benchmark
    git clone https://github.com/connerkup/shielded-shell.git
    cp -r shielded-shell/benchmark/02_ledger_consensus ./benchmark/

Windows PowerShell:

    Copy-Item -Recurse shielded-shell\benchmark\02_ledger_consensus .\benchmark\

Available scenarios

| Name | Focus |
|---|---|
| 02_ledger_consensus | Concurrent ledger transfers; interval solver + secure validator |
| 04_api_gateway | Routing policy; sensitive paths must not map to public gateways |
| 06_poison_task | Adversarial / poison detection scenario (expects controlled failure modes) |

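To copy every scenario from the table in one pass, a loop over the folder names is enough. The following is a minimal sketch, assuming the repository was cloned into a local directory that contains a benchmark/ folder; the copy_benchmarks function and its two arguments are hypothetical helpers for illustration, not part of the shieldedshell CLI.

```shell
# Hypothetical helper (not part of shieldedshell): copy the three listed
# benchmark folders from a local clone into a destination directory.
# Usage: copy_benchmarks <clone-dir> <dest-dir>
copy_benchmarks() {
  src="$1/benchmark"
  dest="$2"
  # Create the destination first so cp has a directory to copy into.
  mkdir -p "$dest" || return 1
  for name in 02_ledger_consensus 04_api_gateway 06_poison_task; do
    if [ ! -d "$src/$name" ]; then
      echo "missing scenario: $src/$name" >&2
      return 1
    fi
    cp -r "$src/$name" "$dest/" || return 1
  done
}
```

For example, copy_benchmarks shielded-shell ./benchmark mirrors the single-folder cp command for all three scenarios and stops at the first folder it cannot find.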
Each folder includes:
- agent_a_prompt.txt / agent_b_prompt.txt: role prompts
- developer_secret.txt / auditor_secret.txt: phase-gated fixture data
- validate.js: secure validator (where applicable)
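
After copying a folder, a quick check can confirm that the files listed above arrived intact. This is a sketch under stated assumptions: the files are assumed to sit at the top level of the benchmark folder, and check_benchmark_files is a hypothetical helper name, not a shieldedshell command. Because validate.js is only present where applicable, its absence is reported as a note rather than a failure.

```shell
# Hypothetical helper (not part of shieldedshell): report which of the
# expected files a copied benchmark folder lacks.
# Assumes the files live at the top level of the folder.
# Usage: check_benchmark_files <benchmark-folder>
check_benchmark_files() {
  dir="$1"
  missing=0
  for f in agent_a_prompt.txt agent_b_prompt.txt developer_secret.txt auditor_secret.txt; do
    if [ ! -f "$dir/$f" ]; then
      echo "missing: $f" >&2
      missing=1
    fi
  done
  # validate.js exists only where applicable, so this is informational.
  [ -f "$dir/validate.js" ] || echo "note: no validate.js in $dir" >&2
  # Exit status 0 when all four required files exist, 1 otherwise.
  return "$missing"
}
```

A call such as check_benchmark_files ./benchmark/02_ledger_consensus returns a non-zero status if any prompt or secret file is missing.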
Run a benchmark
    cd your-project
    shieldedshell init
    shieldedshell loop --engine cline --benchmark 02_ledger_consensus --dir .

A successful run ends with CRITICAL_SUCCESS in shared_context.txt and the merged code in the merge target (default auth_service.js).
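
In a script, the success condition described above can be checked after the loop finishes. The sketch below makes several assumptions: shared_context.txt and the merge target are assumed to sit in the project directory passed to --dir, the marker is matched anywhere in the file rather than only on its final line, and check_run is a hypothetical helper name rather than a shieldedshell command.

```shell
# Hypothetical helper (not part of shieldedshell): succeed only if the run
# left the success marker and a non-empty merge target behind.
# Assumes both files live directly in the project directory.
# Usage: check_run <project-dir> [merge-target]
check_run() {
  dir="$1"
  # Fall back to the documented default merge target.
  target="${2:-auth_service.js}"
  if ! grep -q "CRITICAL_SUCCESS" "$dir/shared_context.txt" 2>/dev/null; then
    echo "CRITICAL_SUCCESS not found in $dir/shared_context.txt" >&2
    return 1
  fi
  if [ ! -s "$dir/$target" ]; then
    echo "merge target missing or empty: $dir/$target" >&2
    return 1
  fi
}
```

Running check_run . after the loop command gives a non-zero exit status when either condition fails, which is convenient for CI steps that should stop on an unsuccessful benchmark.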
Mock agents (CI / local dev)
The repo includes mock agent scripts under packages/core/fixtures/ for automated tests without live LLMs. See the GitHub repo for mock-dev-ledger.mjs and mock-audit-pass.mjs patterns.
Report issues
Beta feedback: use the bug report template. Include the output of shieldedshell doctor and the engine name.