-n, --runs <count> | Number of runs (max 100) | 2 |
-t, --timeout <seconds> | Timeout per run in seconds (max 3600) | 600 |
-m, --model <model> | Evaluator model for probabilistic criteria | from evaluator.model config |
-o, --output <format> | Output format: terminal, json, junit | terminal |
--seed <name> | Override twin seed name | from scenario config |
--rate-limit <count> | Rate limit: max total requests before 429 | unlimited |
--pass-threshold <score> | Minimum passing satisfaction score (0–100) | 0 |
--tag <tag> | Only run if scenario has this tag (exits 0 if not matched) | — |
--api-key <key> | API key for the model provider (overrides env vars) | from env vars |
--engine-endpoint <url> | Agent gateway URL (remote /v1/responses endpoint) | from ARCHAL_ENGINE_ENDPOINT |
--engine-token <token> | Bearer token for API engine auth | from ARCHAL_ENGINE_TOKEN |
--agent-model <model> | Agent model identifier. Required in API mode. | from ARCHAL_ENGINE_MODEL |
--engine-twin-urls <path> | JSON file mapping twin names to remote-reachable MCP base URLs | from ARCHAL_ENGINE_TWIN_URLS |
--engine-timeout <seconds> | Timeout for API engine HTTP call per run | run timeout |
--harness <name> | Use a named harness (react, hardened, zero-shot, naive, openclaw, or ~/.archal/harnesses/<name>) | react (or engine.defaultHarness config) |
--harness-dir <path> | Local agent execution directory (archal-harness.json is optional) | from ARCHAL_HARNESS_DIR |
--api-base-urls <path> | JSON file mapping service names to clone API base URLs | off |
--api-proxy-url <url> | Proxy URL for raw API code routing metadata | from ARCHAL_API_PROXY_URL |
--preflight-only | Validate environment/config and exit before execution | false |
--seed-cache | Enable dynamic seed cache reuse | false (off by default) |
--replay-seed <path> | Replay a previously saved managed seed snapshot | off |
--save-seed <path> | Save the resolved managed seed snapshot used for this run | off |
--no-failure-analysis | Skip LLM failure analysis on imperfect scores | false |
--allow-ambiguous-seed | Allow dynamic seed generation when setup is underspecified | false |
--strict-seed | Treat seed FK and coverage warnings as hard errors | false |
--sandbox | Run agent in sandboxed Docker container with TLS proxy | false |
--no-docker | Skip Docker and run with local OpenClaw CLI + proxy | false |
--openclaw-home <dir> | Path to full OpenClaw home directory | ~/.openclaw |
--workspace <dir> | OpenClaw workspace directory to mount (workspace-only mode) | — |
--openclaw-config <path> | Path to openclaw.json (workspace-only mode) | — |
--openclaw-version <version> | OpenClaw version for sandbox image build | — |
--openclaw-eval-mode <mode> | OpenClaw eval mode: isolated or stateful | — |
-q, --quiet | Suppress non-error output | false |
-v, --verbose | Enable debug logging | false |