claude-flow
Version:
Ruflo - Enterprise AI agent orchestration for Claude Code. Deploy 60+ specialized agents in coordinated swarms with self-learning, fault-tolerant consensus, vector memory, and MCP integration
22 lines • 807 B
TypeScript
/**
* GAIA DAG 5-Question Pilot Runner — iter 64
*
* Runs 5 specific GAIA L1 questions through the Co-Sight DAG harness
* and compares results against single-Sonnet baseline (from iter 63b).
*
* Pilot question mix (selected from iter 63b failures):
* 1. 5d0080cb — calculation (fish bag volume from academic paper, multi-hop)
* 2. cffe0e32 — reasoning puzzle (Secret Santa assignment chain)
* 3. ec09fa32 — riddle (game show, requires careful reasoning)
* 4. 46719c30 — retrieval (paper authors -> their other publications)
* 5. b816bfce — retrieval+reasoning (journal name from Norse mythology)
*
* Cost cap: $2.00
*
* Usage:
* node dist/src/benchmarks/gaia-dag-pilot.js
*
* Refs: ADR-139, iter 64, #2156
*/
export {};
//# sourceMappingURL=gaia-dag-pilot.d.ts.map