aiwg

Version:

Cognitive architecture for AI-augmented software development with structured memory, ensemble validation, and closed-loop correction. FAIR-aligned artifacts, 84% cost reduction via human-in-the-loop, standards adopted by 100+ organizations.

aiwg.io

jmagly/aiwg

152 lines (107 loc) • 3.71 kB

Markdown

# RLM Cost Report: {{tree_id}} **Generated**: {{timestamp}} **Status**: {{status_emoji}} {{status_text}} --- ## Summary | Metric | Value | |--------|-------| | **Total Tokens** | {{total_tokens}} ({{input_tokens}} in + {{output_tokens}} out) | | **Total Cost** | ${{total_cost_usd}} | | **Sub-Calls** | {{total_sub_calls}} | | **Max Depth** | {{max_depth}} | | **Duration** | {{duration}} | | **Budget Usage** | {{budget_usage_percent}}% | --- ## Budget Status ``` {{budget_bar}} ``` {{#budget_warnings}} ⚠️ **Warnings**: {{#warnings}} - {{.}} {{/warnings}} {{/budget_warnings}} {{#budget_projection}} 📊 **Projection**: Based on current trajectory, final cost estimated at **${{projected_cost}}** ({{projected_percent}}% of budget) {{/budget_projection}} --- ## Cost Breakdown by Depth | Depth | Nodes | Tokens | Cost (USD) | % of Total | |-------|-------|--------|------------|------------| {{#depth_breakdown}} | {{depth}} | {{node_count}} | {{tokens}} | ${{cost}} | {{percent}}% | {{/depth_breakdown}} | **Total** | **{{total_nodes}}** | **{{total_tokens}}** | **${{total_cost}}** | **100%** | --- ## Cost Breakdown by Model | Model | Calls | Tokens | Cost (USD) | Avg Cost/Call | |-------|-------|--------|------------|---------------| {{#model_breakdown}} | {{model}} | {{call_count}} | {{tokens}} | ${{cost}} | ${{avg_cost}} | {{/model_breakdown}} | **Total** | **{{total_calls}}** | **{{total_tokens}}** | **${{total_cost}}** | **${{avg_cost_per_call}}** | --- ## Percentile Analysis Based on REF-089 (Zhang et al., 2026) recursive decomposition cost patterns: | Metric | p25 | p50 | p75 | p95 | |--------|-----|-----|-----|-----| | **Cost per Sub-Call** | ${{p25_cost}} | ${{p50_cost}} | ${{p75_cost}} | ${{p95_cost}} | | **Tokens per Sub-Call** | {{p25_tokens}} | {{p50_tokens}} | {{p75_tokens}} | {{p95_tokens}} | ### Comparison to Alternative Approaches | Approach | Estimated Cost | Comparison | |----------|----------------|------------| | **Current (RLM)** | ${{total_cost_usd}} | — | | **Base Model (no decomposition)** | ${{base_model_cost}} | {{base_model_comparison}} | | **Summarization Approach** | ${{summarization_cost}} | {{summarization_comparison}} | {{#cost_efficiency_note}} 💡 **Note**: {{cost_efficiency_note}} {{/cost_efficiency_note}} --- ## Top 5 Most Expensive Nodes | Node ID | Model | Tokens | Cost (USD) | Prompt Preview | |---------|-------|--------|------------|----------------| {{#top_expensive_nodes}} | `{{node_id}}` | {{model}} | {{tokens}} | ${{cost}} | {{prompt_preview}} | {{/top_expensive_nodes}} --- ## Recommendations {{#recommendations}} ### {{category}} {{#items}} - **{{title}}**: {{description}} {{#metrics}} - _Expected Impact_: {{metric}} {{/metrics}} {{/items}} {{/recommendations}} --- ## Cost Breakdown Details ### Depth Distribution {{#depth_chart}} ``` Depth {{depth}}: {{bar}} {{node_count}} nodes (${{cost}}) ``` {{/depth_chart}} ### Model Usage Over Time {{#model_timeline}} - **{{timestamp}}**: {{model}} ({{tokens}} tokens, ${{cost}}) {{/model_timeline}} --- ## Configuration | Parameter | Value | |-----------|-------| | Max Depth | {{config_max_depth}} | | Max Sub-Calls | {{config_max_sub_calls}} | | Default Sub-Model | {{config_default_sub_model}} | | Budget Tokens | {{config_budget_tokens}} | | Parallel Sub-Calls | {{config_parallel}} | --- ## References - **Schema**: `@agentic/code/addons/rlm/schemas/rlm-cost.yaml` - **Research**: REF-089 (Zhang et al., 2026) - Recursive Language Models - **Documentation**: `@agentic/code/addons/rlm/docs/cost-analysis.md` --- **Report ID**: `{{report_id}}` **Tree ID**: `{{tree_id}}` **Generated by**: AIWG RLM Addon v{{addon_version}}