AI Arbiter System

Status: Production Ready
Crate: astraweave-ai (requires llm_orchestrator feature)
Documentation: See also Complete Implementation Guide

The AIArbiter is a hybrid AI control system that combines instant tactical decisions (GOAP) with deep strategic reasoning (LLM), achieving zero user-facing latency while maintaining LLM-level intelligence.

The Problem

Traditional game AI faces a dilemma:

Approach	Latency	Intelligence
Fast AI (GOAP, BT)	~100 ns	Limited reasoning
Smart AI (LLM)	13-21 seconds	Deep understanding

Players either wait 20 seconds for smart AI or get immediate but shallow responses.

The Solution

The arbiter provides zero user-facing latency by:

Instant GOAP control - Returns tactical actions in 101.7 ns
Background LLM planning - Generates strategic plans asynchronously
Seamless transitions - Switches to LLM plans when ready
Non-blocking polling - Checks LLM completion in 104.7 ns

Performance

Operation	Latency	Target	Speedup
GOAP update	101.7 ns	100 µs	982×
LLM polling	575 ns	50 µs	86×
Mode transition	221.9 ns	10 µs	45×
Full 3-step cycle	313.7 ns	—	—

Scalability

Agents	Overhead	Frame Budget	Status
1,000	101.7 µs	0.6%	✅
10,000	1.02 ms	6.1%	✅
50,000	5.09 ms	30.5%	⚠️

Quick Start

#![allow(unused)]
fn main() {
use astraweave_ai::{AIArbiter, LlmExecutor, GoapOrchestrator, RuleOrchestrator};
use std::sync::Arc;

// Create arbiter
let llm_orch = Arc::new(LlmOrchestrator::new(/* config */));
let runtime = tokio::runtime::Handle::current();
let llm_executor = LlmExecutor::new(llm_orch, runtime);

let goap = Box::new(GoapOrchestrator::new());
let bt = Box::new(RuleOrchestrator);

let mut arbiter = AIArbiter::new(llm_executor, goap, bt);

// Game loop
loop {
    let snapshot = build_world_snapshot(/* ... */);
    let action = arbiter.update(&snapshot);  // 101.7 ns
    execute_action(action);
}
}

Architecture

Three-Tier Control System

┌─────────────────────────────────────────────────────┐
│                   AIArbiter                         │
│  (Orchestration Layer - 101.7 ns overhead)          │
└────┬────────────────────┬────────────────────┬──────┘
     │                    │                    │
     ▼                    ▼                    ▼
┌──────────┐      ┌──────────────┐     ┌──────────┐
│   GOAP   │      │   Qwen3-8B    │     │    BT    │
│ (3-5 ns) │      │ (13-21s async)│     │ Fallback │
└──────────┘      └──────────────┘     └──────────┘

Mode State Machine

        ┌──────────────┐
        │     GOAP     │ ◄─────────┐
        │ (Instant AI) │           │
        └───────┬──────┘           │
                │                  │
                │ LLM ready        │ Plan exhausted
                │                  │
        ┌───────▼──────────┐       │
        │   ExecutingLLM   │───────┘
        │  (Step-by-step)  │
        └──────────────────┘
                │
                │ Empty plan
                ▼
        ┌──────────────┐
        │ BehaviorTree │
        │  (Fallback)  │
        └──────────────┘

API Reference

AIArbiter

#![allow(unused)]
fn main() {
pub struct AIArbiter { /* ... */ }

impl AIArbiter {
    /// Create new arbiter in GOAP mode
    pub fn new(
        llm_executor: LlmExecutor,
        goap: Box<dyn Orchestrator>,
        bt: Box<dyn Orchestrator>,
    ) -> Self;
    
    /// Set LLM request cooldown (default: 15s)
    pub fn with_llm_cooldown(self, cooldown: f32) -> Self;
    
    /// Main control loop - call every frame
    pub fn update(&mut self, snap: &WorldSnapshot) -> ActionStep;
    
    /// Get current mode
    pub fn mode(&self) -> AIControlMode;
    
    /// Check if LLM task is active
    pub fn is_llm_active(&self) -> bool;
    
    /// Get performance metrics
    pub fn metrics(&self) -> (
        usize,  // mode_transitions
        usize,  // llm_requests
        usize,  // llm_successes
        usize,  // llm_failures
        usize,  // goap_actions
        usize,  // llm_steps_executed
    );
}
}

AIControlMode

#![allow(unused)]
fn main() {
pub enum AIControlMode {
    GOAP,                              // Fast tactical mode
    ExecutingLLM { step_index: usize }, // Executing LLM plan
    BehaviorTree,                      // Emergency fallback
}
}

Common Patterns

Pattern 1: Basic Agent

#![allow(unused)]
fn main() {
pub struct AIAgent {
    arbiter: AIArbiter,
}

impl AIAgent {
    pub fn update(&mut self, snap: &WorldSnapshot) -> ActionStep {
        self.arbiter.update(snap)
    }
}
}

Pattern 2: Shared LLM Executor

#![allow(unused)]
fn main() {
// Create once, clone for each agent
let base_executor = LlmExecutor::new(llm_orch, runtime);

let agents: Vec<AIAgent> = (0..100)
    .map(|_| AIAgent::new(base_executor.clone()))
    .collect();
}

Pattern 3: Custom Cooldown

#![allow(unused)]
fn main() {
// Aggressive (more LLM requests)
let arbiter = AIArbiter::new(executor, goap, bt)
    .with_llm_cooldown(5.0);

// Passive (fewer LLM requests)
let arbiter = AIArbiter::new(executor, goap, bt)
    .with_llm_cooldown(30.0);
}

Pattern 4: Metrics Monitoring

#![allow(unused)]
fn main() {
let (transitions, requests, successes, failures, goap_actions, llm_steps) = 
    arbiter.metrics();

let success_rate = 100.0 * successes as f64 / requests as f64;
if success_rate < 50.0 {
    warn!("LLM success rate low: {:.1}%", success_rate);
}
}

Pattern 5: Mode-Specific Logic

#![allow(unused)]
fn main() {
match arbiter.mode() {
    AIControlMode::GOAP => {
        ui.show_status("Tactical Mode");
    }
    AIControlMode::ExecutingLLM { step_index } => {
        ui.show_status(&format!("Strategic Step {}", step_index));
        ui.show_indicator("LLM Active");
    }
    AIControlMode::BehaviorTree => {
        ui.show_warning("Fallback Mode");
    }
}
}

Cooldown Configuration

The LLM cooldown controls how frequently the arbiter requests new strategic plans:

Cooldown	Use Case
5s	Aggressive - Frequent strategic updates
15s	Default - Balanced performance
30s	Passive - Reduce LLM costs
0s	Immediate - Testing only

#![allow(unused)]
fn main() {
let arbiter = AIArbiter::new(executor, goap, bt)
    .with_llm_cooldown(15.0);  // Default
}

Troubleshooting

LLM Never Completes

Symptoms: is_llm_active() always true, never transitions to ExecutingLLM

Causes:

Ollama not running
Model not loaded
Network issues

Fix:

# Verify Ollama is running
ollama list

# Test model directly
ollama run qwen3:8b

High Failure Rate

Symptoms: llm_failures > 50% of requests

Causes:

Model quality issues
Bad prompts
Timeout too short

Fix:

#![allow(unused)]
fn main() {
// Increase cooldown to reduce impact
let arbiter = AIArbiter::new(executor, goap, bt)
    .with_llm_cooldown(30.0);
}

Stuck in ExecutingLLM

Symptoms: Same action repeated, step_index doesn’t advance

Causes:

Plan has duplicate steps
Plan too long

Fix: Validate plan length before execution:

#![allow(unused)]
fn main() {
if plan.steps.len() > 50 {
    warn!("LLM plan too long: {} steps", plan.steps.len());
}
}

Running the Demo

# GOAP-only mode
cargo run -p hello_companion --release

# Arbiter mode (GOAP + Qwen3-8B)
cargo run -p hello_companion --release --features llm_orchestrator -- --arbiter

Expected output:

Frame 0: MoveTo { x: 5, y: 5 } (GOAP)
Frame 1: TakeCover { position: Some((3, 2)) } (GOAP)
[INFO] LLM plan ready: 3 steps
Frame 3: MoveTo { x: 4, y: 0 } (ExecutingLLM[step 1])
Frame 4: TakeCover { position: Some((4, 1)) } (ExecutingLLM[step 2])
Frame 5: Attack { target: 1 } (ExecutingLLM[step 3])
[INFO] Plan exhausted, returning to GOAP
Frame 6: MoveTo { x: 5, y: 5 } (GOAP)

AstraWeave: AI-Native Game Engine Documentation

AI Arbiter System

The Problem

The Solution

Performance

Scalability

Quick Start

Architecture

Three-Tier Control System

Mode State Machine

API Reference

AIArbiter

AIControlMode

Common Patterns

Pattern 1: Basic Agent

Pattern 2: Shared LLM Executor

Pattern 3: Custom Cooldown

Pattern 4: Metrics Monitoring

Pattern 5: Mode-Specific Logic

Cooldown Configuration

Troubleshooting

LLM Never Completes

High Failure Rate

Stuck in ExecutingLLM

Running the Demo

See Also

Keyboard shortcuts

AstraWeave: AI-Native Game Engine Documentation