Module: Octo::Agent::LlmCaller

Included in:: Octo::Agent

Defined in:: lib/octo/agent/llm_caller.rb

Overview

LLM API call management Handles API calls with retry logic, fallback model support, and progress indication

Constant Summary collapse

RETRIES_BEFORE_FALLBACK = Number of consecutive RetryableError failures (503/429/5xx) before switching to fallback. Network-level errors (connection failures, timeouts) do NOT trigger fallback — they are retried on the primary model for the full max_retries budget, since they are likely transient infrastructure blips rather than a model-level outage.

MAX_RETRIES_ON_FALLBACK = After switching to the fallback model, allow this many retries before giving up. Kept lower than max_retries (10) because we have already exhausted the primary model.

Instance Method Summary collapse

#collect_iteration_tokens(usage) ⇒ Hash

Collect token usage data for current iteration and return it.

Instance Method Details

#collect_iteration_tokens(usage) ⇒ `Hash`

Collect token usage data for current iteration and return it. Does NOT calculate cost — cost tracking has been removed.

Parameters:

usage (Hash) —

Usage data from API

Returns:

(Hash) —

token_data ready for show_token_usage