Class: Phronomy::Agent::Base

Inherits:

Object

Object
Phronomy::Agent::Base

show all

Includes:: Concerns::BeforeCompletion, Concerns::Guardrailable, Concerns::Retryable, Concerns::Suspendable, Runnable

Defined in:: lib/phronomy/agent/base.rb

Overview

Base class for all Phronomy agents.

Subclass this to create a conversational agent powered by an LLM. DSL class methods configure the model, instructions, tools, memory, and retry behaviour. Instance methods handle invocation.

Examples:

Minimal agent

class GreetingAgent < Phronomy::Agent::Base
  model "gpt-4o-mini"
  instructions "You are a friendly greeter."
end
result = GreetingAgent.new.invoke("Hello!")
puts result[:output]

Agent with tools

class ResearchAgent < Phronomy::Agent::Base
  model "gpt-4o"
  instructions "You are a research assistant."
  tools WebSearchTool, CalculatorTool
  max_iterations 15
end

Direct Known Subclasses

Orchestrator, ReactAgent

Instance Attribute Summary

Attributes included from Concerns::BeforeCompletion

#before_completion

Class Method Summary collapse

._on_compact_callback ⇒ Proc^?
._on_compaction_trigger_callback ⇒ Proc^?
._on_trim_callback ⇒ Proc^?
.cache_instructions(enabled = nil) ⇒ Object
When enabled, attaches Anthropic prompt-cache markers to the system message so that the fixed instructions are served from cache on subsequent turns, reducing input-token costs.
.context_overhead(val = nil) ⇒ Object
Tokens reserved for the system prompt + tool definitions overhead.
.context_window(val = nil) ⇒ Object
Overrides the context window size used for token budget calculations.
.instructions(text = nil) { ... } ⇒ String, ...
Sets or reads the system instructions for this agent.
.max_iterations(val = nil) ⇒ Integer
Sets or reads the maximum number of LLM call cycles for ReAct agents.
.max_output_tokens(val = nil) ⇒ Object
Tokens to reserve for the model's output.
.model(name = nil) ⇒ String^?
Sets or reads the LLM model identifier for this agent.
.on_compact {|ctx| ... } ⇒ Object
Registers a callback that performs the actual compaction when the +on_compaction_trigger+ callback fires.
.on_compaction_trigger {|ctx| ... } ⇒ Boolean
Registers a callback that decides whether compaction should run.
.on_trim {|ctx| ... } ⇒ Object
Registers a callback that is invoked before every LLM call so the application can remove stale or irrelevant messages from the conversation history.
.provider(name = nil) ⇒ Symbol^?
Sets or reads the LLM provider for this agent.
.static_knowledge(*sources) ⇒ Object
Registers one or more static knowledge sources on the agent class.
.static_knowledge_sources ⇒ Array<Phronomy::KnowledgeSource::Base>
Returns the registered static knowledge sources.
.temperature(val = nil) ⇒ Float^?
Sets or reads the sampling temperature sent to the LLM.
.tool_aliases ⇒ Hash{Class => String}
Returns the alias map registered via the hash form of .tools.
.tools(*args) ⇒ Object
Registers tool classes for this agent.

Instance Method Summary collapse

#_add_handoff_tool(tool_class) ⇒ self
Registers an anonymous handoff tool class on this agent instance.
#_handoff_tools ⇒ Array<Class>
Returns handoff tool classes registered on this instance by Runner.
#context_version_cache ⇒ Object private
Returns the Context::ContextVersionCache for the current thread.
#invoke(input, messages: [], thread_id: nil, config: {}) ⇒ Hash
Invokes the agent with the given input and returns a result Hash.
#stream(input, messages: [], thread_id: nil, config: {}) {|Phronomy::Agent::StreamEvent| ... } ⇒ Hash
Streaming version of #invoke.

Class Method Details

._on_compact_callback ⇒ `Proc`^?

Returns:

(Proc, nil)



261
262
263

# File 'lib/phronomy/agent/base.rb', line 261

def _on_compact_callback
  @on_compact_callback
end

._on_compaction_trigger_callback ⇒ `Proc`^?

Returns:

(Proc, nil)



239
240
241

# File 'lib/phronomy/agent/base.rb', line 239

def _on_compaction_trigger_callback
  @on_compaction_trigger_callback
end

._on_trim_callback ⇒ `Proc`^?

Returns:

(Proc, nil)



216
217
218

# File 'lib/phronomy/agent/base.rb', line 216

def _on_trim_callback
  @on_trim_callback
end

.cache_instructions(enabled = nil) ⇒ `Object`

When enabled, attaches Anthropic prompt-cache markers to the system message so that the fixed instructions are served from cache on subsequent turns, reducing input-token costs.

Only has an effect when the agent also declares provider :anthropic. The cache_control field is provider-specific (the format differs between Anthropic direct, Bedrock, etc.), so the agent must explicitly declare its provider via the DSL rather than having it inferred from the model name.

Examples:

class MyAgent < Phronomy::Agent::Base
  provider :anthropic
  cache_instructions true
end

# File 'lib/phronomy/agent/base.rb', line 280

def cache_instructions(enabled = nil)
  if enabled.nil?
    @cache_instructions
  else
    @cache_instructions = enabled
  end
end

.context_overhead(val = nil) ⇒ `Object`

Tokens reserved for the system prompt + tool definitions overhead. Subtract this from the context window before computing the memory budget.

Examples:

class MyAgent < Phronomy::Agent::Base
  context_overhead 500
end

# File 'lib/phronomy/agent/base.rb', line 327

def context_overhead(val = nil)
  if val.nil?
    @context_overhead || 0
  else
    @context_overhead = val.to_i
  end
end

.context_window(val = nil) ⇒ `Object`

Overrides the context window size used for token budget calculations. When set, this value takes precedence over the RubyLLM model registry, which is useful for locally-hosted models (e.g. LM Studio) where the actually-loaded context length may differ from the catalogue value.

Examples:

class MyAgent < Phronomy::Agent::Base
  context_window 4096
end

# File 'lib/phronomy/agent/base.rb', line 312

def context_window(val = nil)
  if val.nil?
    @context_window
  else
    @context_window = val.to_i
  end
end

.instructions(text = nil) { ... } ⇒ `String`, ...

Sets or reads the system instructions for this agent. Accepts a String, a PromptTemplate, or a block (Proc). When used as a reader (no argument, no block), returns the stored value.

Examples:

String instructions

class MyAgent < Phronomy::Agent::Base
  instructions "You are a helpful assistant."
end

Block instructions

class MyAgent < Phronomy::Agent::Base
  instructions { |input| "Answer in #{input[:lang]}." }
end

Parameters:

text (String, Phronomy::PromptTemplate, nil) (defaults to: nil)

Yields:

optionally provide instructions as a block

Returns:

(String, Phronomy::PromptTemplate, Proc, nil)

# File 'lib/phronomy/agent/base.rb', line 73

def instructions(text = nil, &block)
  if text || block_given?
    @instructions = text || block
  else
    return @instructions if instance_variable_defined?(:@instructions)
    superclass.respond_to?(:instructions) ? superclass.instructions : nil
  end
end

.max_iterations(val = nil) ⇒ `Integer`

Sets or reads the maximum number of LLM call cycles for ReAct agents. Each tool call and follow-up counts as one iteration. Defaults to 10.

Examples:

class MyAgent < Phronomy::Agent::Base
  max_iterations 5
end

Parameters:

val (Integer, nil) (defaults to: nil)

Returns:

(Integer)

# File 'lib/phronomy/agent/base.rb', line 168

def max_iterations(val = nil)
  if val
    @max_iterations = val
  else
    @max_iterations || 10
  end
end

.max_output_tokens(val = nil) ⇒ `Object`

Tokens to reserve for the model's output. When nil, the model's max_output_tokens from the registry is used.

Examples:

class MyAgent < Phronomy::Agent::Base
  max_output_tokens 4096
end

# File 'lib/phronomy/agent/base.rb', line 295

def max_output_tokens(val = nil)
  if val.nil?
    @max_output_tokens
  else
    @max_output_tokens = val.to_i
  end
end

.model(name = nil) ⇒ `String`^?

Sets or reads the LLM model identifier for this agent. When called without an argument, returns the stored model or the global default from Phronomy.configuration.

Examples:

class MyAgent < Phronomy::Agent::Base
  model "gpt-4o"
end

Parameters:

name (String, nil) (defaults to: nil) —
model identifier (e.g. "gpt-4o", "claude-3-5-sonnet")

Returns:

(String, nil) —
the model name when used as a reader

# File 'lib/phronomy/agent/base.rb', line 50

def model(name = nil)
  if name
    @model = name
  else
    @model || Phronomy.configuration.default_model
  end
end

.on_compact {|ctx| ... } ⇒ `Object`

Registers a callback that performs the actual compaction when the +on_compaction_trigger+ callback fires. The block receives a Context::CompactionContext and should call +ctx.compact+ to specify which messages to summarise.

Examples:

Replace the first 4 messages with a short summary

on_compact do |ctx|
  ctx.compact(0..3) do |elements|
    texts = elements.map { |e| e[:message].content }.join(" | ")
    "Earlier conversation summary: #{texts}"
  end
end

Yields:

(ctx) —
Phronomy::Context::CompactionContext



256
257
258

# File 'lib/phronomy/agent/base.rb', line 256

def on_compact(&block)
  @on_compact_callback = block
end

.on_compaction_trigger {|ctx| ... } ⇒ `Boolean`

Registers a callback that decides whether compaction should run. Evaluated before every LLM call (after on_trim). If the block returns truthy AND an +on_compact+ callback is also registered, the compact pipeline is executed.

The block receives a read-only Context::TriggerContext.

Examples:

Trigger when messages exceed 70% of token budget

on_compaction_trigger do |ctx|
  limit = ctx.budget&.available(used: 0) || Float::INFINITY
  ctx.total_tokens > limit * 0.7
end

Yields:

(ctx) —
Phronomy::Context::TriggerContext

Returns:

(Boolean) —
truthy → run on_compact; falsy → skip



234
235
236

# File 'lib/phronomy/agent/base.rb', line 234

def on_compaction_trigger(&block)
  @on_compaction_trigger_callback = block
end

.on_trim {|ctx| ... } ⇒ `Object`

Registers a callback that is invoked before every LLM call so the application can remove stale or irrelevant messages from the conversation history.

The block receives a Context::TrimContext and may call +ctx.remove(seqs)+ to drop messages by seq number. Changes affect only the current invocation; the underlying memory store is unchanged.

Examples:

Drop the oldest message when over 80% of budget is used

on_trim do |ctx|
  limit = ctx.budget&.available(used: 0) || Float::INFINITY
  ctx.remove(ctx.message_elements.first[:seq]) if ctx.total_tokens > limit * 0.8
end

Yields:

(ctx) —
Phronomy::Context::TrimContext



211
212
213

# File 'lib/phronomy/agent/base.rb', line 211

def on_trim(&block)
  @on_trim_callback = block
end

.provider(name = nil) ⇒ `Symbol`^?

Sets or reads the LLM provider for this agent. Required when using a model not registered in RubyLLM's model registry (e.g. locally-hosted models via LM Studio or Ollama).

Examples:

class MyAgent < Phronomy::Agent::Base
  model "openai/gpt-oss-20b"
  provider :openai
end

Parameters:

name (Symbol, nil) (defaults to: nil) —
e.g. +:openai+, +:anthropic+, +:ollama+

Returns:

(Symbol, nil)

# File 'lib/phronomy/agent/base.rb', line 133

def provider(name = nil)
  if name
    @provider = name
  else
    return @provider if instance_variable_defined?(:@provider)
    superclass.respond_to?(:provider) ? superclass.provider : nil
  end
end

.static_knowledge(*sources) ⇒ `Object`

Registers one or more static knowledge sources on the agent class. Static sources are fetched once per agent instance and their content is cached in ContextVersionCache keyed by a fingerprint of the instruction text + source content. The cache is invalidated automatically when the fingerprint changes (e.g. because a source was updated).

Examples:

class PolicyAgent < Phronomy::Agent::Base
  static_knowledge Phronomy::KnowledgeSource::StaticKnowledge.new(POLICY_TEXT)
end

Parameters:

sources (Array<Phronomy::KnowledgeSource::Base>)



187
188
189

# File 'lib/phronomy/agent/base.rb', line 187

def static_knowledge(*sources)
  @static_knowledge_sources = sources.flatten
end

.static_knowledge_sources ⇒ `Array<Phronomy::KnowledgeSource::Base>`

Returns the registered static knowledge sources.

Returns:

(Array<Phronomy::KnowledgeSource::Base>)



193
194
195

# File 'lib/phronomy/agent/base.rb', line 193

def static_knowledge_sources
  @static_knowledge_sources || []
end

.temperature(val = nil) ⇒ `Float`^?

Sets or reads the sampling temperature sent to the LLM. When nil, the provider's default is used.

Examples:

class MyAgent < Phronomy::Agent::Base
  temperature 0.2
end

Parameters:

val (Float, nil) (defaults to: nil) —
temperature (0.0 to 2.0 depending on provider)

Returns:

(Float, nil)

# File 'lib/phronomy/agent/base.rb', line 151

def temperature(val = nil)
  if val
    @temperature = val
  else
    @temperature
  end
end

.tool_aliases ⇒ `Hash{Class => String}`

Returns the alias map registered via the hash form of .tools.

Returns:

(Hash{Class => String})



118
119
120

# File 'lib/phronomy/agent/base.rb', line 118

def tool_aliases
  @tool_aliases ||= {}
end

.tools(*args) ⇒ `Object`

Registers tool classes for this agent.

Accepts either a splat of classes (backward-compatible) or a Hash mapping each class to an explicit alias name (String) or nil (use tool's own name). The alias form is useful when two tools share the same auto-generated name (e.g. two SearchTool classes from different modules).

Examples:

Splat form (no alias)

tools WeatherTool, TimeTool

Hash form (with optional per-tool alias)

tools(
  Weather::SearchTool => "weather_search",
  Places::SearchTool  => "places_search",
  CurrentTimeTool     => nil
)

# File 'lib/phronomy/agent/base.rb', line 98

def tools(*args)
  if args.empty?
    if instance_variable_defined?(:@tools)
      return @tools
    end
    return superclass.respond_to?(:tools) ? superclass.tools : []
  end

  if args.length == 1 && args.first.is_a?(Hash)
    hash = args.first
    @tools = hash.keys
    @tool_aliases = hash.transform_values { |v| v&.to_s }.reject { |_, v| v.nil? }
  else
    @tools = args
    @tool_aliases = {}
  end
end

Instance Method Details

#_add_handoff_tool(tool_class) ⇒ `self`

Registers an anonymous handoff tool class on this agent instance. Called by Runner during construction when routes are configured.

Parameters:

tool_class (Class<Phronomy::Tool::Base>)

Returns:

(self)

# File 'lib/phronomy/agent/base.rb', line 340

def _add_handoff_tool(tool_class)
  @_handoff_tools ||= []
  @_handoff_tools << tool_class
  self
end

#_handoff_tools ⇒ `Array<Class>`

Returns handoff tool classes registered on this instance by Runner.

Returns:

(Array<Class>)



348
349
350

# File 'lib/phronomy/agent/base.rb', line 348

def _handoff_tools
  @_handoff_tools || []
end

#context_version_cache ⇒ `Object`

This method is part of a private API. You should avoid using this method if possible, as it may be removed or be changed in the future.

Returns the Context::ContextVersionCache for the current thread.



415
416
417

# File 'lib/phronomy/agent/base.rb', line 415

def context_version_cache
  (Thread.current[:phronomy_context_version_caches] ||= {})[object_id]
end

#invoke(input, messages: [], thread_id: nil, config: {}) ⇒ `Hash`

Invokes the agent with the given input and returns a result Hash. Applies the retry policy configured via retry_policy when transient errors occur. GuardrailError is never retried.

Examples:

Normal invocation

result = MyAgent.new.invoke("What is Ruby?")
puts result[:output]

Multi-turn conversation

result1 = agent.invoke("Hi, I'm Alice.")
result2 = agent.invoke("What's my name?", messages: result1[:messages])

Suspend / resume flow

result = agent.invoke("Perform task X")
if result[:suspended]
  result = agent.resume(result[:checkpoint], approved: true)
end
puts result[:output]

Parameters:

input (String, Hash) —
the user message; a Hash may supply +:message+, +:query+, or +:user+ as the text key, plus any template variables consumed by the configured instructions template.
messages (Array<RubyLLM::Message>) (defaults to: []) —
conversation history from a previous invocation. The application owns and persists this array; pass it on every turn to maintain multi-turn context.
thread_id (String, nil) (defaults to: nil) —
conversation thread identifier, forwarded to the compaction context when on_compact is configured.
config (Hash) (defaults to: {}) —
additional runtime options: +:knowledge_sources+ (Array) — dynamic knowledge sources for this turn +:user_id+ (+String+, optional) — caller identity forwarded to the tracer +:session_id+ (+String+, optional) — session identity forwarded to the tracer

Returns:

(Hash) —
+{ output: String, messages: Array, usage: Phronomy::TokenUsage }+, or +{ output: nil, suspended: true, checkpoint: Phronomy::Agent::Checkpoint, messages: Array }+ when the invocation was suspended awaiting tool approval.

Raises:

(Phronomy::GuardrailError) —
when an input or output guardrail rejects the value



384
385
386

# File 'lib/phronomy/agent/base.rb', line 384

def invoke(input, messages: [], thread_id: nil, config: {})
  _invoke_impl(input, messages: messages, thread_id: thread_id, config: config)
end

#stream(input, messages: [], thread_id: nil, config: {}) {|Phronomy::Agent::StreamEvent| ... } ⇒ `Hash`

Streaming version of #invoke. Yields StreamEvent objects as they are produced by the underlying LLM.

Events emitted (in order): :token — each content delta from the LLM :tool_call — when the LLM requests a tool (ReactAgent subclasses only) :tool_result — after a tool completes (ReactAgent subclasses only) :done — final event carrying output, messages, and usage :error — if an unrecoverable error occurs

Parameters:

input (String, Hash) —
same as #invoke
messages (Array<RubyLLM::Message>) (defaults to: []) —
same as #invoke
thread_id (String, nil) (defaults to: nil) —
same as #invoke
config (Hash) (defaults to: {}) —
same as #invoke

Yields:

(Phronomy::Agent::StreamEvent)

Returns:

(Hash) —
{ output:, messages:, usage: } — same as #invoke

# File 'lib/phronomy/agent/base.rb', line 404

def stream(input, messages: [], thread_id: nil, config: {}, &block)
  return invoke(input, messages: messages, thread_id: thread_id, config: config) unless block

  _stream_impl(input, messages: messages, thread_id: thread_id, config: config, &block)
rescue => e
  block&.call(StreamEvent.new(type: :error, payload: {error: e}))
  raise
end

Class: Phronomy::Agent::Base

Overview

Examples:

Minimal agent

Agent with tools

Direct Known Subclasses

Instance Attribute Summary

Attributes included from Concerns::BeforeCompletion

Class Method Summary collapse

Instance Method Summary collapse

Methods included from Concerns::Suspendable

Methods included from Concerns::BeforeCompletion

Methods included from Concerns::Guardrailable

Methods included from Concerns::Retryable

Methods included from Runnable

Class Method Details

._on_compact_callback ⇒ Proc?

._on_compaction_trigger_callback ⇒ Proc?

._on_trim_callback ⇒ Proc?

.cache_instructions(enabled = nil) ⇒ Object

Examples:

.context_overhead(val = nil) ⇒ Object

Examples:

.context_window(val = nil) ⇒ Object

Examples:

.instructions(text = nil) { ... } ⇒ String, ...

Examples:

String instructions

Block instructions

.max_iterations(val = nil) ⇒ Integer

Examples:

.max_output_tokens(val = nil) ⇒ Object

Examples:

.model(name = nil) ⇒ String?

Examples:

.on_compact {|ctx| ... } ⇒ Object

Examples:

Replace the first 4 messages with a short summary

.on_compaction_trigger {|ctx| ... } ⇒ Boolean

Examples:

Trigger when messages exceed 70% of token budget

.on_trim {|ctx| ... } ⇒ Object

Examples:

Drop the oldest message when over 80% of budget is used

.provider(name = nil) ⇒ Symbol?

Examples:

.static_knowledge(*sources) ⇒ Object

Examples:

.static_knowledge_sources ⇒ Array<Phronomy::KnowledgeSource::Base>

.temperature(val = nil) ⇒ Float?

Examples:

.tool_aliases ⇒ Hash{Class => String}

.tools(*args) ⇒ Object

Examples:

Splat form (no alias)

Hash form (with optional per-tool alias)

Instance Method Details

#_add_handoff_tool(tool_class) ⇒ self

#_handoff_tools ⇒ Array<Class>

#context_version_cache ⇒ Object

#invoke(input, messages: [], thread_id: nil, config: {}) ⇒ Hash

Examples:

Normal invocation

Multi-turn conversation

Suspend / resume flow

#stream(input, messages: [], thread_id: nil, config: {}) {|Phronomy::Agent::StreamEvent| ... } ⇒ Hash

._on_compact_callback ⇒ `Proc`^?

._on_compaction_trigger_callback ⇒ `Proc`^?

._on_trim_callback ⇒ `Proc`^?

.cache_instructions(enabled = nil) ⇒ `Object`

.context_overhead(val = nil) ⇒ `Object`

.context_window(val = nil) ⇒ `Object`

.instructions(text = nil) { ... } ⇒ `String`, ...

.max_iterations(val = nil) ⇒ `Integer`

.max_output_tokens(val = nil) ⇒ `Object`

.model(name = nil) ⇒ `String`^?

.on_compact {|ctx| ... } ⇒ `Object`

.on_compaction_trigger {|ctx| ... } ⇒ `Boolean`

.on_trim {|ctx| ... } ⇒ `Object`

.provider(name = nil) ⇒ `Symbol`^?

.static_knowledge(*sources) ⇒ `Object`

.static_knowledge_sources ⇒ `Array<Phronomy::KnowledgeSource::Base>`

.temperature(val = nil) ⇒ `Float`^?

.tool_aliases ⇒ `Hash{Class => String}`

.tools(*args) ⇒ `Object`

#_add_handoff_tool(tool_class) ⇒ `self`

#_handoff_tools ⇒ `Array<Class>`

#context_version_cache ⇒ `Object`

#invoke(input, messages: [], thread_id: nil, config: {}) ⇒ `Hash`

#stream(input, messages: [], thread_id: nil, config: {}) {|Phronomy::Agent::StreamEvent| ... } ⇒ `Hash`