Class: LLM::Compactor
- Inherits: Object
- Defined in: lib/llm/compactor.rb
Overview
LLM::Compactor summarizes older context messages into a smaller replacement message when a context grows too large.
This work is directly inspired by the compaction approach developed by General Intelligence Systems in [Brute](github.com/general-intelligence-systems/brute).
The compactor can also use a different model from the main context by setting `model:` in the compactor config. Compaction thresholds are opt-in: provide `message_threshold:` and/or `token_threshold:` to enable policy-driven compaction.
Constant Summary
- DEFAULTS = { retention_window: 8, model: nil }.freeze
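A hedged sketch of how a per-instance config would typically be layered over these defaults. The constant below mirrors the documented `DEFAULTS`; the `merge` behavior and the option values are assumptions based on the documented `initialize(ctx, config = {})` signature, not the gem's internals.

```ruby
# Mirrors LLM::Compactor::DEFAULTS as documented above.
DEFAULTS = { retention_window: 8, model: nil }.freeze

# Hypothetical: user-supplied options layered over the defaults.
# message_threshold and token_threshold are the documented opt-in keys.
config = DEFAULTS.merge(message_threshold: 40, token_threshold: 8_000)

config[:retention_window]  # => 8 (default preserved)
config[:message_threshold] # => 40 (opt-in threshold enabled)
```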
Instance Attribute Summary
- #config ⇒ Hash readonly
Instance Method Summary
-
#compact!(prompt = nil) ⇒ LLM::Message?
Summarize older messages and replace them with a compact summary.
-
#compact?(prompt = nil) ⇒ Boolean
Returns true when the context should be compacted.
-
#initialize(ctx, config = {}) ⇒ Compactor
constructor
A new instance of Compactor.
Constructor Details
Instance Attribute Details
#config ⇒ Hash (readonly)
# File 'lib/llm/compactor.rb', line 23

def config
  @config
end
Instance Method Details
#compact!(prompt = nil) ⇒ LLM::Message?
Summarize older messages and replace them with a compact summary.
# File 'lib/llm/compactor.rb', line 60

def compact!(prompt = nil)
  return nil if ctx.functions.any? || [*prompt].grep(LLM::Function::Return).any?
  messages = ctx.messages.reject(&:system?)
  retention_window = [config[:retention_window], messages.size].min
  return nil unless messages.size > retention_window
  stream = ctx.params[:stream]
  stream.on_compaction(ctx, self) if LLM::Stream === stream
  recent = messages.last(retention_window)
  older = messages[0...(messages.size - recent.size)]
  summary = LLM::Message.new(ctx.llm.user_role,
                             "[Previous conversation summary]\n\n#{summarize(older)}",
                             {compaction: true})
  ctx.messages.replace([*ctx.messages.take_while(&:system?), summary, *recent])
  stream.on_compaction_finish(ctx, self) if LLM::Stream === stream
  summary
end
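The retention-window arithmetic in `#compact!` can be illustrated with plain arrays: the newest `retention_window` messages are kept verbatim, while everything older is folded into a single summary message. This is a standalone sketch of that slicing only; the variable names mirror the method above but nothing here touches the gem's actual context or summarizer.

```ruby
# Eight stand-in messages; the real method operates on ctx.messages
# with system messages filtered out.
messages = %w[m1 m2 m3 m4 m5 m6 m7 m8]

# With a retention window of 3, the newest 3 survive compaction.
retention_window = [3, messages.size].min
recent = messages.last(retention_window)
older  = messages[0...(messages.size - recent.size)]

# The real method calls summarize(older) via the model; here we just join.
summary   = "[Previous conversation summary]\n\n#{older.join(', ')}"
compacted = [summary, *recent]

recent         # => ["m6", "m7", "m8"]
older.size     # => 5
compacted.size # => 4 (one summary + three recent messages)
```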
#compact?(prompt = nil) ⇒ Boolean
Returns true when the context should be compacted.
# File 'lib/llm/compactor.rb', line 46

def compact?(prompt = nil)
  return false if ctx.functions.any? || [*prompt].grep(LLM::Function::Return).any?
  messages = ctx.messages.reject(&:system?)
  return true if config[:message_threshold] && messages.size > config[:message_threshold]
  usage = ctx.usage
  return true if config[:token_threshold] && usage && usage.total_tokens > config[:token_threshold]
  false
end
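The opt-in threshold policy behind `#compact?` can be sketched in isolation: each threshold is only consulted when its config key is present, and either one tripping is enough. The `Usage` struct and the free-standing method below are stand-ins for whatever `ctx.usage` and the compactor actually provide; only the two documented config keys are real.

```ruby
# Stand-in for the usage object returned by ctx.usage (assumed shape:
# it responds to total_tokens).
Usage = Struct.new(:total_tokens)

# Mirrors the documented threshold checks: thresholds are opt-in, so a
# missing key disables that check entirely.
def should_compact?(messages, usage, config)
  return true if config[:message_threshold] && messages.size > config[:message_threshold]
  return true if config[:token_threshold] && usage && usage.total_tokens > config[:token_threshold]
  false
end

config = { message_threshold: 4, token_threshold: 10_000 }

should_compact?(%w[a b c], Usage.new(500), config)       # => false (under both)
should_compact?(%w[a b c d e], Usage.new(500), config)   # => true  (message count)
should_compact?(%w[a b], Usage.new(20_000), config)      # => true  (token count)
should_compact?(%w[a b c d e], Usage.new(20_000), {})    # => false (no thresholds set)
```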