Class: Phronomy::Eval::Runner

Inherits: Object

Defined in: lib/phronomy/eval/runner.rb

Overview

Runs a Dataset through a callable and collects EvalResult objects.

The callable must respond to `#call(input)` and may return either:

- a plain `String`, which is treated as the output; usage is nil
- a `Hash` with an `:output` key and an optional `:usage` key (a TokenUsage)
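
The two return shapes can be illustrated with a minimal, standalone sketch. It does not use Phronomy itself: the `TokenUsage` struct and the `normalize_result` helper below are hypothetical stand-ins written for this illustration, and the real Runner may handle the return value differently.

```ruby
# Hypothetical stand-in for TokenUsage; the real class's fields are an assumption here.
TokenUsage = Struct.new(:input_tokens, :output_tokens, keyword_init: true)

# Hypothetical helper (not part of Phronomy): maps either documented return
# shape to an [output, usage] pair.
def normalize_result(raw)
  case raw
  when String
    # A plain String is the output; there is no usage information.
    [raw, nil]
  when Hash
    # :output is required, :usage is optional and may be absent.
    [raw.fetch(:output), raw[:usage]]
  else
    raise ArgumentError, "expected a String or a Hash, got #{raw.class}"
  end
end

# A callable that returns a plain String.
plain_callable = ->(input) { input.upcase }

# A callable that returns a Hash with :output and :usage.
rich_callable = lambda do |input|
  { output: input.upcase, usage: TokenUsage.new(input_tokens: 3, output_tokens: 1) }
end

plain_output, plain_usage = normalize_result(plain_callable.call("abc"))
rich_output, rich_usage   = normalize_result(rich_callable.call("abc"))
```

Both callables respond to `#call(input)`, so either could be passed where the Runner expects a callable; only the second carries usage data.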

Examples:

With a simple proc

runner  = Runner.new(scorer: Scorer::ExactMatch.new)
dataset = Dataset.from_array([{ input: "2+2", expected: "4" }])
results = runner.run(dataset, ->(input) { "4" })

With a Phronomy agent

agent   = MyAgent.new
results = runner.run(dataset, ->(input) { agent.invoke(input) })

Instance Method Summary

#initialize(scorer: Scorer::ExactMatch.new) ⇒ Runner (constructor)
A new instance of Runner.
#run(dataset, callable, concurrency: 1) ⇒ Array<EvalResult>

Constructor Details

#initialize(scorer: Scorer::ExactMatch.new) ⇒ `Runner`

Returns a new instance of Runner.

Parameters:

scorer (Scorer::Base) (defaults to: Scorer::ExactMatch.new): scorer used to evaluate each result



# File 'lib/phronomy/eval/runner.rb', line 21

def initialize(scorer: Scorer::ExactMatch.new)
  @scorer = scorer
end

Instance Method Details

#run(dataset, callable, concurrency: 1) ⇒ `Array<EvalResult>`