Class: Phronomy::Eval::Runner

Inherits:

Object

Object
Phronomy::Eval::Runner

show all

Defined in:: lib/phronomy/eval/runner.rb

Overview

Runs a Dataset through a callable and collects EvalResult objects.

The callable must respond to +#call(input)+ and may return either:

a plain +String+ — treated as the output; usage is nil
a +Hash+ with +:output+ and optional +:usage+ (TokenUsage) keys

Examples:

With a simple proc

runner  = Runner.new(scorer: Scorer::ExactMatch.new)
dataset = Dataset.from_array([{ input: "2+2", expected: "4" }])
results = runner.run(dataset, ->(input) { "4" })

With a Phronomy agent

agent   = MyAgent.new
results = runner.run(dataset, ->(input) { agent.invoke(input) })

Instance Method Summary collapse

#initialize(scorer: Scorer::ExactMatch.new) ⇒ Runner constructor
A new instance of Runner.
#run(dataset, callable, concurrency: 1) ⇒ Array<EvalResult>
mutant:disable - concurrency default value mutations (0/2) are genuine equivalent because sequential and concurrent paths produce identical results; if concurrency<=1 boundary mutations (==1 / <1 / <=0 / .eql? / .equal? / false / nil / <=2) are genuine equivalent because the concurrent path with concurrency=1 still produces the same Array via each_slice(1); spawn name: mutations are genuine equivalent (name is only used for logging).

Constructor Details

#initialize(scorer: Scorer::ExactMatch.new) ⇒ `Runner`

Returns a new instance of Runner.

Parameters:

scorer (Scorer::Base) (defaults to: Scorer::ExactMatch.new) —
scorer used to evaluate each result



22
23
24

# File 'lib/phronomy/eval/runner.rb', line 22

def initialize(scorer: Scorer::ExactMatch.new)
  @scorer = scorer
end

Instance Method Details

#run(dataset, callable, concurrency: 1) ⇒ `Array<EvalResult>`

mutant:disable - concurrency default value mutations (0/2) are genuine equivalent because sequential and concurrent paths produce identical results; if concurrency<=1 boundary mutations (==1 / <1 / <=0 / .eql? / .equal? / false / nil / <=2) are genuine equivalent because the concurrent path with concurrency=1 still produces the same Array via each_slice(1); spawn name: mutations are genuine equivalent (name is only used for logging)