Class: Phronomy::Eval::Runner
- Inherits:
-
Object
- Object
- Phronomy::Eval::Runner
- Defined in:
- lib/phronomy/eval/runner.rb
Overview
Runs a Dataset through a callable and collects EvalResult objects.
The callable must respond to +#call(input)+ and may return either:
- a plain +String+ — treated as the output; usage is nil
- a +Hash+ with +:output+ and optional +:usage+ (TokenUsage) keys
Instance Method Summary collapse
-
#initialize(scorer: Scorer::ExactMatch.new) ⇒ Runner
constructor
A new instance of Runner.
-
#run(dataset, callable, concurrency: 1) ⇒ Array<EvalResult>
mutant:disable - concurrency default value mutations (0/2) are genuine equivalent because sequential and concurrent paths produce identical results; if concurrency<=1 boundary mutations (==1 / <1 / <=0 / .eql? / .equal? / false / nil / <=2) are genuine equivalent because the concurrent path with concurrency=1 still produces the same Array
via each_slice(1); spawn name: mutations are genuine equivalent (name is only used for logging).
Constructor Details
#initialize(scorer: Scorer::ExactMatch.new) ⇒ Runner
Returns a new instance of Runner.
22 23 24 |
# File 'lib/phronomy/eval/runner.rb', line 22 def initialize(scorer: Scorer::ExactMatch.new) @scorer = scorer end |
Instance Method Details
#run(dataset, callable, concurrency: 1) ⇒ Array<EvalResult>
mutant:disable - concurrency default value mutations (0/2) are genuine equivalent because sequential and concurrent paths produce identical results; if concurrency<=1 boundary mutations (==1 / <1 / <=0 / .eql? / .equal? / false / nil / <=2) are genuine equivalent because the concurrent path with concurrency=1 still produces the same Array
32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 |
# File 'lib/phronomy/eval/runner.rb', line 32 def run(dataset, callable, concurrency: 1) cases = dataset.to_a return cases.map { |eval_case| run_one(eval_case, callable) } if concurrency <= 1 # Run cases in slices of +concurrency+ tasks. Each slice is joined # before the next starts, bounding peak task count to +concurrency+. # Writing to pre-allocated slots (one per task) is safe because each # task writes to a unique index and all tasks in a slice are joined # before the next slice begins. # Exceptions in worker tasks are collected and re-raised after all # tasks in the slice are joined, preventing orphaned tasks. results = Array.new(cases.length) cases.each_with_index.each_slice(concurrency) do |batch| errors = [] errors_mu = Mutex.new tasks = batch.map do |eval_case, i| Phronomy::Runtime.instance.spawn(name: "eval-case-#{i}") do results[i] = run_one(eval_case, callable) rescue => e errors_mu.synchronize { errors << e } end end tasks.each(&:join) raise errors.first if errors.any? end results end |