Module: SkillBench::Evaluation
- Defined in:
- lib/skill_bench/evaluation.rb,
lib/skill_bench/evaluation/runner.rb,
lib/skill_bench/evaluation/generator.rb
Overview
Namespace for the evaluation orchestration subsystem.
Coordinates evaluation workflows across multiple tasks, including blind judging and delta computation.