Create evaluations for AI skills from scratch, including test scenarios and benchmarks.
snapeval is a harness-agnostic evaluation runner for agentskills.io skills that supports performance benchmarking.
Run and iterate on existing skill evaluations.