Most Used Tags
Evaluates the execution quality of skills and agents using a 7-dimension scoring system.
Impartial agent that evaluates the execution quality of skills and agents.
View historical scores and trends for evaluated skills.
Evaluate code quality effortlessly with verdict, using multiple scoring dimensions.
Compare skill scores against ideal benchmarks to identify strengths and weaknesses.
Manage Verdict auto-judge configuration with ease.