The agent eval standard for MCP. Score output quality, catch safety failures, enforce cost budgets. Open source.
Evaluate AI agent outputs for quality, safety, and cost using the Iris MCP server.