Agent Eval - Test Platform

System Overview

About

An automated testing platform based on the eval framework proposed by Anthropic. It supports both deterministic and probabilistic evaluation of LLM agent behavior.

Provider Management - Configure LLM providers
Mock Users - Simulated agent interactions
Eval Scoring - Scoring based on the Anthropic eval framework
Test Runs - Automated test execution
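
The difference between deterministic and probabilistic evaluation can be illustrated with a minimal Python sketch. Everything named here (`Agent`, `deterministic_eval`, `probabilistic_eval`, `make_mock_agent`, the `trials` and `threshold` parameters) is a hypothetical illustration and not part of the platform's actual API. The sketch assumes an agent is simply a function from a prompt string to a response string.

```python
import random
from typing import Callable, Tuple

# Hypothetical agent type: takes a prompt, returns a text response.
Agent = Callable[[str], str]


def deterministic_eval(agent: Agent, prompt: str, expected: str) -> bool:
    """Pass only if a single run matches the expected output exactly."""
    return agent(prompt).strip() == expected


def probabilistic_eval(
    agent: Agent,
    prompt: str,
    check: Callable[[str], bool],
    trials: int = 20,
    threshold: float = 0.8,
) -> Tuple[bool, float]:
    """Run the agent `trials` times; pass if the pass rate meets `threshold`."""
    passes = sum(1 for _ in range(trials) if check(agent(prompt)))
    rate = passes / trials
    return rate >= threshold, rate


def make_mock_agent(seed: int, accuracy: float = 0.9) -> Agent:
    """Stand-in for a real LLM agent: answers "4" with probability `accuracy`."""
    rng = random.Random(seed)

    def agent(prompt: str) -> str:
        return "4" if rng.random() < accuracy else "5"

    return agent


if __name__ == "__main__":
    agent = make_mock_agent(seed=0)
    # A single exact-match check can fail on one unlucky sample.
    print("deterministic:", deterministic_eval(agent, "What is 2 + 2?", "4"))
    # Repeated sampling gives a pass rate to compare against a threshold.
    passed, rate = probabilistic_eval(
        agent, "What is 2 + 2?", lambda r: r.strip() == "4"
    )
    print("probabilistic:", passed, "pass rate:", rate)
```

A deterministic check suits behavior that must hold on every run, while a pass-rate threshold is one common way to score behavior that varies between samples. The threshold of 0.8 is an arbitrary placeholder.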

Roadmap

LLM Provider Management
Mock User Configuration
Eval Test Cases (CRUD + Schema)
Eval Test Runner
