System Overview
API Status
Status:
Loading...
Version:
-
Last Check:
-
Quick Links
About
Automated testing platform based on the Eval framework proposed by Anthropic. Supports deterministic and probabilistic evaluation of LLM agent behavior.
- Provider Management - Configure LLM providers
- Mock Users - Simulated agent interactions
- Eval Scoring - Anthropic eval framework
- Test Runs - Automated test execution
Roadmap
- LLM Provider Management
- Mock User Configuration
- Eval Test Cases (CRUD + Schema)
- Eval Test Runner