System Overview

About

An automated testing platform built on the eval framework proposed by Anthropic. It supports both deterministic and probabilistic evaluation of LLM agent behavior.

  • Provider Management - Configure LLM providers
  • Mock Users - Simulate user interactions with the agent
  • Eval Scoring - Score agent behavior with the Anthropic eval framework
  • Test Runs - Execute tests automatically
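
The difference between deterministic and probabilistic evaluation can be illustrated with a minimal sketch. It is not this platform's API: the names deterministic_check, pass_rate, and make_mock_agent are hypothetical, and the seeded mock agent only stands in for a real LLM provider. The sketch assumes a deterministic check is a fixed pass/fail rule applied to a single output, and a probabilistic evaluation is a pass rate measured over repeated runs.

```python
import random
from typing import Callable


def deterministic_check(output: str, expected_substring: str) -> bool:
    """Fixed rule: the same output always yields the same verdict."""
    return expected_substring in output


def pass_rate(
    agent: Callable[[str], str],
    prompt: str,
    grader: Callable[[str], bool],
    trials: int = 20,
) -> float:
    """Run the agent `trials` times and return the fraction of graded passes."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    passes = sum(1 for _ in range(trials) if grader(agent(prompt)))
    return passes / trials


def make_mock_agent(seed: int, success_prob: float = 0.8) -> Callable[[str], str]:
    """Hypothetical stand-in for an LLM agent; succeeds with `success_prob`."""
    rng = random.Random(seed)  # seeded so a test run is reproducible

    def agent(prompt: str) -> str:
        if rng.random() < success_prob:
            return "Your order has been refunded."
        return "Sorry, I cannot help with that."

    return agent


if __name__ == "__main__":
    agent = make_mock_agent(seed=0)

    def grader(out: str) -> bool:
        return deterministic_check(out, "refunded")

    rate = pass_rate(agent, "Please refund my order.", grader, trials=50)
    print(f"pass rate over 50 trials: {rate:.2f}")
```

A single deterministic check suits behavior that must always hold, while a pass rate with a threshold may be the better fit for behavior that varies between runs.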

Roadmap

  • LLM Provider Management
  • Mock User Configuration
  • Eval Test Cases (CRUD + Schema)
  • Eval Test Runner