Vibe-Coded Evals with LLM-as-a-Judge

December 13, 20252 min read

Using Claude Code and OpenRouter infrastructure to rapidly build model evaluations with Claude Opus 4.5 as the judge

Loading...