LLM outputs vary from run to run—ask the same question twice and you might get different quality answers. To find out which model actually performs best, you need to test each one multiple times and look at the spread. Variants let you test different models side-by-side. Groups repeat each test so you see the full distribution, not just one lucky or unlucky result.Documentation Index
Fetch the complete documentation index at: https://hud-f5fd7c15-feat-agent-orchestrator-cookbook.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Variants
Pass the configurations you want to test:Groups
Run each variant multiple times to get a distribution:hud.eval manager will parallelize your evals automatically and show the distribution across all your runs on hud.ai.