Setup
Code
How It Works
gpt-4o-mini(draft model) handles the query first- Quality validation checks the response
- If quality passes, the draft response is returned (60-70% of queries)
- If quality fails,
gpt-4o(verifier model) handles the query - Cost tracking reports per-query and aggregate metrics