KPI Dimensions
| Dimension | Score Source | Range | What it means |
|---|---|---|---|
quality | Model quality priors | 0.0-1.0 | Higher = better output quality |
cost | Inverse of model cost | 0.0-1.0 | Higher = cheaper model |
latency | Model latency priors | 0.0-1.0 | Higher = faster response |
energy | Inverse of energy coefficient | 0.0-1.0 | Higher = lower compute intensity |
Configuration
Weights
Weights are relative — they don’t need to sum to 1.0 (they are normalized internally). They control the relative importance of each dimension in the composite score.Targets
Targets set minimum acceptable values. If a model’s score for a dimension falls below the target, it is penalized in the composite score.Scoring Formula
The composite score for a model is:w_* are the normalized weights and utility values are computed from model priors.
Quality Priors
Built-in quality priors for common models (OpenAI):| Model | Quality | Latency |
|---|---|---|
| o1 | 0.95 | 0.40 |
| gpt-4o | 0.90 | 0.72 |
| gpt-4-turbo | 0.88 | 0.66 |
| gpt-4 | 0.87 | 0.52 |
| gpt-5-mini | 0.86 | 0.84 |
| o1-mini | 0.82 | 0.60 |
| o3-mini | 0.80 | 0.78 |
| gpt-4o-mini | 0.75 | 0.93 |
| gpt-3.5-turbo | 0.65 | 1.00 |