OpenAI-compatible API. Attested gateway. Public status.
Google: Gemini 3.1 Pro Preview Benchmarks
Benchmark and measurement links for Google: Gemini 3.1 Pro Preview, with TrustedRouter route data first.
1 URL: base_url migration
100s: models and routes
0: prompt logs by default
google/gemini-3.1-pro-preview
Benchmarks
Published benchmark scores
Benchmark scores for Google: Gemini 3.1 Pro Preview. Every row links to its source, and a score is only ever attached to the exact checkpoint it was measured on. Vendor model-card and open-leaderboard numbers are cited from their publishers; we did not run them. Rows marked "TrustedRouter · replays published" are our own runs of this model through the gateway, with the full per-item replay published in trustedrouter-benchmarks so anyone can re-grade them.
| Benchmark | Category | Score | Source |
|---|---|---|---|
| IFEval 100-prompt subset, 0-shot; Google's deterministic verifiers (no judge); score = avg of strict/loose × prompt/instruction | Instruction following | 98.4% | TrustedRouter Benchmarks replay 2026-06-18 |
| GSM8K 30-problem subset, deterministic numeric match (no judge); near-saturated, kept as a sanity check | Math | 96.7% | TrustedRouter Benchmarks replay 2026-06-18 |
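
The two scoring rules named in the table can be sketched as follows. This is a minimal illustration, not the grader used for the published replays: the function names, the "last number in the output" extraction rule, and the item format are assumptions, and the real replay files may be structured differently. It shows a deterministic numeric match for GSM8K-style answers and the four-way IFEval average (prompt-level and instruction-level accuracy, each under strict and loose checking).

```python
import re


def extract_final_number(text):
    """Return the last number found in text as a float, or None.

    Assumption: the model's final answer is the last number it writes.
    Thousands separators such as "1,234" are accepted.
    """
    matches = re.findall(r"-?\d[\d,]*(?:\.\d+)?", text)
    if not matches:
        return None
    return float(matches[-1].replace(",", ""))


def gsm8k_accuracy(items):
    """Fraction of items whose extracted number equals the reference.

    items: list of (model_output, reference_answer) string pairs.
    This pair format is hypothetical, chosen only for illustration.
    """
    correct = 0
    for output, reference in items:
        predicted = extract_final_number(output)
        expected = float(reference.replace(",", ""))
        if predicted is not None and predicted == expected:
            correct += 1
    return correct / len(items)


def ifeval_score(prompt_strict, prompt_loose, inst_strict, inst_loose):
    """Average of the four IFEval accuracies, each given in [0, 1]."""
    return (prompt_strict + prompt_loose + inst_strict + inst_loose) / 4


# On a 30-problem subset, 29 correct answers round to the 96.7% in the table.
demo = [("So the total is 7", "7")] * 29 + [("So the total is 8", "7")]
print(round(gsm8k_accuracy(demo) * 100, 1))  # prints 96.7
```

Because both rules are deterministic, re-grading the same replay should reproduce the same score with no judge model involved. On a 30-problem subset each problem is worth about 3.3 points, so 96.7% is consistent with 29 of 30 correct.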
TrustedRouter measurements
TrustedRouter publishes route and status measurements without storing prompt or output content. Provider latency and uptime are exposed through the model performance and uptime pages.
External benchmark references
- TrustedRouter performance page (TrustedRouter measurement)
- TrustedRouter uptime page (TrustedRouter measurement)
- Gemini model docs (Official model information)
- LMArena leaderboard (Independent benchmark index)
- LiveBench (Independent benchmark index)
- Artificial Analysis models (Independent benchmark index)
- HELM (Independent benchmark index)