Benchmarks

The models we run, measured.

We pick models for results, then make them affordable. Here's how the models available on Vikasit perform against the global frontier — competitive on math, honest about the gap on the very newest closed models, and far ahead on cost.

Running with the frontier

On AIME 2025, the flagship we serve is within ~3 points of the best — and far ahead on cost.

AIME 2025 — competition math

No tools. The newest Anthropic/Google flagships no longer report AIME — shown are the most recent that do.

Gemini 3 Pro

95%

GPT-5.1

94%

vikasit-235b-moe

92.3%

MiniMax-M2

78%

Efficiency

85.0%

AIME'25 from vikasit-30b-moe — just ~3B active params.

Math

92.9%

MATH from vikasit-math (72B). 7B hits 85.3%.

A tier for every budget.

From compact models to a 235B-class flagship — and the trillion-scale tiers above them — quality climbs cleanly with scale. Our efficient MoE options deliver frontier-class results at small-model inference cost, so you only pay for what you need.

Vikasit family — AIME 2025

Thinking mode. The 30B MoE (3B active) rivals far bigger dense models.

vikasit-235b-moe

92.3%

vikasit-30b-moe

85%

vikasit-32b

72.9%

vikasit-14b

70.4%

vikasit-8b

67.3%

The bar keeps rising

The newest closed models (June 2026) have pushed coding and graduate reasoning higher. Here's where they stand — and where the best models on Vikasit sit today. We show it straight.

GPQA Diamond — graduate reasoning

Latest closed frontier vs the best model on Vikasit. No tools.

Gemini 3.1 Pro

94.3%

Claude Fable 5

94.1%

Claude Opus 4.8

93.6%

vikasit-235b-moe

81.1%

SWE-bench Verified — agentic coding

Our coding-tuned tier targets the open state of the art.

Claude Mythos 5

95.5%

Claude Fable 5

95%

Claude Opus 4.8

88.6%

Gemini 3.1 Pro

80.6%

vikasit-titan-1.6t

67.8%

Sources. Scores for models on Vikasit reflect published results for those models (thinking mode, no tools). Frontier comparison scores from official cards/reports (June 2026): Claude Fable 5, Mythos 5, Opus 4.8 (Anthropic), Gemini 3.1 Pro & Gemini 3 Pro (Google DeepMind), GPT-5.1 (via Google's published comparison), MiniMax-M2 (vendor card). GPT-5.5 shipped April 2026 but OpenAI has not published accessible capability benchmarks, so GPT-5.1 is shown where comparable. Benchmarks differ in protocol; figures are no-tools where available.