Gemini 3.1 Pro vs GLM-5.2: API Pricing
Input and output token rates, context windows, and real monthly cost for Gemini 3.1 Pro (Google) and GLM-5.2 (Z.ai), side by side. Prices are standard on-demand rates as of June 2026.
The short answer
For a typical coding-agent workload (60M in / 12M out per month), GLM-5.2 is the cheaper option at $137/mo versus $264/mo for Gemini 3.1 Pro - about 48% less (1.9× cheaper). On the headline sticker of 1M input + 1M output, Gemini 3.1 Pro is $14.00 and GLM-5.2 is $5.80.
Rates at a glance
| Gemini 3.1 Pro | GLM-5.2 | |
|---|---|---|
| Input ($/1M tokens) | $2.00 | $1.40 |
| Output ($/1M tokens) | $12.00 | $4.40 |
| Blended (1M in + 1M out) | $14.00 | $5.80 |
| Context window | 1,000K | 200K |
| Type | Proprietary | Open-weight |
| Provider | Z.ai |
Monthly cost by workload
Estimated monthly API spend at each workload's token volume. Output usually costs several times input, so the winner can flip with your mix.
| Workload | Gemini 3.1 Pro | GLM-5.2 | Cheaper |
|---|---|---|---|
| Chatbot / assistant 10M in / 3M out | $56.00/mo | $27.20/mo | GLM-5.2 |
| Coding agent 60M in / 12M out | $264/mo | $137/mo | GLM-5.2 |
| RAG / summarization 40M in / 4M out | $128/mo | $73.60/mo | GLM-5.2 |
| Batch / classification 20M in / 2M out | $64.00/mo | $36.80/mo | GLM-5.2 |
Want your own in/out split? Use the full interactive comparator to rank every model and provider for your exact workload.
Frequently asked questions
Is Gemini 3.1 Pro or GLM-5.2 cheaper?
It depends on your input/output mix, but for a typical coding-agent workload (60M in / 12M out per month) GLM-5.2 costs $137/mo versus $264/mo for Gemini 3.1 Pro - about 48% less (1.9x). On the headline sticker (1M input + 1M output), Gemini 3.1 Pro is $14.00 and GLM-5.2 is $5.80.
What are the token rates for Gemini 3.1 Pro and GLM-5.2?
Gemini 3.1 Pro (Google) is $2.00 per 1M input and $12.00 per 1M output. GLM-5.2 (Z.ai) is $1.40 per 1M input and $4.40 per 1M output. These are standard on-demand rates, not cached or batch.
Is Gemini 3.1 Pro or GLM-5.2 open-weight?
Gemini 3.1 Pro is proprietary and GLM-5.2 is open-weight. Open-weight models can be self-hosted or run on third-party hosts at different rates, so the first-party price shown here is a starting point, not the only option.
What context window do Gemini 3.1 Pro and GLM-5.2 support?
Gemini 3.1 Pro supports 1,000K tokens and GLM-5.2 supports 200K tokens. Some models also step up pricing past a size threshold - check the source pricing pages for long-context tiers.
More pricing comparisons
Stay ahead of the AI tools curve
Picks, reviews, and automation tips every weekday. Free, no spam.