Zhipu AI

    GLM 5.1

    A flagship model for agents and complex coding

    GLM-5.1 is Zhipu AI's next-generation flagship text model, with stronger thinking, coding, and agent-task capabilities. It supports long context, context caching, structured output, and function calling, making it suitable for complex coding, tool use, multi-step reasoning, and long-running agent workflows.

    Context200K
    Released2026-04
    Relays36 sites
    200K context windowAgents and tool useStructured outputCoding and long workflows

    Zhipu AI Official Pricing

    CNY
    Updated: 2026-04-01T00:00:00.000+08:00Source

    Input

    ¥6/ 1M tokens

    Output

    ¥24/ 1M tokens

    Cache read

    ¥1.3/ 1M tokens

    Relay Comparison

    Compare token, per-request, or per-second pricing by relay channel.

    How should GLM 5.1 relay pricing be compared?

    This GLM 5.1 pricing page compares official pricing with public prices from 36 listed AI gateways. Token prices are shown in CNY per 1M tokens, while per-request, per-second, and per-character rows use the unit shown in the table. Last updated: 06/11/2026, 19:57.

    Data sources
    Public price catalogs, official pricing records, and monitoring results.
    Metric definitions
    Uptime means successful probe response rate, fake-rate signals possible model mismatch or abnormal output risk, and latency is average API response time.
    Risk note
    Relay gateways are third-party services. Pricing, billing, privacy, and stability can change; start with a small top-up and verify reliability before continued use.