Google

    Gemini 3.5 Flash

    快又便宜的模型, 适合Agent使用

    Gemini 3.5 Flash provides sustained frontier-level intelligence optimized for real-world tasks at a higher speed and lower cost. Designed for the agentic era, it excels at sub-agent deployment, multi-step workflows, and long-horizon tasks at scale. This model is particularly effective for rapid agentic loops involving complex coding cycles and iterations.

    Context1M
    Released2026-05
    Relays15 sites

    Google Official Pricing

    CNY
    Updated: 2026-05-20T11:35:32.379+08:00Source

    Input

    ¥10.5/ 1M tokens

    Output

    ¥63/ 1M tokens

    Cache read

    ¥1.05/ 1M tokens

    Relay Comparison

    Compare token, per-request, or per-second pricing by relay channel.

    How should Gemini 3.5 Flash relay pricing be compared?

    This Gemini 3.5 Flash pricing page compares official pricing with public prices from 15 listed AI gateways. Token prices are shown in CNY per 1M tokens, while per-request, per-second, and per-character rows use the unit shown in the table. Last updated: 05/20/2026, 14:29.

    Data sources
    Public price catalogs, official pricing records, and monitoring results.
    Metric definitions
    Uptime means successful probe response rate, fake-rate signals possible model mismatch or abnormal output risk, and latency is average API response time.
    Risk note
    Relay gateways are third-party services. Pricing, billing, privacy, and stability can change; start with a small top-up and verify reliability before continued use.