Gemini 2.5 Flash-Lite

Available

Google DeepMindProprietary

Google's lowest-latency, lowest-cost Gemini 2.5 tier, designed for summarization, classification, extraction, routing, and other high-volume production tasks. Proprietary API model with a 1M-token context and multimodal support.

Official page ↗Model card ↗

Specifications

License: Proprietary
Weights: Not released
Architecture: unknown
Parameters: Undisclosed
Context window: 1M tokens
Max output: —
Knowledge cutoff: —
Price (in / out, $/M): $0.1 / $0.4
Modalities: TextVisionAudioVideoCode

Benchmarks

No benchmark scores recorded yet. Spotted some? Submit a correction.

Vendor-reported figures are claims until independently verified. See methodology.