LLM Releases
← Catalog

Gemini 2.5 Flash-Lite

Available
Google DeepMindProprietary

Google's lowest-latency, lowest-cost Gemini 2.5 tier, designed for summarization, classification, extraction, routing, and other high-volume production tasks. Proprietary API model with a 1M-token context and multimodal support.

Specifications

License
Proprietary
Weights
Not released
Architecture
unknown
Parameters
Undisclosed
Context window
1M tokens
Max output
β€”
Knowledge cutoff
β€”
Price (in / out, $/M)
$0.1 / $0.4
Modalities
TextVisionAudioVideoCode

Benchmarks

No benchmark scores recorded yet. Spotted some? Submit a correction.

Vendor-reported figures are claims until independently verified. See methodology.