Gemini 2.0 Flash-Lite is our fastest and most cost efficient
Flash model. It's an upgrade path for 1.5 Flash users who want better quality
for the same price and speed. For even more detailed technical information on
Gemini 2.0 Flash-Lite (such as performance benchmarks, information
on our training datasets, efforts on sustainability, intended usage and
limitations, and our approach to ethics and safety), see the model card for
Gemini 2.0 Flash-Lite.
Try in Vertex AI View in Model Garden (Preview) Deploy example app
Model availability (Includes dynamic shared quota & Provisioned Throughput) ML processing
Model ID
gemini-2.0-flash-lite
Supported inputs & outputs
Token limits
Capabilities
Usage types
Input size limit
500 MB
Technical specifications
Images
image/png
,
image/jpeg
,
image/webp
Documents
application/pdf
,
text/plain
Video
video/x-flv
,
video/quicktime
,
video/mpeg
,
video/mpegs
,
video/mpg
,
video/mp4
,
video/webm
,
video/wmv
,
video/3gpp
Audio
audio/x-aac
,
audio/flac
,
audio/mp3
,
audio/m4a
,
audio/mpeg
,
audio/mpga
,
audio/mp4
,
audio/opus
,
audio/pcm
,
audio/wav
,
audio/webm
Parameter defaults
Supported regions
See Data residency for more information.
Knowledge cutoff date
June 2024
Versions
gemini-2.0-flash-lite-001
Security controls
Online prediction
Batch prediction
Tuning
See Security controls for more information.
Pricing
See Pricing.
Gemini 2.0 Flash-Lite
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-18 UTC.