Gemini 3.1 Flash Lite

TextVision

Context

Max output

65.5K

Input

$0.25

/M tokens

Output

$1.5

/M tokens

Gemini 3.1 Flash Lite is Google's high-efficiency model optimized for high-volume, latency-sensitive use cases.

Quick start

How to call

POST /v1/chat/completions

bash

curl https://conduit-api.aurixel.ai/v1/chat/completions \
  -H "Authorization: Bearer ck-YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"gemini-3.1-flash-lite","messages":[{"role":"user","content":"Hello!"}]}'

→ OpenAI-compatible. Add "stream": true (+ "stream_options":{"include_usage":true}) for SSE.

Reasoning effort: use model-name variants (-low / -agent). The reasoning_effort param is unreliable on Gemini.

Auth, billing & streaming → Quickstart

Gemini 3.1 Flash Lite

Quick start

How to call

More from Google

Gemini 3 Flash

Gemini 3.1 Flash · Nano Banana 2