Aurixel
Models/Google/gemini-3.1-flash-lite
Google
NEW

Gemini 3.1 Flash Lite

TextVision
Context
1M
Max output
65.5K
Input
$0.25
/M tokens
Output
$1.5
/M tokens

Gemini 3.1 Flash Lite is Google's high-efficiency model optimized for high-volume, latency-sensitive use cases.

Quick start

How to call

POST /v1/chat/completions
bash
curl https://conduit-api.aurixel.ai/v1/chat/completions \
  -H "Authorization: Bearer ck-YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"gemini-3.1-flash-lite","messages":[{"role":"user","content":"Hello!"}]}'

→ OpenAI-compatible. Add "stream": true (+ "stream_options":{"include_usage":true}) for SSE.

Reasoning effort: use model-name variants (-low / -agent). The reasoning_effort param is unreliable on Gemini.

Auth, billing & streaming → Quickstart

More from Google