Gemini 3.1 Flash optimized for real-time interactions and live streaming scenarios. Features low-latency responses with audio input support at dedicated pricing.
Copy usage instructions for Claude, ChatGPT, or other AI
| Token Type | Credits | USD Equivalent |
|---|---|---|
| Input Tokens | 750 | $0.75 |
| Output Tokens | 4,500 | $4.50 |
* 1 credit β $0.001 (actual charges may vary based on usage)
curl -X POST "https://api.core.today/llm/gemini/v1beta/openai/chat/completions" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer cdt_your_api_key" \
-d '{
"model": "gemini-3.1-flash-live-preview",
"messages": [
{
"role": "system",
"content": "You are a real-time assistant. Respond quickly and concisely."
},
{
"role": "user",
"content": "What are the key differences between HTTP/2 and HTTP/3?"
}
],
"max_tokens": 1000,
"stream": true
}'| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
messages | array | Yes | - | Array of message objects (OpenAI format) |
temperature | float | No | 1 | Sampling temperature (0-2) |
top_p | float | No | 0.95 | Nucleus sampling parameter |
max_tokens | integer | No | - | Maximum output tokens |
stream | boolean | No | true | Enable Server-Sent Events streaming (recommended for live use) |
Real-time streaming conversation
curl -X POST "https://api.core.today/llm/gemini/v1beta/openai/chat/completions" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer cdt_your_api_key" \
-d '{
"model": "gemini-3.1-flash-live-preview",
"messages": [
{
"role": "system",
"content": "You are a real-time assistant. Respond quickly and concisely."
},
{
"role": "user",
"content": "What are the key differences between HTTP/2 and HTTP/3?"
}
],
"max_tokens": 1000,
"stream": true
}'POST /llm/gemini/v1beta/openai/chat/completions