Skip to main content
Core.Today
|
GoogleMediumUltra

Gemini 3.1 Pro Preview

Google's latest and most capable Gemini model in preview. Features dynamic pricing that adjusts based on context length, with enhanced pricing for inputs over 200K tokens.

500 credits
per request
Dynamic pricing (standard / long-context >200K)
Advanced reasoning and analysis
1,048,576 token context window
65,536 max output tokens
Multimodal input: text, image, audio, video, PDF
Function calling, structured outputs, thinking, search grounding, code execution, caching, Batch API

Run it right now

Test this model instantly in the Console Playground โ€” no code required

Sign in to try

Use with AI Assistant

Copy usage instructions for Claude, ChatGPT, or other AI

llms.txt

Model Specifications

Context Window
1.0M
tokens
Max Output
66K
tokens
Training Cutoff
January 2025
Compatible SDK
OpenAI, Google AI

Capabilities

Vision
Function Calling
Streaming
JSON Mode
System Prompt

Token Pricing (per 1M tokens)

Token TypeCreditsUSD Equivalent
Input Tokens2,000$2.00
Output Tokens12,000$12.00

* 1 credit โ‰ˆ $0.001 (actual charges may vary based on usage)

Quick Start

curl -X POST "https://api.core.today/llm/gemini/v1beta/openai/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gemini-3.1-pro-preview",
  "messages": [
    {
      "role": "user",
      "content": "Analyze the current state of quantum error correction research and identify the most promising approaches for achieving fault-tolerant quantum computing by 2030."
    }
  ],
  "max_tokens": 4096,
  "temperature": 0.5
}'

Parameters

ParameterTypeRequiredDefaultDescription
messagesarrayYes-Array of message objects (OpenAI format). Supports text, image, video, audio, and PDF inputs.
temperaturefloatNo1Sampling temperature (0-2). Lower values produce more deterministic outputs.
top_pfloatNo0.95Nucleus sampling parameter (0-1).
max_tokensintegerNo-Maximum output tokens. Max: 65,536. Context window (input + output): 1,048,576 tokens.
stopstring | arrayNo-Up to 4 sequences where the model stops generating.
response_formatobjectNo-Output format constraint. Use `{ type: 'json_object' }` for structured JSON output.
presence_penaltyfloatNo0Penalty (-2.0 to 2.0) for repeating tokens.
frequency_penaltyfloatNo0Penalty (-2.0 to 2.0) by token frequency.
seedintegerNo-Seed for deterministic sampling (best-effort).
streambooleanNofalseEnable Server-Sent Events streaming.

Examples

Research Analysis

In-depth analysis with Gemini 3.1 Pro

curl -X POST "https://api.core.today/llm/gemini/v1beta/openai/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gemini-3.1-pro-preview",
  "messages": [
    {
      "role": "user",
      "content": "Analyze the current state of quantum error correction research and identify the most promising approaches for achieving fault-tolerant quantum computing by 2030."
    }
  ],
  "max_tokens": 4096,
  "temperature": 0.5
}'

Tips & Best Practices

1Max output tokens: 65,536 โ€” set max_tokens up to this limit
2Context window 1,048,576 tokens (input + output)
3Dynamic pricing: inputs >200K tokens use long-context pricing ($4.00/$18.00 per M)
4Keep inputs under 200K tokens when possible for standard pricing
5Excellent for research, analysis, and complex reasoning tasks
6Combine with vision inputs for document and diagram analysis

Use Cases

Complex reasoning and research tasks
Long document analysis and summarization
Multimodal content understanding
Advanced code generation and review
Scientific and mathematical problem solving

Model Info

ProviderGoogle
Version3.1-preview
CategoryLLM
Price500 credits

API Endpoint

POST /llm/gemini/v1beta/openai/chat/completions
Try in PlaygroundBack to Docs