Skip to main content
Core.Today
|
GoogleFastHigh

Gemini 2.0 Flash

Google's fastest and most capable model. Features a massive 1M token context window, native multimodal support, and real-time capabilities.

1 credits
per 1K tokens (avg)
1,048,576 token context window
8,192 max output tokens
Native multimodal (text, image, audio, video)
Real-time streaming
Code execution, search grounding, caching, Batch API
Grounding with Google Search
Deprecated โ€” migrate to Gemini 3 Flash or 2.5 Flash

Run it right now

Test this model instantly in the Console Playground โ€” no code required

Sign in to try

Use with AI Assistant

Copy usage instructions for Claude, ChatGPT, or other AI

llms.txt

Model Specifications

Context Window
1.0M
tokens
Max Output
8K
tokens
Training Cutoff
August 2024
Compatible SDK
OpenAI, Google

Capabilities

Vision
Function Calling
Streaming
JSON Mode
System Prompt

Token Pricing (per 1M tokens)

Token TypeCreditsUSD Equivalent
Input Tokens75$0.07
Output Tokens300$0.30

* 1 credit โ‰ˆ $0.001 (actual charges may vary based on usage)

Quick Start

curl -X POST "https://api.core.today/llm/gemini/v1beta/openai/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gemini-2.0-flash",
  "messages": [
    {
      "role": "system",
      "content": "You are a document analyst. Provide comprehensive analysis."
    },
    {
      "role": "user",
      "content": "Analyze this entire codebase and identify potential security vulnerabilities: [code files]"
    }
  ],
  "max_tokens": 4000
}'

Parameters

ParameterTypeRequiredDefaultDescription
messagesarrayYes-Array of message objects (OpenAI format supported)
temperaturefloatNo1.0Sampling temperature (0-2)
max_tokensintegerNo-Maximum output tokens. Max: 8,192. Context window (input + output): 1,048,576 tokens.
streambooleanNofalseEnable streaming

Examples

Long Document Analysis

Analyze documents up to 1M tokens

curl -X POST "https://api.core.today/llm/gemini/v1beta/openai/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gemini-2.0-flash",
  "messages": [
    {
      "role": "system",
      "content": "You are a document analyst. Provide comprehensive analysis."
    },
    {
      "role": "user",
      "content": "Analyze this entire codebase and identify potential security vulnerabilities: [code files]"
    }
  ],
  "max_tokens": 4000
}'

Video Understanding

Analyze video content

curl -X POST "https://api.core.today/llm/gemini/v1beta/openai/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gemini-2.0-flash",
  "messages": [
    {
      "role": "user",
      "content": "Describe what's happening in this video and identify key moments."
    }
  ]
}'

Tips & Best Practices

1Deprecated โ€” migrate to Gemini 3 Flash or Gemini 2.5 Flash
2Max output tokens: 8,192 โ€” much lower than 2.5/3.x Flash (65,536)
3Context window 1,048,576 tokens (input + output)
4Use for tasks requiring very long context
5Supports OpenAI SDK format for easy migration
6Can process video and audio natively

Use Cases

Large document processing
Multi-file code analysis
Real-time applications
Video understanding
Long conversation memory
Research assistance