Skip to main content
OpenAIFastHigh

GPT-5 Mini

Fast and efficient variant of GPT-5. Delivers strong performance across reasoning, coding, and creative tasks with a 1M token context window and 32K output tokens, at a fraction of the cost of GPT-5.

1 credits
per 1K tokens (avg)
1M token context window
32K max output tokens
Fast inference speed
Native vision (text + image)
Function calling & JSON mode
Structured outputs
Cost-effective GPT-5 performance

Use with AI Assistant

Copy usage instructions for Claude, ChatGPT, or other AI

llms.txt

Model Specifications

Context Window
1.0M
tokens
Max Output
33K
tokens
Training Cutoff
2025-03
Compatible SDK
OpenAI

Capabilities

Vision
Function Calling
Streaming
JSON Mode
System Prompt

Token Pricing (per 1M tokens)

Token TypeCreditsUSD Equivalent
Input Tokens500$0.50
Output Tokens4,000$4.00
Cached Tokens125$0.13

* 1 credit โ‰ˆ $0.001 (actual charges may vary based on usage)

Quick Start

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-5-mini",
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant. Be concise and clear."
    },
    {
      "role": "user",
      "content": "Explain the difference between REST and GraphQL APIs, including when to use each."
    }
  ],
  "max_completion_tokens": 2000
}'

Parameters

ParameterTypeRequiredDefaultDescription
messagesarrayYes-Array of message objects with role and content
modelstringYesgpt-5-miniModel identifier
max_completion_tokensintegerNo4096Maximum tokens in response (up to 32768). Note: use max_completion_tokens, not max_tokens
reasoning_effortstringNomediumReasoning effort level: low, medium, or high
streambooleanNofalseEnable Server-Sent Events streaming
response_formatobjectNo-Format of response: { type: 'json_object' } for JSON mode
toolsarrayNo-List of tools (functions) the model can call

Examples

Quick Chat

Efficient general-purpose conversation with GPT-5 Mini

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-5-mini",
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant. Be concise and clear."
    },
    {
      "role": "user",
      "content": "Explain the difference between REST and GraphQL APIs, including when to use each."
    }
  ],
  "max_completion_tokens": 2000
}'

Code Debugging

Debug code efficiently with GPT-5 Mini

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-5-mini",
  "messages": [
    {
      "role": "system",
      "content": "You are an expert debugger. Find and fix bugs in the provided code."
    },
    {
      "role": "user",
      "content": "This Python function should return the nth Fibonacci number but it's running too slowly for large n. Optimize it:\n\ndef fib(n):\n    if n <= 1:\n        return n\n    return fib(n-1) + fib(n-2)"
    }
  ],
  "max_completion_tokens": 2000
}'

Tips & Best Practices

1Best value for most everyday tasks โ€” strong performance at low cost
25x cheaper than GPT-5 on input tokens, 5x cheaper on output tokens
31M context window same as GPT-5 for long document processing
4Ideal for high-volume applications where cost matters
5Use for chat, summarization, and code tasks
6Streaming recommended for responsive user experience

Use Cases

General-purpose chat and assistance
Code generation and debugging
Document summarization
Content creation and editing
Data extraction and classification
API integration and automation

Model Info

ProviderOpenAI
Version2025-08-07
CategoryLLM
Price1 credits

API Endpoint

POST /llm/openai/v1/chat/completions
Try in PlaygroundBack to Docs