Skip to main content
Core.Today
|
OpenAIFastHigh

GPT-5.4 Mini

Fast and cost-efficient variant of GPT-5.4 with 400K context window and 128K output tokens. Excellent balance of performance and affordability for everyday tasks.

2 credits
per request
400K context window
128K max output tokens
Fast inference speed
Cost-effective pricing
Full function calling support

Use with AI Assistant

Copy usage instructions for Claude, ChatGPT, or other AI

llms.txt

Model Specifications

Context Window
400K
tokens
Max Output
128K
tokens
Training Cutoff
2025-08
Compatible SDK
OpenAI

Capabilities

Vision
Function Calling
Streaming
JSON Mode
System Prompt

Token Pricing (per 1M tokens)

Token TypeCreditsUSD Equivalent
Input Tokens750$0.75
Output Tokens4,500$4.50

* 1 credit โ‰ˆ $0.001 (actual charges may vary based on usage)

Quick Start

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-5.4-mini",
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant."
    },
    {
      "role": "user",
      "content": "Explain the difference between REST and GraphQL APIs with pros and cons."
    }
  ],
  "temperature": 0.7,
  "max_completion_tokens": 2000
}'

Parameters

ParameterTypeRequiredDefaultDescription
messagesarrayYes-Array of message objects with role and content
modelstringYesgpt-5.4-miniModel identifier
max_completion_tokensintegerNo4096Maximum tokens in response (up to 128000). Note: use max_completion_tokens, not max_tokens
temperaturefloatNo1.0Sampling temperature (0-2)
streambooleanNofalseEnable Server-Sent Events streaming
top_pfloatNo1.0Nucleus sampling threshold (0-1)

Examples

Quick Chat

Efficient conversation with GPT-5.4 Mini

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-5.4-mini",
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant."
    },
    {
      "role": "user",
      "content": "Explain the difference between REST and GraphQL APIs with pros and cons."
    }
  ],
  "temperature": 0.7,
  "max_completion_tokens": 2000
}'

Tips & Best Practices

1Great balance of speed and quality for most everyday tasks
2128K output tokens at a fraction of GPT-5.4 cost
3Use for chat, code generation, and content creation
4Streaming recommended for long responses

Use Cases

General-purpose chat and assistants
Code generation and review
Content creation and editing
Data extraction and classification
Real-time applications

Model Info

ProviderOpenAI
Version2026-03
CategoryLLM
Price2 credits

API Endpoint

POST /llm/openai/v1/chat/completions
Try in PlaygroundBack to Docs