gpt-4.1-mini is a significant leap in small-model performance. It matches or exceeds GPT-4o in intelligence while cutting latency nearly in half and cost by 83%, giving a strong balance of speed, quality, and affordability.
| Token Type | Credits | USD Equivalent |
|---|---|---|
| Input Tokens | 400 | $0.40 |
| Output Tokens | 1,600 | $1.60 |
| Cached Tokens | 100 | $0.10 |
* 1 credit ≈ $0.001 (actual charges may vary based on usage)
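To make the pricing table concrete, here is a minimal sketch of a per-request cost estimate. It rests on assumptions the table does not state: that the listed credits are charged per 1,000,000 tokens, and that cached tokens are counted separately from regular input tokens. `estimate_credits` is a hypothetical helper, not part of the API.

```shell
# Hypothetical helper: rough credit and USD estimate for one request.
# ASSUMPTION: table prices are per 1,000,000 tokens (the unit is not stated).
# ASSUMPTION: cached tokens are billed instead of, not on top of, input tokens,
# so pass only the uncached input count as the first argument.
estimate_credits() {
  local input_tokens=$1 output_tokens=$2 cached_tokens=${3:-0}
  awk -v i="$input_tokens" -v o="$output_tokens" -v c="$cached_tokens" 'BEGIN {
    # 400 / 1,600 / 100 credits come from the pricing table above.
    credits = (i * 400 + o * 1600 + c * 100) / 1000000
    # 1 credit is approximately $0.001, per the note under the table.
    printf "%.4f credits (~$%.6f)\n", credits, credits * 0.001
  }'
}

# 1,200 uncached input tokens and 350 output tokens
estimate_credits 1200 350   # prints: 1.0400 credits (~$0.001040)
```

Actual charges may differ from this estimate; treat it as a budgeting aid only.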
curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
    "model": "gpt-4.1-mini",
    "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant. Be concise."
      },
      {
        "role": "user",
        "content": "What are the top 3 design patterns for microservices?"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1000
  }'

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| messages | array | Yes | - | Array of message objects, each with a role and content |
| model | string | Yes | gpt-4.1-mini | Model identifier |
| temperature | float | No | 1.0 | Sampling temperature (0-2). Lower = more focused, higher = more creative |
| max_tokens | integer | No | 4096 | Maximum tokens in response (up to 32768) |
| stream | boolean | No | false | Enable Server-Sent Events streaming |
| response_format | object | No | - | Format of response: { type: 'json_object' } for JSON mode |
| tools | array | No | - | List of tools (functions) the model can call |
| top_p | float | No | 1.0 | Nucleus sampling threshold (0-1) |
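The numeric ranges in the table can be checked locally before a request is sent. The sketch below is one possible pre-flight check; `validate_params` is a hypothetical helper, not part of the API, and the lower bound of 1 for `max_tokens` is an assumption (the table only gives the upper limit). It expects numeric arguments.

```shell
# Hypothetical pre-flight check for the documented parameter ranges.
# Usage: validate_params TEMPERATURE TOP_P MAX_TOKENS
# Prints "ok" and returns 0 when all values are in range;
# prints the first violation and returns 1 otherwise.
validate_params() {
  local temperature=$1 top_p=$2 max_tokens=$3
  awk -v t="$temperature" -v p="$top_p" -v m="$max_tokens" 'BEGIN {
    # temperature: 0-2 per the parameter table
    if (t < 0 || t > 2) { print "temperature must be in 0-2"; exit 1 }
    # top_p: 0-1 per the parameter table
    if (p < 0 || p > 1) { print "top_p must be in 0-1"; exit 1 }
    # max_tokens: integer, up to 32768 (lower bound of 1 is assumed)
    if (m != int(m) || m < 1 || m > 32768) {
      print "max_tokens must be an integer from 1 to 32768"; exit 1
    }
    print "ok"
  }'
}

validate_params 0.7 1.0 1000                               # prints: ok
validate_params 2.5 1.0 1000 || echo "request not sent"    # out-of-range temperature
```

Running the check before `curl` avoids spending a round trip on a request the server would likely reject.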
Fast, cost-effective conversation
curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
    "model": "gpt-4.1-mini",
    "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant. Be concise."
      },
      {
        "role": "user",
        "content": "What are the top 3 design patterns for microservices?"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1000
  }'

Classify and tag content at scale
curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
    "model": "gpt-4.1-mini",
    "messages": [
      {
        "role": "system",
        "content": "Classify the given text into categories. Respond with JSON."
      },
      {
        "role": "user",
        "content": "Classify this support ticket: \"My payment was charged twice and I need a refund for the duplicate charge\""
      }
    ],
    "response_format": {
      "type": "json_object"
    },
    "max_tokens": 200
  }'

POST /llm/openai/v1/chat/completions