Skip to main content
Core.Today
|
OpenAIFastHigh

GPT Audio Mini

Lightweight multimodal model with native audio input/output capabilities. Optimized for voice-based interactions and audio processing tasks.

1 credits
per 1K tokens (avg)
Native audio input/output
Voice-based interactions
Cost-effective multimodal
Fast response times
Text + audio combined

Run it right now

Test this model instantly in the Console Playground β€” no code required

Sign in to try

Use with AI Assistant

Copy usage instructions for Claude, ChatGPT, or other AI

llms.txt

Model Specifications

Context Window
128K
tokens
Max Output
16K
tokens
Training Cutoff
2024-10
Compatible SDK
OpenAI

Capabilities

Vision
Function Calling
Streaming
JSON Mode
System Prompt

Token Pricing (per 1M tokens)

Token TypeCreditsUSD Equivalent
Input Tokens1,200$1.20
Output Tokens4,800$4.80

* 1 credit β‰ˆ $0.001 (actual charges may vary based on usage)

Quick Start

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-audio-mini",
  "messages": [
    {
      "role": "user",
      "content": "Describe the key differences between spoken and written language."
    }
  ],
  "max_tokens": 1000
}'

Parameters

ParameterTypeRequiredDefaultDescription
messagesarrayYes-Array of message objects (supports audio content)
temperaturefloatNo1.0Sampling temperature (0-2)
max_tokensintegerNo-Maximum tokens in response

Examples

Audio Chat

Text-based interaction with audio model

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-audio-mini",
  "messages": [
    {
      "role": "user",
      "content": "Describe the key differences between spoken and written language."
    }
  ],
  "max_tokens": 1000
}'

Tips & Best Practices

1Optimized for audio-related tasks
2Cost-effective for voice applications
3Supports both text and audio modalities
4Use for real-time voice interactions

Use Cases

Voice assistants
Audio transcription and analysis
Spoken language understanding
Podcast and media processing
Accessibility applications