OpenAI빠름높음

GPT-5 Nano

GPT-5의 초고속 경량 변형 모델입니다. 1M 토큰 컨텍스트 윈도우와 32K 출력 토큰을 갖추고 최소 비용으로 고처리량, 저지연 애플리케이션을 위해 설계되었습니다.

1 크레딧

1K 토큰당 (평균)

1M 토큰 컨텍스트 윈도우

32K 최대 출력 토큰

초고속 추론

최저 비용 GPT-5 변형

함수 호출 & JSON 모드

구조화된 출력

고처리량 작업에 이상적

지금 바로 실행해보세요

콘솔의 Playground에서 별도 코드 없이 이 모델을 즉시 테스트할 수 있어요

로그인 후 사용해보기

AI 어시스턴트에서 사용하기

이 모델의 사용법을 Claude, ChatGPT 등에 복사

llms.txt

모델 상세 사양

컨텍스트 윈도우

1.0M

토큰

최대 출력

33K

토큰

학습 데이터

2025-03

호환 SDK

OpenAI

기능 지원

비전

함수 호출

스트리밍

JSON 모드

시스템 프롬프트

토큰별 가격 (1M 토큰당)

토큰 종류	크레딧	달러 환산
입력 토큰	100	$0.10
출력 토큰	800	$0.80
캐시된 토큰	25	$0.03

* 1 크레딧 ≈ $0.001 (실제 요금은 사용량에 따라 달라질 수 있습니다)

빠른 시작

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-5-nano",
  "messages": [
    {
      "role": "system",
      "content": "Classify the following customer message into one of these categories: billing, technical, general, feedback. Respond with only the category name."
    },
    {
      "role": "user",
      "content": "I was charged twice for my subscription last month and need a refund."
    }
  ],
  "max_completion_tokens": 50
}'

파라미터

파라미터	타입	필수	기본값	설명
`messages`	array	Yes	-	role과 content를 포함한 메시지 객체 배열
`model`	string	Yes	gpt-5-nano	모델 식별자
`max_completion_tokens`	integer	No	4096	응답의 최대 토큰 수 (최대 32768). 주의: max_tokens 대신 max_completion_tokens 사용
`reasoning_effort`	string	No	medium	추론 노력 수준: low, medium, high
`stream`	boolean	No	false	Server-Sent Events 스트리밍 활성화
`response_format`	object	No	-	응답 형식: JSON 모드의 경우 { type: 'json_object' }
`tools`	array	No	-	모델이 호출할 수 있는 도구(함수) 목록

예제

빠른 분류

GPT-5 Nano를 활용한 고속 텍스트 분류

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-5-nano",
  "messages": [
    {
      "role": "system",
      "content": "Classify the following customer message into one of these categories: billing, technical, general, feedback. Respond with only the category name."
    },
    {
      "role": "user",
      "content": "I was charged twice for my subscription last month and need a refund."
    }
  ],
  "max_completion_tokens": 50
}'

데이터 추출

빠른 구조화된 데이터 추출

curl -X POST "https://api.core.today/llm/openai/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer cdt_your_api_key" \
  -d '{
  "model": "gpt-5-nano",
  "messages": [
    {
      "role": "system",
      "content": "Extract key information from the text and return as JSON."
    },
    {
      "role": "user",
      "content": "Meeting scheduled with John Smith from Acme Corp on March 15, 2026 at 2:00 PM in Conference Room B to discuss Q1 budget review."
    }
  ],
  "response_format": {
    "type": "json_object"
  },
  "max_completion_tokens": 500
}'

팁 & 모범 사례

1가장 비용 효율적인 GPT-5 모델 — GPT-5 대비 입력 토큰 25배 저렴

2실시간 애플리케이션에 이상적인 초고속 응답 시간

3분류, 라우팅, 간단한 추출 작업에 완벽

4결정적 분류 결과를 위해 온도 0 사용

5대량 배치 처리에 적합

6최저 가격대에서도 1M 컨텍스트 윈도우 사용 가능

사용 사례

대량 분류 및 라우팅

빠른 텍스트 생성 및 완성

실시간 채팅 애플리케이션

경량 데이터 추출

자동 태깅 및 분류

간단한 코드 지원

모델 정보

제공자OpenAI

버전2025-08-07

카테고리LLM

가격1 크레딧

API Endpoint

POST /llm/openai/v1/chat/completions

Playground에서 테스트