Serverless Endpoint
GPT OSS 120B
GPT OSS 20B
Devstral Small 2505
Qwen 3 235B A22B Instruct 2507
Qwen 3 235B A22B
Qwen3 4B Instruct 2507
Qwen3 4B Thinking 2507
Qwen 2.5 Coder 3B Instruct
Qwen 2.5 Coder 7B Instruct
Qwen 2.5 Coder 32B Instruct
Qwen QwQ 32B
Llama 4 Scout 17B 16E Instruct
Llama 3.3 70B Instruct
Llama 3.2 11B Vision Instruct
Llama 3.1 8B Instruct
DeepSeek R1 Distill Llama 70B
DeepSeek R1 Distill Llama 8B
DeepSeek R1 Distill Qwen 32B
DeepSeek R1 Distill Qwen 14B
DeepSeek R1 Distill Qwen 7B (Math)
DeepSeek R1 Distill Qwen 1.5B (Math)
Mixtral 8x22B Instruct v0.1
Flux.1 [schnell]
Stable Diffusion XL 1.0
SDXL Lightning 8-step
SDXL Lightning 4-step
Type
Price
Text Generation
$0.1 input / $0.4 output
per 1m tokens
Text Generation
$0.05 input / $0.2 output
per 1m tokens
Text Generation
$0.1 Input / $0.3 Output
per 1m tokens
Text Generation
$0.20 Input / $0.60 Output
per 1m tokens
Text Generation
$0.20 Input / $0.60 Output
per 1m tokens
Text Generation
$0.01 input / $0.03 output
per 1m tokens
Text Generation
$0.01 input / $0.03 output
per 1m tokens
Text Generation
$0.01 Input / $0.03 Output
per 1m tokens
Text Generation
$0.01 Input / $0.03 Output
per 1m tokens
Text Generation
$0.06 Input / $0.20 Output
per 1m tokens
Text Generation
$0.18 Input / $0.20 Output
per 1m tokens
Text Generation
$0.09 Input / $0.29 Output
per 1m tokens
Text Generation
$0.4
per 1m tokens
Image-Text-to-Text
$0.06
per 1m tokens
Text Generation
$0.06
per 1m tokens
Text Generation
$0.75
per 1m tokens
Text Generation
$0.05
per 1m tokens
Text Generation
$0.3
per 1m tokens
Text Generation
$0.2
per 1m tokens
Text Generation
$0.15
per 1m tokens
Text Generation
$0.1
per 1m tokens
Text Generation
$1.2
per 1m tokens
Text-to-Image
$0.0013 (@ 4 steps)
per mega-pixel
Text-to-Image
$0.003 (@ 20 steps)
per mega-pixel
Text-to-Image
$0.0016
per mega-pixel
Text-to-Image
$0.0008
per mega-pixel