together
Llama 3.3 70B on Together
llama-3.3-70b
Fields
| provider | "together" |
|---|---|
| model_id | "llama-3.3-70b" |
| display_name | "Llama 3.3 70B on Together" |
| status | "ga" |
| release_date | null |
| deprecation_date | null |
| retirement_date | null |
| pricing | {"input_per_mtok": 1.04, "output_per_mtok": 1.04, "cached_input_per_mtok": null, "batch_discount_pct": null} |
| context_window_tokens | null |
| max_output_tokens | null |
| modalities | {"input": ["text"], "output": ["text"]} |
| knowledge_cutoff | null |
| verified_at | "2026-07-04T08:00:00Z" |
| notes | "Provider is Together AI serving platform; upstream model family named in display_name. Context window not captured from Together model page." |
| permalink | "/models/together/llama-3_3-70b.html" |
Sources
https://www.together.ai/pricing
Accessed: 2026-07-04T08:00:00Z
Fields: pricing.input_per_mtok, pricing.output_per_mtok
Serverless Inference Price per 1M tokens Model Input output DeepSeek V4 Pro $1.74 $0.20 (cached) $3.48 MiniMax M3 $0.30 $0.06 (cached) $1.20 Kimi K2.7 Code $0.95 $0.19 (cached) $4.00 GLM-5.2 $1.40 $0.26 (cached) $4.40 LFM2 24B A2B $0.03 $0.12 Gemma 4 31B $0.39 $0.97 NVIDIA Nemotron 3 Ultra $0.60 $0.20 (cached) $3.60 Qwen3.7-Plus $0.32 $1.28 Kimi K2.6 $1.20 $0.20 (cached) $4.50 Qwen3.7-Max $1.25 $0.13 (cached) $3.75 gpt-oss-120B $0.15 $0.60 Qwen3.5-397B-A17B $0.60 $0.35 (cached) $3.60 Qwen3.5 9B $0.17 $0.25 Gemma-4-31B-it-Pearl $0.28 $0.86 Cogito v2.1 671B $1.25 $1.25 Rnj-1 Instruct $0.15 $0.15 Llama 3.3 70B $1.04 $1.04 Gemma 3n E4B Instruct $0.06 $0.12 gpt-oss-20B $0.05 $0.20 Qwen3 235B A22B FP8 Throughput $0.20 $0.60 MiniMax M2.5 $0.30 $0.06 (cached) $1.20 GLM-5.1 $1.40 $0.26 (cached) $4.40 MiniMax M2.7 $0.30 $0.06 (cached) $1.20 Qwen3.6-Plus $0.50 $3.00
Verified At
2026-07-04T08:00:00Z
Changelog History
- 2026-07-04 v0.3: together/llama-3.3-70b added