Together AI provides fast, cost-effective inference for popular open-source models through an OpenAI-compatible API, making it a drop-in replacement in any codebase already using the OpenAI SDK. It is one of the leading platforms for teams that want open-model performance without managing GPU infrastructure.
AI engineers, startups, and enterprises use Together AI when they want the flexibility and cost advantages of open-source models with the reliability and convenience of a managed API. Its OpenAI compatibility means existing LangChain, LlamaIndex, and Vercel AI SDK integrations work immediately.
Together AI is particularly strong for teams that need to evaluate multiple open models quickly — you can switch between Llama 3 70B, Mixtral 8x22B, and DBRX in a single line of code without provisioning different infrastructure for each.
from openai import OpenAI
client = OpenAI(
api_key='YOUR_TOGETHER_API_KEY',
base_url='https://api.together.xyz/v1'
)
response = client.chat.completions.create(
model='meta-llama/Llama-3-70b-chat-hf',
messages=[{'role': 'user', 'content': 'Hello!'}]
)pip install togetherimport together
client = together.Together(api_key='YOUR_KEY')
response = client.chat.completions.create(
model='meta-llama/Llama-3-70b-chat-hf',
messages=[{'role': 'user', 'content': 'Hello!'}]
)Pricing: Pay per token. Llama 3 8B: ~$0.20/M tokens. Llama 3 70B: ~$0.90/M tokens. Mixtral 8x7B: ~$0.60/M tokens. Fine-tuning priced per training token. Full pricing details.
-turbo in the model list. For cost optimization, use JSON mode to get structured outputs without wasting tokens on formatting instructions.Be the first to share a Together AI case study and get discovered by clients.
Submit a case studyThought leaders
Follow for insights, tutorials, and thought leadership
Together AI
Co-Founder and CEO of Together AI, providing fast and affordable infrastructure for open-source AI models. Building the platform to make open-source AI competitive with closed-source alternatives.
Stanford University / Together AI
Associate Professor of Computer Science at Stanford and Co-Founder of Together AI. Directs Stanford's Center for Research on Foundation Models (CRFM). Created HELM (Holistic Evaluation of Language Models). Pioneering researcher in NLP and AI transparency.
Submit a brief and we'll match you with vetted specialists who have proven Together AI experience.