Key features
- Serverless Inference API for open-source LLMs
- Custom GPU cluster rentals for large-scale training
- Together Fine-tuning for domain-specific model optimization
- Support for Llama, Mistral, and Qwen model families
- High-throughput inference via Together Flash
