Included catalogue
Every tier can access the same catalogue. Tiers differ by capacity, SLA, support, context and priority, not by hiding models behind a premium plan.
| Model | Endpoint | Primary use | License |
|---|---|---|---|
| Qwen/Qwen3.6-35B-A3B | /v1/chat/completions | Chat, RAG, classification, extraction | Apache 2.0 |
| Qwen3-Embedding-8B | /v1/embeddings | Semantic search and retrieval | Apache 2.0 |
| Qwen3-Reranker-4B | /v1/rerank | Second-stage reranking for RAG | Apache 2.0 |
| Whisper large-v3 | /v1/audio/transcriptions | Multilingual transcription | MIT |
| Kokoro 82M TTS | /v1/audio/speech | Voice for IVR and agents | Apache 2.0 |
Practical compatibility
- Use the official OpenAI SDK and change `base_url` to `https://api.tesseraai.cloud/v1`.
- Keep your `messages`, `tools`, `response_format` and streaming structures where the endpoint supports them.
- Commercial model aliases can map internally to preserve contractual stability.
Model freeze and upgrades
Contracts can freeze a model family for 12 months. Upgrades are offered as opt-in to avoid silent changes in quality, token cost or latency.