Migrating from OpenAI
base_url swap, model mapping table and documented behavioural differences.
Managed AI inference for mid-market companies. OpenAI-compatible API.
Change a base_url, export your API key, fire the first request. No custom SDK: your OpenAI client works as-is.
from openai import OpenAI
client = OpenAI(
base_url="https://api.tesseraai.cloud/v1",
api_key="sk-tessera-…",
)
resp = client.chat.completions.create(
model="Qwen/Qwen3.6-35B-A3B",
messages=[{"role": "user", "content": "Hello"}],
)
print(resp.choices[0].message.content)What most developers reach for in their first week.
base_url swap, model mapping table and documented behavioural differences.
Qwen/Qwen3.6-35B-A3B, Whisper large-v3, Kokoro TTS and Qwen3-Embedding-8B. Capabilities, context and licenses.
Sustained RPM, bursts, per-bundle-service sublimits. How to handle 429 and backoff.
OpenAI-compatible function calling. Parallel tools and structured JSON output examples.
Per-request usage events, plugged into your billing or cost-center system without the dashboard.
Signable document, public subprocessor list and physical GPU location for your DPO.