How the TokModel API gateway works

TokModel sits between your application and the AI providers you want to use. Instead of integrating each provider separately — managing different SDKs, authentication schemes, and API shapes — you send every request to https://tokmodel.com and TokModel handles the routing. Your code stays the same regardless of which model or provider you choose.

How the routing works

When you send a request to TokModel, three things happen:

1. TokModel authenticates the request using your API key.
2. TokModel reads the model field in your request body to determine which provider and model to call.
3. TokModel forwards the request to that provider, receives the response, and returns it to you in a consistent format.

Your application never communicates directly with individual providers. All requests go through https://tokmodel.com.
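
The second step, reading the model field, can be illustrated with a minimal sketch. It assumes model IDs use the provider/model-name form shown elsewhere on this page. The helper name `split_model_id` is hypothetical and is not part of TokModel; the gateway's actual routing logic is not documented here.

```python
def split_model_id(model_id: str) -> tuple[str, str]:
    """Split a 'provider/model-name' ID into (provider, model_name).

    Hypothetical helper for illustration only; it mirrors the naming
    convention, not TokModel's internal implementation.
    """
    # partition() splits on the first "/" only, so any later slashes
    # stay inside the model name.
    provider, sep, model_name = model_id.partition("/")
    if not sep or not provider or not model_name:
        raise ValueError(f"expected 'provider/model-name', got {model_id!r}")
    return provider, model_name


print(split_model_id("openai/gpt-4o"))  # ('openai', 'gpt-4o')
```

An ID without a provider prefix, such as "gpt-4o", raises `ValueError` in this sketch; how TokModel itself treats such IDs is not stated on this page.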

Drop-in replacement

TokModel is fully compatible with the OpenAI client API. If your application already calls OpenAI, you can switch to TokModel by pointing the client at the TokModel base URL, supplying your TokModel API key, and using a provider-prefixed model ID:

from openai import OpenAI

client = OpenAI(
    api_key="your-tokmodel-api-key",
    base_url="https://tokmodel.com/v1",
)

response = client.chat.completions.create(
    model="openai/gpt-4o",
    messages=[{"role": "user", "content": "Hello"}],
)

The same change works in any OpenAI-compatible SDK: the official OpenAI clients for Python, Node.js, Go, Java, and others, as well as third-party tools that accept a configurable base URL.

Supported endpoint families

TokModel exposes endpoints across eight capability areas:

| Capability | Endpoints |
| --- | --- |
| Chat completions | `POST /v1/chat/completions` |
| Responses | `POST /v1/responses`, `POST /v1/responses/compact` |
| Anthropic messages | `POST /v1/messages` |
| Text embeddings | `POST /v1/embeddings` |
| Reranking | `POST /v1/rerank` |
| Image generation | `POST /v1/images/generations`, `POST /v1/images/edits`, `POST /v1/images/variations` |
| Speech & audio | `POST /v1/audio/speech`, `POST /v1/audio/transcriptions`, `POST /v1/audio/translations` |
| Model listing | `GET /v1beta/models` |

Endpoints that mirror an OpenAI API accept the same request format as their OpenAI equivalent, so existing integration code needs no changes beyond the base URL, API key, and model ID.

Use the model field to target any provider available on TokModel. Model IDs follow a provider/model-name convention — for example, anthropic/claude-3-5-sonnet, google/gemini-2.0-flash, or mistral/mistral-large-latest. See Models and providers for the full list.
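
As a rough sketch of what "targeting any provider" looks like in practice, the snippet below builds chat-completion request bodies for three of the model IDs mentioned above. Only the model value differs between them. The helper `chat_payload` is a hypothetical name introduced for illustration, and nothing is sent over the network.

```python
import json


def chat_payload(model: str, prompt: str) -> dict:
    """Build a chat-completions request body (hypothetical helper)."""
    return {
        "model": model,  # the only field that changes per provider
        "messages": [{"role": "user", "content": prompt}],
    }


# Model IDs taken from the examples on this page.
MODELS = [
    "openai/gpt-4o",
    "anthropic/claude-3-5-sonnet",
    "google/gemini-2.0-flash",
]

payloads = [chat_payload(model, "Hello") for model in MODELS]
for payload in payloads:
    print(json.dumps(payload))
```

Each payload could be passed to the same client and the same `POST /v1/chat/completions` endpoint, assuming the model ID is available on your account.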

Authentication

Every request requires an Authorization header with a Bearer token:

Authorization: Bearer your-tokmodel-api-key

You create and manage API keys in the TokModel console. Each key is scoped to your account and carries the credit balance associated with that account.
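
For callers that do not use an SDK, the header can be set by hand. The sketch below builds, but does not send, an authenticated request with Python's standard library. The environment variable name `TOKMODEL_API_KEY` and the helper `build_chat_request` are assumptions made for this example, not names defined by TokModel.

```python
import json
import os
import urllib.request

BASE_URL = "https://tokmodel.com/v1"


def build_chat_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build a POST request carrying the Bearer token (hypothetical helper)."""
    body = json.dumps(
        {"model": model, "messages": [{"role": "user", "content": prompt}]}
    ).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# TOKMODEL_API_KEY is an assumed variable name; the fallback is a placeholder.
api_key = os.environ.get("TOKMODEL_API_KEY", "your-tokmodel-api-key")
request = build_chat_request(api_key, "openai/gpt-4o", "Hello")
print(request.get_method(), request.full_url)
# POST https://tokmodel.com/v1/chat/completions
```

Passing the request to `urllib.request.urlopen` would send it. Reading the key from the environment rather than hard-coding it keeps the credential out of source control.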
