Back to API services

Developer API OpenAI-compatible Smart routing

Smart Routing AI Model

One OpenAI-compatible endpoint with smart plan-based routing.

Smart Routing AI Model gives developers one stable AI endpoint for chat completions, completions, Responses-compatible calls, and model discovery. It is built for practical AI apps, coding tools, internal automations, and external clients that need strong output quality, predictable quotas, and simple token-based access without managing provider complexity.

POST /api/smart-routing-ai-model/v1/chat/completions

View API documentation Login

Starting at Free

VS Code extension

Use Smart Routing directly inside VS Code

Smart AI Agent is a free VS Code extension that uses this Smart Routing model as its agent backend. Install the extension, copy your service token from the documentation page, and run coding tasks from the VS Code Activity Bar.

View extension Get token

API overview

A smart routing layer for AI applications, agents, and developer tools

Smart Routing AI Model is an OpenAI-compatible REST API designed for teams that want one practical endpoint instead of manually switching between model vendors, reasoning profiles, and cost settings. You send a familiar request shape, and the platform chooses the right routing profile for your active plan.

The service is built to be one of the smartest and most powerful options available for practical AI automation: optimized for quality, cost, and day-to-day usability instead of forcing every developer to maintain separate provider logic. It is especially useful when you want to connect AI assistants, coding tools, internal apps, or external clients to a single stable endpoint.

Every account receives a service token, daily and monthly quota tracking, plan-based routing profiles, and OpenAI-style responses for chat completions, classic completions, and Responses-compatible clients.

Smart routing profiles

Each plan maps to a different public routing profile, balancing reasoning depth, speed, and quota without exposing backend implementation details.

OpenAI-compatible shape

Use familiar endpoints such as chat completions, completions, responses, and models with bearer-token authentication.

Private token per service

Each user gets a dedicated API token for this service, with regeneration and token revocation built into the documentation page.

Daily and monthly quotas

The API enforces both short-term daily limits and monthly plan limits so usage stays predictable.

How it works

How the routing API works

Create or login to your AI-tools account

Opening the documentation page activates the free plan if needed and creates a service token for your account.

Call an OpenAI-compatible endpoint

Send JSON to /v1/chat/completions, /v1/completions, or /v1/responses with Authorization: Bearer {token}.

The plan selects the routing profile

The active subscription controls the public model alias, reasoning profile, speed tier, and daily/monthly prompt quota.

Receive a standard response

The response follows familiar OpenAI-style fields so client libraries and developer tools can integrate with minimal changes.

Use cases

Where Smart Routing AI Model helps

Custom AI assistants that need one stable API endpoint.
Developer tools that can use OpenAI-compatible model providers.
Codex CLI and Kilo Code configurations that accept an external compatible endpoint.
Internal business apps where plan-based quota control matters.
Prototype agents where switching provider logic would slow down development.
High-volume automations that need predictable daily and monthly prompt budgets.

Routing Capabilities

OpenAI-compatible endpoint structure with controlled quota and public routing profiles.

OpenAI-compatible endpoints

Use chat completions, completions, Responses-compatible calls, and model listing through one REST API base URL.

Smart routing profiles

Each plan selects a different public model alias, reasoning profile, speed tier, and prompt capacity.

Daily and monthly quotas

Prompt usage is tracked by service token across both daily and monthly windows for predictable API usage.

External tool ready

Designed for tools that accept OpenAI-compatible provider settings, including coding assistants and internal agent frameworks.

API Plans

Daily and monthly prompt quotas with plan-based public routing profiles.

Free

Included profile for testing and lightweight routing usage.

Public model smart-route-mini Smart-Route Mini · Standard

Essential Best for lightweight prompts and quick experiments.

Free

200 prompts/day · 1000 prompts/month

Smart-Route Mini public routing profile
200 prompts per day
1000 prompts per month
Chat, completions, responses, and models endpoints

Plus

More capacity for regular API usage and small automations.

Public model smart-route Smart-Route · Standard

Standard power More room for everyday assistants and app integrations.

EUR 9.99/mo billed monthly

500 prompts/day · 5000 prompts/month

Smart-Route public routing profile
500 prompts per day
5000 prompts per month
Standard speed tier

Recommended

Business

Balanced routing for teams, internal tools, and stronger prompt reasoning.

Public model smart-route-focus Smart-Route Focus · Standard

Focused reasoning Stronger planning for structured workflows and teams.

EUR 29.99/mo billed monthly

1000 prompts/day · 10000 prompts/month

Smart-Route Focus public routing profile
1000 prompts per day
10000 prompts per month
Balanced business reasoning profile

Pro

Deep routing profile for heavy prompts and professional automation.

Public model smart-route-deep Smart-Route Deep · Standard

Deep reasoning Built for deeper analysis, coding support, and larger prompts.

EUR 59.99/mo billed monthly

3000 prompts/day · 30000 prompts/month

Smart-Route Deep public routing profile
3000 prompts per day
30000 prompts per month
Deep reasoning profile

Ultimate

High-depth routing with a larger monthly capacity for advanced workflows.

Public model smart-route-ultra Smart-Route Ultra · Standard

Ultra depth Maximum reasoning profile for demanding automation.

EUR 159.99/mo billed monthly

5000 prompts/day · 70000 prompts/month

Smart-Route Ultra public routing profile
5000 prompts per day
70000 prompts per month
Ultra-depth routing profile

Max

Fast premium routing for high-volume teams and production workloads.

Public model smart-route-turbo Smart-Route Turbo · Fast

Turbo power Maximum reasoning with fast routing for high-volume work.

EUR 559.99/mo billed monthly

10000 prompts/day · 100000 prompts/month

Smart-Route Turbo public routing profile
10000 prompts per day
100000 prompts per month
Fast speed tier

Unlimited

Top routing profile with unrestricted prompt volume for demanding teams.

Public model smart-route-infinite Smart-Route Infinite · Fast

Infinite scale Top public profile for unrestricted prompt volume.

EUR 1,599.99/mo billed monthly

Unlimited prompts

Smart-Route Infinite public routing profile
Unlimited daily prompts
Unlimited monthly prompts
Fast speed tier

Endpoint pattern

Use it like a familiar AI model API

Open the documentation page to copy your token, inspect quotas, test requests, and configure external tools.

POST https://ai.mihajlo.mk/api/smart-routing-ai-model/v1/chat/completions