Back to API services
Developer API OpenAI-compatible Smart routing

Smart Routing AI Model

One OpenAI-compatible endpoint with smart plan-based routing.

Smart Routing AI Model gives developers one stable AI endpoint for chat completions, completions, Responses-compatible calls, and model discovery. It is built for practical AI apps, coding tools, internal automations, and external clients that need strong output quality, predictable quotas, and simple token-based access without managing provider complexity.

POST /api/smart-routing-ai-model/v1/chat/completions
Smart Routing AI Model
Starting at Free

VS Code extension

Use Smart Routing directly inside VS Code

Smart AI Agent is a free VS Code extension that uses this Smart Routing model as its agent backend. Install the extension, copy your service token from the documentation page, and run coding tasks from the VS Code Activity Bar.

API overview

A smart routing layer for AI applications, agents, and developer tools

Smart Routing AI Model is an OpenAI-compatible REST API designed for teams that want one practical endpoint instead of manually switching between model vendors, reasoning profiles, and cost settings. You send a familiar request shape, and the platform chooses the right routing profile for your active plan.

The service is built to be one of the smartest and most powerful options available for practical AI automation: optimized for quality, cost, and day-to-day usability instead of forcing every developer to maintain separate provider logic. It is especially useful when you want to connect AI assistants, coding tools, internal apps, or external clients to a single stable endpoint.

Every account receives a service token, daily and monthly quota tracking, plan-based routing profiles, and OpenAI-style responses for chat completions, classic completions, and Responses-compatible clients.

Smart routing profiles

Each plan maps to a different public routing profile, balancing reasoning depth, speed, and quota without exposing backend implementation details.

OpenAI-compatible shape

Use familiar endpoints such as chat completions, completions, responses, and models with bearer-token authentication.

Private token per service

Each user gets a dedicated API token for this service, with regeneration and token revocation built into the documentation page.

Daily and monthly quotas

The API enforces both short-term daily limits and monthly plan limits so usage stays predictable.

How it works

How the routing API works

01

Create or login to your AI-tools account

Opening the documentation page activates the free plan if needed and creates a service token for your account.

02

Call an OpenAI-compatible endpoint

Send JSON to /v1/chat/completions, /v1/completions, or /v1/responses with Authorization: Bearer {token}.

03

The plan selects the routing profile

The active subscription controls the public model alias, reasoning profile, speed tier, and daily/monthly prompt quota.

04

Receive a standard response

The response follows familiar OpenAI-style fields so client libraries and developer tools can integrate with minimal changes.

Use cases

Where Smart Routing AI Model helps

  • Custom AI assistants that need one stable API endpoint.
  • Developer tools that can use OpenAI-compatible model providers.
  • Codex CLI and Kilo Code configurations that accept an external compatible endpoint.
  • Internal business apps where plan-based quota control matters.
  • Prototype agents where switching provider logic would slow down development.
  • High-volume automations that need predictable daily and monthly prompt budgets.

Routing Capabilities

OpenAI-compatible endpoint structure with controlled quota and public routing profiles.

OpenAI-compatible endpoints

Use chat completions, completions, Responses-compatible calls, and model listing through one REST API base URL.

Smart routing profiles

Each plan selects a different public model alias, reasoning profile, speed tier, and prompt capacity.

Daily and monthly quotas

Prompt usage is tracked by service token across both daily and monthly windows for predictable API usage.

External tool ready

Designed for tools that accept OpenAI-compatible provider settings, including coding assistants and internal agent frameworks.

API Plans

Daily and monthly prompt quotas with plan-based public routing profiles.

Free

Included profile for testing and lightweight routing usage.

Public model smart-route-mini Smart-Route Mini · Standard
Essential Best for lightweight prompts and quick experiments.

Free

Login to continue

200 prompts/day · 1000 prompts/month

  • Smart-Route Mini public routing profile
  • 200 prompts per day
  • 1000 prompts per month
  • Chat, completions, responses, and models endpoints

Plus

More capacity for regular API usage and small automations.

Public model smart-route Smart-Route · Standard
Standard power More room for everyday assistants and app integrations.

EUR 9.99/mo billed monthly

Login to continue

500 prompts/day · 5000 prompts/month

  • Smart-Route public routing profile
  • 500 prompts per day
  • 5000 prompts per month
  • Standard speed tier

Pro

Deep routing profile for heavy prompts and professional automation.

Public model smart-route-deep Smart-Route Deep · Standard
Deep reasoning Built for deeper analysis, coding support, and larger prompts.

EUR 59.99/mo billed monthly

Login to continue

3000 prompts/day · 30000 prompts/month

  • Smart-Route Deep public routing profile
  • 3000 prompts per day
  • 30000 prompts per month
  • Deep reasoning profile

Ultimate

High-depth routing with a larger monthly capacity for advanced workflows.

Public model smart-route-ultra Smart-Route Ultra · Standard
Ultra depth Maximum reasoning profile for demanding automation.

EUR 159.99/mo billed monthly

Login to continue

5000 prompts/day · 70000 prompts/month

  • Smart-Route Ultra public routing profile
  • 5000 prompts per day
  • 70000 prompts per month
  • Ultra-depth routing profile

Max

Fast premium routing for high-volume teams and production workloads.

Public model smart-route-turbo Smart-Route Turbo · Fast
Turbo power Maximum reasoning with fast routing for high-volume work.

EUR 559.99/mo billed monthly

Login to continue

10000 prompts/day · 100000 prompts/month

  • Smart-Route Turbo public routing profile
  • 10000 prompts per day
  • 100000 prompts per month
  • Fast speed tier

Unlimited

Top routing profile with unrestricted prompt volume for demanding teams.

Public model smart-route-infinite Smart-Route Infinite · Fast
Infinite scale Top public profile for unrestricted prompt volume.

EUR 1,599.99/mo billed monthly

Login to continue

Unlimited prompts

  • Smart-Route Infinite public routing profile
  • Unlimited daily prompts
  • Unlimited monthly prompts
  • Fast speed tier

Endpoint pattern

Use it like a familiar AI model API

Open the documentation page to copy your token, inspect quotas, test requests, and configure external tools.

POST https://ai.mihajlo.mk/api/smart-routing-ai-model/v1/chat/completions