LLM modelsBYOK supported

OpenRouter for AI voice agents

OpenRouter is one of the LLMs you can run a Telenow AI voice agent on. One key, hundreds of models through an OpenAI-compatible gateway. Type or paste ANY model ID from openrouter.ai/models (e.g. anthropic/claude-sonnet-4.5) — the suggestions are just popular call-friendly picks; pricing roughly tracks the underlying provider. Pick a OpenRouter model as the agent's brain and it drives the whole conversation — understanding the caller, deciding what to say, and calling your tools — fast enough to keep a live call natural. Mix and match it with any LLM, STT, TTS and carrier — Telenow is component-level, so you're never locked in.

Last updated 2026-06-25

OpenRouter models for voice

ModelContextInput /MOutput /MFirst tokenTools
OpenAI · GPT-4o mini128K$0.15$0.6~300ms
Anthropic · Claude 3.5 Haiku200K$0.8$4~320ms
Google · Gemini 2.0 Flash1000K$0.1$0.4~280ms
Meta · Llama 3.3 70B131K$0.12$0.3~320ms
Google · Gemini 2.5 Flash1049K$0.3$2.5~300ms
xAI · Grok 4 Fast2000K$0.2$0.5~300ms

Frequently asked questions

Can I use OpenRouter for an AI voice agent?

Yes — OpenRouter is a first-class LLM option in Telenow. Select a OpenRouter model as your agent's brain and it powers real-time voice and chat conversations.

Which OpenRouter models can I use?

OpenAI · GPT-4o mini, Anthropic · Claude 3.5 Haiku, Google · Gemini 2.0 Flash, Meta · Llama 3.3 70B, Google · Gemini 2.5 Flash, and xAI · Grok 4 Fast — Telenow lists the low-latency OpenRouter models that keep the first reply fast on a live call.

Does OpenRouter support tools / function calling?

Yes — the listed OpenRouter models support function calling, so your agent can take actions (book, pay, look up, transfer) mid-conversation.

How much does OpenRouter cost for a voice agent?

You pay OpenRouter's usage at cost plus Telenow's transparent platform fee — billed per component (speech, model, telephony) and per minute, with new accounts getting free signup credit to try it.

Can I bring my own OpenRouter API key?

Yes. Telenow supports BYOK — paste your own OpenRouter key to bill OpenRouter usage to your account, or use the platform key and pay through Telenow.

Other llm models options

OpenAI
Industry-leading reasoning, broad tool-call support. Only low-latency chat models are listed — slow reasoning models (o1 / o3) are deliberately excluded, they stall the first reply on a live call.
Anthropic
Strong reasoning and tool use; great for nuanced personas. For live calls prefer Haiku (fastest/cheapest) or Sonnet (balanced) — Opus is higher quality but pricier and slower to first token.
Groq
Fastest tokens-per-second on hosted open models.
Google Gemini
Gemini 2.5 family via the Gemini API (OpenAI-compatible). Flash and Flash-Lite are fast + cheap with huge context — strong defaults for live calls; Pro for the hardest reasoning.
xAI Grok
Grok via the xAI API (OpenAI-compatible). Grok 4 Fast (non-reasoning) is built for low-latency chat with a 2M context; Grok 3 mini is the budget pick. Always-on reasoning Grok 4 is excluded — too slow to first token for live calls.
Azure OpenAI
OpenAI models hosted in your Azure tenancy. The deployment you configure below selects the model; the chosen model here is used for the cost/latency estimate. Requires endpoint + deployment + API version.
AWS Bedrock
Claude, Llama and Amazon Nova through AWS Bedrock (unified ConverseStream API). Bring your own AWS credentials, or leave them blank to use the platform AWS account.
Custom API (full agentic workflow)
You already built an agentic workflow on your server — tools, RAG, memory, system prompt all live with you. We just attach voice (STT + TTS) and pipe each user turn through your SSE endpoint.
Custom LLM (open-source / self-hosted)
Point at any OpenAI-compatible endpoint — Ollama, vLLM, llama.cpp, LM Studio, or a self-deployed model. We treat it exactly like OpenAI: same chat-completions wire format, your model + your URL. Tools work too.
₹300.00 free credit on signup

Build a voice agent with OpenRouter

Sign up free and get ₹300.00 in credit — no card required. Connect your number, pick a template, and go live in minutes.