Model integration · Azure OpenAI

Azure for compliance, ProxyLLM for the rest.

Azure OpenAI lives inside enterprise cloud policy. ProxyLLM covers the standard OpenAI side: your own key passes through one endpoint, and OpenAI-bound work can run on Codex Hosted against a flat ChatGPT subscription instead of per-token pricing.

$129/month SaaS. Bring your own model keys. No inference markup.

Three steps to connect.

01

Know the Azure boundary

Azure OpenAI uses deployment-specific URLs and api-version parameters. ProxyLLM does not claim native Azure key storage today, so keep policy-bound deployments where they are.

02

Centralize the standard OpenAI side

Point OpenAI-compatible clients at https://api.proxyllm.ai/v1 with your own OpenAI or OpenRouter key. Same request shape, one endpoint, a log line for every call.

03

Scope team access

Issue scoped sub-keys with budget caps so each app spends against a limit you set, not against a shared secret.

The standard OpenAI side, one endpoint.

Pass gpt-4o-mini and other standard OpenAI models through ProxyLLM when a request does not need an Azure-specific deployment.

fallback.ts
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.proxyllm.ai/v1",
  apiKey: "pk_live_...",
});

const r = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "Run the non-Azure fallback path." }],
});
Codex Hosted · the main feature

Run your AI workloads on your ChatGPT subscription.

ProxyLLM runs OpenAI's Codex for you, signed in with your own ChatGPT account. Your apps call one OpenAI-compatible endpoint and the work bills to your flat plan instead of per-token API pricing.

$129/month · normal SaaS pricing

Give OpenAI traffic a flat bill.

Codex Hosted runs OpenAI's Codex on ProxyLLM servers, signed in with your ChatGPT account. Azure-specific deployments stay on Azure; everything standard gets caps, logs, and no inference markup.