Model integration · OpenAI

OpenAI models, unchanged. The bill, very changed.

Keep your SDK, prompts, and model names exactly as they are. Swap the base URL and your OpenAI traffic runs on the flat ChatGPT subscription you already pay for, through Codex Hosted. Your API key stays connected as fallback.

Start free How to connect

$129/month SaaS. Bring your own model keys. No inference markup.

Three steps to connect.

Connect your ChatGPT account

Sign in with OpenAI's official device-code login. ProxyLLM runs Codex in an isolated container for your account, so OpenAI workloads bill to your ChatGPT plan instead of per-token pricing.

Point your SDK

Set the base URL to https://api.proxyllm.ai/v1 and authenticate with your ProxyLLM key. Python, Node, curl: anything that speaks the OpenAI API works unchanged.

Request models as usual

gpt-5, gpt-4o, gpt-4o-mini: same names you use today. Requests run through Codex Hosted. At a plan limit they fall back to a second Codex account or your own OpenAI key, stored AES-256-GCM encrypted.

One client change.

Swap the base URL and key. Everything else in your codebase stays put.

client.py

from openai import OpenAI

client = OpenAI(
    base_url="https://api.proxyllm.ai/v1",
    api_key="pk_live_…",  # your ProxyLLM key
)

r = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Classify this ticket."}],
)

Codex Hosted · the main feature

Run your AI workloads on your ChatGPT subscription.

ProxyLLM runs OpenAI's Codex for you, signed in with your own ChatGPT account. Your apps call one OpenAI-compatible endpoint and the work bills to your flat plan instead of per-token API pricing.

Get Codex Hosted How it works

$129/month · normal SaaS pricing

Connect OpenAI in two minutes.

$129/month flat, no inference markup. OpenAI traffic moves off the per-token meter and onto your existing ChatGPT plan.

Start free All integrations