Integrations

One gateway for every model, agent, and workflow.

ProxyLLM is an OpenAI-compatible endpoint. Anything that lets you set a base URL works with it: SDKs, automation platforms, coding agents, LLM routers, and model providers. OpenAI-bound work runs through Codex Hosted on your ChatGPT subscription.

Integrations · Models
OpenAI

OpenAI

Keep your OpenAI models and SDK. Run the work on your ChatGPT subscription, keep your key as fallback.

Anthropic

Claude

Sonnet, Haiku, and Opus behind the same OpenAI-compatible endpoint and budget controls.

Google Gemini

Gemini

Flash for cheap high-volume work, Pro for harder tasks, with one ProxyLLM key.

DeepSeek

Cheap reasoning and code models for agent loops, with spend caps per app or workflow.

xAI / Grok

Grok models for live-style assistant workloads without exposing provider keys everywhere.

Meta

Meta Llama

Open-weight Llama models for cheap summarization and classification behind one endpoint.

Mistral

Fast European models with app-level usage controls and request logs in front.

Cohere

Enterprise text and retrieval-heavy workloads tracked through one gateway.

Groq

Very fast inference for interactive agents and latency-sensitive work.

Perplexity

Search-answer models with separate keys, budgets, and request logs.

Together

Open model catalog access with ProxyLLM keys and cost visibility.

Fireworks

Production open-model serving behind scoped app keys.

Cerebras

High-throughput inference for fast agent steps and batch jobs.

Replicate

Hosted open models and specialized workloads behind one auditable endpoint.

Hugging Face

Inference endpoints and open models tracked alongside the rest of your LLM traffic.

Azure OpenAI

Enterprise OpenAI deployments with centralized keys, budgets, and logs.

Amazon Bedrock

AWS-hosted model access framed as a controlled enterprise provider lane.

Vertex AI

Google Cloud model deployments with ProxyLLM keys and governance in front.

Codex Hosted · the main feature

Run your AI workloads on your ChatGPT subscription.

ProxyLLM runs OpenAI's Codex for you, signed in with your own ChatGPT account. Your apps call one OpenAI-compatible endpoint and the work bills to your flat plan instead of per-token API pricing.

OpenAI-compatible · /v1/chat/completions

Missing your tool? It probably still works.

LangChain, Cursor, Vercel AI SDK, plain curl: if it accepts an OpenAI base URL, it accepts ProxyLLM. Set the base URL, use your ProxyLLM key, done.