Open Source
LLM Proxy & Gateway
Cost Control

Deploy LiteLLM on CometVPS

One OpenAI-format proxy in front of 100+ LLM providers. Issue per-user virtual keys with budgets, track spend per request, and fail over between providers — all self-hosted on your own server.

What is LiteLLM?

LiteLLM is a self-hosted proxy that puts a unified OpenAI-compatible API in front of every major LLM provider. Apps talk to one endpoint; LiteLLM handles the provider-specific quirks, key management, fallbacks, rate limits, and per-user cost tracking.

One API, Every Model

Apps speak the OpenAI format. LiteLLM translates to Anthropic, Google, Bedrock, Ollama, and 100+ other providers transparently.

Virtual Keys & Budgets

Provider API keys live in LiteLLM. Hand out short-lived virtual keys with spend caps, model allow-lists, and rate limits per user or app.

Visibility & Control

Detailed logs of tokens, latency, and cost per request, per user, per model. Finally know what your AI bill is going to before it arrives.

Key Features

Everything you need to run a production AI gateway on your own infrastructure

100+ LLM Providers

Unified OpenAI-format API across OpenAI, Anthropic, Google, Azure, Bedrock, Groq, OpenRouter, Ollama, vLLM, and 100+ others. One SDK, every model.

Virtual Keys & Budgets

Issue per-user or per-team virtual API keys with rate limits, monthly spend caps, and allowed-model lists. Revoke instantly without rotating provider keys.

Cost Tracking & Analytics

Track tokens, cost, and latency per request, per user, per model. Postgres-backed dashboards make AI spend finally visible across your org.

Fallbacks & Load Balancing

Round-robin, weighted, or latency-based routing across model deployments. Automatic fallback to a backup model when a provider rate-limits or fails.

Installation Guide

Get LiteLLM running on your CometVPS server in 5 simple steps

Security Tip

Always set a strong LITELLM_MASTER_KEY environment variable and never hand it out — it's the root key. Issue per-user virtual keys for everything else. Front the proxy with HTTPS, and store provider API keys in environment variables (not in config.yaml committed to git). Rotate keys regularly.

Recommended VPS Plans

LiteLLM is light on resources — pick a plan based on traffic volume

Recommended
Personal Proxy

Core VPS - Basic

$14/mo

2 vCPU Cores, 4GB RAM, 60GB NVMe

LiteLLM is lightweight — a small VPS happily proxies thousands of requests per day for personal use and side projects.

Team / Production

Supernova VPS - Spark

$28/mo

4 AMD Ryzen Cores, 8GB DDR5, 100GB NVMe

Faster Ryzen cores and DDR5 keep latency low under team load, with room for the Postgres database holding usage logs and virtual keys.

High-Volume Gateway

Supernova VPS - Flare

$58/mo

6 AMD Ryzen Cores, 16GB DDR5, 200GB NVMe

Plenty of capacity for a company-wide AI gateway routing across many providers, with detailed cost analytics in Postgres.

Why Deploy on CometVPS?

Get the best performance and reliability for your AI gateway

Centralized API Keys

Provider keys live in one place — apps and users only ever hold short-lived virtual keys.

NVMe SSD Storage

Fast disk I/O for the Postgres database storing usage logs, keys, and budget state.

10Gbps Network

Premium connectivity keeps proxy latency near zero on top of provider response times.

24/7 Expert Support

Our team is here to help with server issues around the clock.

Ready to Deploy LiteLLM?

Centralize your AI keys, track spend per user, and route around provider outages — all from one self-hosted proxy.