VOOZH about

URL: https://pypi.org/project/litellm/

โ‡ฑ litellm ยท PyPI


Skip to main content

litellm 1.89.2

pip install litellm

Latest release

Released:

Library to easily interface with LLM API providers

Navigation

Verified details

These details have been verified by PyPI
Owner

Unverified details

These details have not been verified by PyPI
Project links
Meta
  • License Expression: MIT
    SPDX License Expression
  • Author: BerriAI
  • Requires: Python <3.14, >=3.10
  • Provides-Extra: caching , extra-proxy , google , grpc , mlflow , proxy , proxy-runtime , semantic-router , stt-nvidia-riva , utils

Project description

๐Ÿš… LiteLLM

LiteLLM AI Gateway

Open Source AI Gateway for 100+ LLMs. Self-hosted. Enterprise-ready. Call any LLM in OpenAI format.

๐Ÿ‘ Deploy to Render
๐Ÿ‘ Deploy on Railway

LiteLLM Proxy Server (AI Gateway) | Hosted Proxy | Enterprise Tier | Website

๐Ÿ‘ PyPI Version
๐Ÿ‘ GitHub Stars
๐Ÿ‘ Y Combinator W23
๐Ÿ‘ Whatsapp
๐Ÿ‘ Discord
๐Ÿ‘ Slack
๐Ÿ‘ CodSpeed

๐Ÿ‘ LiteLLM AI Gateway

What is LiteLLM

LiteLLM is an open source AI Gateway that gives you a single, unified interface to call 100+ LLM providers โ€” OpenAI, Anthropic, Gemini, Bedrock, Azure, and more โ€” using the OpenAI format.

Use it as a Python SDK for direct library integration, or deploy the AI Gateway (Proxy Server) as a centralized service for your team or organization.

Jump to LiteLLM Proxy (LLM Gateway) Docs
Jump to Supported LLM Providers


Why LiteLLM

Managing LLM calls across providers gets complicated fast โ€” different SDKs, auth patterns, request formats, and error types for every model. LiteLLM removes that friction:

  • Unified API โ€” one interface for 100+ LLMs, no provider-specific SDK juggling
  • Drop-in OpenAI compatibility โ€” swap providers without rewriting your code
  • Production-ready gateway โ€” virtual keys, spend tracking, guardrails, load balancing, and an admin dashboard out of the box
  • 8ms P95 latency at 1k RPS (benchmarks)

OSS Adopters

๐Ÿ‘ Stripe
๐Ÿ‘ image
๐Ÿ‘ Google ADK
๐Ÿ‘ Greptile
๐Ÿ‘ OpenHands

Netflix

๐Ÿ‘ OpenAI Agents SDK

Features

Supported Providers (Website Supported Models | Docs)

Provider /chat/completions /messages /responses /embeddings /image/generations /audio/transcriptions /audio/speech /moderations /batches /rerank
Abliteration (abliteration) โœ…
AI/ML API (aiml) โœ… โœ… โœ… โœ… โœ…
AI21 (ai21) โœ… โœ… โœ…
AI21 Chat (ai21_chat) โœ… โœ… โœ…
Aleph Alpha โœ… โœ… โœ…
Amazon Nova โœ… โœ… โœ…
Anthropic (anthropic) โœ… โœ… โœ… โœ…
Anthropic Text (anthropic_text) โœ… โœ… โœ… โœ…
Anyscale โœ… โœ… โœ…
AssemblyAI (assemblyai) โœ… โœ… โœ… โœ…
Auto Router (auto_router) โœ… โœ… โœ…
AWS - Bedrock (bedrock) โœ… โœ… โœ… โœ… โœ…
AWS - Sagemaker (sagemaker) โœ… โœ… โœ… โœ…
Azure (azure) โœ… โœ… โœ… โœ… โœ… โœ… โœ… โœ… โœ…
Azure AI (azure_ai) โœ… โœ… โœ… โœ… โœ… โœ… โœ… โœ… โœ…
Azure Text (azure_text) โœ… โœ… โœ… โœ… โœ… โœ… โœ…
Baseten (baseten) โœ… โœ… โœ…
Bytez (bytez) โœ… โœ… โœ…
Cerebras (cerebras) โœ… โœ… โœ…
Clarifai (clarifai) โœ… โœ… โœ…
Cloudflare AI Workers (cloudflare) โœ… โœ… โœ…
Codestral (codestral) โœ… โœ… โœ…
Cohere (cohere) โœ… โœ… โœ… โœ… โœ…
Cohere Chat (cohere_chat) โœ… โœ… โœ…
CometAPI (cometapi) โœ… โœ… โœ… โœ…
CompactifAI (compactifai) โœ… โœ… โœ…
Custom (custom) โœ… โœ… โœ…
Custom OpenAI (custom_openai) โœ… โœ… โœ… โœ… โœ… โœ… โœ…
Dashscope (dashscope) โœ… โœ… โœ… โœ… โœ…
Databricks (databricks) โœ… โœ… โœ…
DataRobot (datarobot) โœ… โœ… โœ…
Deepgram (deepgram) โœ… โœ… โœ… โœ…
DeepInfra (deepinfra) โœ… โœ… โœ…
Deepseek (deepseek) โœ… โœ… โœ…
ElevenLabs (elevenlabs) โœ… โœ… โœ… โœ… โœ…
Empower (empower) โœ… โœ… โœ…
Fal AI (fal_ai) โœ… โœ… โœ… โœ…
Featherless AI (featherless_ai) โœ… โœ… โœ…
Fireworks AI (fireworks_ai) โœ… โœ… โœ…
FriendliAI (friendliai) โœ… โœ… โœ…
Galadriel (galadriel) โœ… โœ… โœ…
GitHub Copilot (github_copilot) โœ… โœ… โœ… โœ…
GitHub Models (github) โœ… โœ… โœ…
Google - PaLM โœ… โœ… โœ…
Google - Vertex AI (vertex_ai) โœ… โœ… โœ… โœ… โœ…
Google AI Studio - Gemini (gemini) โœ… โœ… โœ…
GradientAI (gradient_ai) โœ… โœ… โœ…
Groq AI (groq) โœ… โœ… โœ…
Heroku (heroku) โœ… โœ… โœ…
Hosted VLLM (hosted_vllm) โœ… โœ… โœ…
Huggingface (huggingface) โœ… โœ… โœ… โœ… โœ…
Hyperbolic (hyperbolic) โœ… โœ… โœ…
IBM - Watsonx.ai (watsonx) โœ… โœ… โœ… โœ…
Infinity (infinity) โœ…
Jina AI (jina_ai) โœ…
Lambda AI (lambda_ai) โœ… โœ… โœ…
Lemonade (lemonade) โœ… โœ… โœ…
LiteLLM Proxy (litellm_proxy) โœ… โœ… โœ… โœ… โœ…
Llamafile (llamafile) โœ… โœ… โœ…
LM Studio (lm_studio) โœ… โœ… โœ…
Maritalk (maritalk) โœ… โœ… โœ…
Meta - Llama API (meta_llama) โœ… โœ… โœ…
Mistral AI API (mistral) โœ… โœ… โœ… โœ…
Moonshot (moonshot) โœ… โœ… โœ…
Morph (morph) โœ… โœ… โœ…
Nebius AI Studio (nebius) โœ… โœ… โœ… โœ…
NLP Cloud (nlp_cloud) โœ… โœ… โœ…
Novita AI (novita) โœ… โœ… โœ…
Nscale (nscale) โœ… โœ… โœ…
Nvidia NIM (nvidia_nim) โœ… โœ… โœ…
OCI (oci) โœ… โœ… โœ…
Ollama (ollama) โœ… โœ… โœ… โœ…
Ollama Chat (ollama_chat) โœ… โœ… โœ…
Oobabooga (oobabooga) โœ… โœ… โœ… โœ… โœ… โœ… โœ…
OpenAI (openai) โœ… โœ… โœ… โœ… โœ… โœ… โœ… โœ… โœ…
OpenAI-like (openai_like) โœ…
OpenRouter (openrouter) โœ… โœ… โœ…
OVHCloud AI Endpoints (ovhcloud) โœ… โœ… โœ…
Perplexity AI (perplexity) โœ… โœ… โœ…
Petals (petals) โœ… โœ… โœ…
Predibase (predibase) โœ… โœ… โœ…
Recraft (recraft) โœ…
Replicate (replicate) โœ… โœ… โœ…
Sagemaker Chat (sagemaker_chat) โœ… โœ… โœ…
Sambanova (sambanova) โœ… โœ… โœ…
Snowflake (snowflake) โœ… โœ… โœ…
Text Completion Codestral (text-completion-codestral) โœ… โœ… โœ…
Text Completion OpenAI (text-completion-openai) โœ… โœ… โœ… โœ… โœ… โœ… โœ…
Together AI (together_ai) โœ… โœ… โœ…
Topaz (topaz) โœ… โœ… โœ…
Triton (triton) โœ… โœ… โœ…
V0 (v0) โœ… โœ… โœ…
Vercel AI Gateway (vercel_ai_gateway) โœ… โœ… โœ…
VLLM (vllm) โœ… โœ… โœ…
Volcengine (volcengine) โœ… โœ… โœ…
Voyage AI (voyage) โœ…
WandB Inference (wandb) โœ… โœ… โœ…
Watsonx Text (watsonx_text) โœ… โœ… โœ…
xAI (xai) โœ… โœ… โœ…
Xinference (xinference) โœ…

Read the Docs


Get Started

You can use LiteLLM through either the Proxy Server or Python SDK. Both give you a unified interface to access multiple LLMs (100+ LLMs). Choose the option that best fits your needs:

LiteLLM AI Gateway LiteLLM Python SDK
Use Case Central service (LLM Gateway) to access multiple LLMs Use LiteLLM directly in your Python code
Who Uses It? Gen AI Enablement / ML Platform Teams Developers building LLM projects
Key Features Centralized API gateway with authentication and authorization, multi-tenant cost tracking and spend management per project/user, per-project customization (logging, guardrails, caching), virtual keys for secure access control, admin dashboard UI for monitoring and management Direct Python library integration in your codebase, Router with retry/fallback logic across multiple deployments (e.g. Azure/OpenAI) - Router, application-level load balancing and cost tracking, exception handling with OpenAI-compatible errors, observability callbacks (Lunary, MLflow, Langfuse, etc.)

Stable Release: Use docker images with the -stable tag. These have undergone 12 hour load tests, before being published. More information about the release cycle here

Support for more providers. Missing a provider or LLM Platform, raise a feature request.

Run in Developer Mode

Services

  1. Setup .env file in root
  2. Run dependent services docker-compose up db prometheus

Backend

  1. (In root) create virtual environment python -m venv .venv
  2. Activate virtual environment source .venv/bin/activate
  3. Install dependencies uv sync --all-extras --group proxy-dev
  4. uv run prisma generate
  5. prisma generate
  6. Start proxy backend python litellm/proxy/proxy_cli.py

Frontend

  1. Navigate to ui/litellm-dashboard
  2. Install dependencies npm install
  3. Run npm run dev to start the dashboard

Verify Docker Image Signatures

All LiteLLM Docker images published to GHCR are signed with cosign. Every release is signed with the same key introduced in commit 0112e53.

Verify using the pinned commit hash (recommended):

A commit hash is cryptographically immutable, so this is the strongest way to ensure you are using the original signing key:

cosignverify\
--keyhttps://raw.githubusercontent.com/BerriAI/litellm/0112e53046018d726492c814b3644b7d376029d0/cosign.pub\
ghcr.io/berriai/litellm:<release-tag>

Verify using a release tag (convenience):

Tags are protected in this repository and resolve to the same key. This option is easier to read but relies on tag protection rules:

cosignverify\
--keyhttps://raw.githubusercontent.com/BerriAI/litellm/<release-tag>/cosign.pub\
ghcr.io/berriai/litellm:<release-tag>

Replace <release-tag> with the version you are deploying (e.g. v1.83.0-stable).


Enterprise

For companies that need better security, user management and professional support

Get an Enterprise License Talk to founders

This covers:

  • โœ… Features under the LiteLLM Commercial License:
  • โœ… Feature Prioritization
  • โœ… Custom Integrations
  • โœ… Professional Support - Dedicated discord + slack
  • โœ… Custom SLAs
  • โœ… Secure access with Single Sign-On

Contributing

We welcome contributions to LiteLLM! Whether you're fixing bugs, adding features, or improving documentation, we appreciate your help.

Quick Start for Contributors

This requires uv to be installed.

gitclonehttps://github.com/BerriAI/litellm.git
cdlitellm
makeinstall-dev# Install development dependencies
makeformat# Format your code
makelint# Run all linting checks
maketest-unit# Run unit tests
makeformat-check# Check formatting only

For detailed contributing guidelines, see CONTRIBUTING.md.

๐Ÿ“– Contributing to documentation? The LiteLLM docs have moved to a separate repository: BerriAI/litellm-docs. Please open doc PRs there. Docs are served at docs.litellm.ai.

Code Quality / Linting

LiteLLM follows the Google Python Style Guide.

Our automated checks include:

  • Black for code formatting
  • Ruff for linting and code quality
  • MyPy for type checking
  • Circular import detection
  • Import safety checks

All these checks must pass before your PR can be merged.

Support / talk with founders

Contributors

๐Ÿ‘ Image

Project details

Verified details

These details have been verified by PyPI
Owner

Unverified details

These details have not been verified by PyPI
Project links
Meta
  • License Expression: MIT
    SPDX License Expression
  • Author: BerriAI
  • Requires: Python <3.14, >=3.10
  • Provides-Extra: caching , extra-proxy , google , grpc , mlflow , proxy , proxy-runtime , semantic-router , stt-nvidia-riva , utils

Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

litellm-1.89.2.tar.gz (14.1 MB view details)

Uploaded Source

Built Distribution

Filter files by name, interpreter, ABI, and platform.

If you're not sure about the file name format, learn more about wheel file names.

Copy a direct link to the current filters

litellm-1.89.2-py3-none-any.whl (15.5 MB view details)

Uploaded Python 3

File details

Details for the file litellm-1.89.2.tar.gz.

File metadata

  • Download URL: litellm-1.89.2.tar.gz
  • Upload date:
  • Size: 14.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for litellm-1.89.2.tar.gz
Algorithm Hash digest
SHA256 b2534d69568eed026310f4e006407db2d46494eb629bd1e71eb9603ec146540d
MD5 944188d1adf07a26183032ba2d230c8f
BLAKE2b-256 c129865d38325f9c424daf0bcc7ab61908cf51a87862b3a0dfebd059b60dac97

See more details on using hashes here.

File details

Details for the file litellm-1.89.2-py3-none-any.whl.

File metadata

  • Download URL: litellm-1.89.2-py3-none-any.whl
  • Upload date:
  • Size: 15.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for litellm-1.89.2-py3-none-any.whl
Algorithm Hash digest
SHA256 07e8e43b1a70fe919021376742897d18ffe7577ccfbb84632c949670f9abdc03
MD5 02b02cd14a5aea0d46af41b4cb90c4b9
BLAKE2b-256 32c851c93e8d017e7af02600171b5b6cc3805b07507acb2b94de7235ce764015

See more details on using hashes here.

Supported by

๐Ÿ‘ Image
AWS Cloud computing and Security Sponsor ๐Ÿ‘ Image
Datadog Monitoring ๐Ÿ‘ Image
Depot Continuous Integration ๐Ÿ‘ Image
Fastly CDN ๐Ÿ‘ Image
Google Download Analytics ๐Ÿ‘ Image
Pingdom Monitoring ๐Ÿ‘ Image
Sentry Error logging ๐Ÿ‘ Image
StatusPage Status page