LiteLLM Proxy CLI
The LiteLLM Proxy CLI is a command-line tool for managing your LiteLLM proxy server. It provides commands for managing models, credentials, API keys, and users, as well as for running chat completions and making direct HTTP requests against the server.
Installation
pip install 'litellm[proxy]'
Configuration
The CLI can be configured using environment variables or command-line options:
- LITELLM_PROXY_URL: Base URL of the LiteLLM proxy server (default: http://localhost:4000)
- LITELLM_PROXY_API_KEY: API key for authentication
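For example, to point the CLI at a remote proxy (the URL and key below are placeholders):
export LITELLM_PROXY_URL=https://litellm.example.com
export LITELLM_PROXY_API_KEY=sk-your-key
litellm-proxy models list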
Global Options
- --version, -v: Print the LiteLLM Proxy client and server version and exit.
Example:
litellm-proxy version
# or
litellm-proxy --version
# or
litellm-proxy -v
Commands
Models Management
The CLI provides several commands for managing models on your LiteLLM proxy server:
List Models
View all available models:
litellm-proxy models list [--format table|json]
Options:
- --format: Output format (table or json, default: table)
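Example:
litellm-proxy models list --format json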
Model Information
Get detailed information about all models:
litellm-proxy models info [options]
Options:
- --format: Output format (table or json, default: table)
- --columns: Comma-separated list of columns to display. Valid columns: public_model, upstream_model, credential_name, created_at, updated_at, id, input_cost, output_cost. Default columns: public_model, upstream_model, updated_at.
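Example, selecting a subset of the valid columns above:
litellm-proxy models info --columns public_model,input_cost,output_cost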
Add Model
Add a new model to the proxy:
litellm-proxy models add <model-name> [options]
Options:
- --param, -p: Model parameters in key=value format (can be specified multiple times)
- --info, -i: Model info in key=value format (can be specified multiple times)
Example:
litellm-proxy models add gpt-4 -p api_key=sk-123 -p api_base=https://api.openai.com -i description="GPT-4 model"
Get Model Info
Get information about a specific model:
litellm-proxy models get [--id MODEL_ID] [--name MODEL_NAME]
Options:
- --id: ID of the model to retrieve
- --name: Name of the model to retrieve
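Example (model-123 is a placeholder; real IDs come from models list or models info):
litellm-proxy models get --name gpt-4
# or
litellm-proxy models get --id model-123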
Delete Model
Delete a model from the proxy:
litellm-proxy models delete <model-id>
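Example, where model-123 is a placeholder ID:
litellm-proxy models delete model-123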
Update Model
Update an existing model's configuration:
litellm-proxy models update <model-id> [options]
Options:
- --param, -p: Model parameters in key=value format (can be specified multiple times)
- --info, -i: Model info in key=value format (can be specified multiple times)
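Example, using a placeholder model ID:
litellm-proxy models update model-123 -p temperature=0.7 -i description="Updated model"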
Import Models
Import models from a YAML file:
litellm-proxy models import models.yaml
Options:
- --dry-run: Show what would be imported without making any changes.
- --only-models-matching-regex <regex>: Only import models where litellm_params.model matches the given regex.
- --only-access-groups-matching-regex <regex>: Only import models where at least one item in model_info.access_groups matches the given regex.
Examples:
- Import all models from a YAML file:
litellm-proxy models import models.yaml
- Dry run (show what would be imported):
litellm-proxy models import models.yaml --dry-run
- Only import models where the model name contains 'gpt':
litellm-proxy models import models.yaml --only-models-matching-regex gpt
- Only import models with access group containing 'beta':
litellm-proxy models import models.yaml --only-access-groups-matching-regex beta
- Combine both filters:
litellm-proxy models import models.yaml --only-models-matching-regex gpt --only-access-groups-matching-regex beta
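For reference, here is a minimal models.yaml sketch in the standard LiteLLM model_list format (model names, keys, and access groups below are placeholders) that would match both filters in the combined example above:
model_list:
  - model_name: gpt-4-beta
    litellm_params:
      model: openai/gpt-4
      api_key: sk-123
    model_info:
      access_groups: ["beta-testers"]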
Credentials Management
The CLI provides commands for managing credentials on your LiteLLM proxy server:
List Credentials
View all available credentials:
litellm-proxy credentials list [--format table|json]
Options:
- --format: Output format (table or json, default: table)
The table format displays:
- Credential Name
- Custom LLM Provider
Create Credential
Create a new credential:
litellm-proxy credentials create <credential-name> --info <json-string> --values <json-string>
Options:
- --info: JSON string containing credential info (e.g., custom_llm_provider)
- --values: JSON string containing credential values (e.g., api_key)
Example:
litellm-proxy credentials create azure-cred \
--info '{"custom_llm_provider": "azure"}' \
--values '{"api_key": "sk-123", "api_base": "https://example.azure.openai.com"}'
Get Credential
Get information about a specific credential:
litellm-proxy credentials get <credential-name>
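Example, using the credential created above:
litellm-proxy credentials get azure-cred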
Delete Credential
Delete a credential:
litellm-proxy credentials delete <credential-name>
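Example:
litellm-proxy credentials delete azure-cred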
Keys Management
The CLI provides commands for managing API keys on your LiteLLM proxy server:
List Keys
View all API keys:
litellm-proxy keys list [--format table|json] [options]
Options:
- --format: Output format (table or json, default: table)
- --page: Page number for pagination
- --size: Number of items per page
- --user-id: Filter keys by user ID
- --team-id: Filter keys by team ID
- --organization-id: Filter keys by organization ID
- --key-hash: Filter by specific key hash
- --key-alias: Filter by key alias
- --return-full-object: Return the full key object
- --include-team-keys: Include team keys in the response
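Example, combining pagination with a team filter (team123 is a placeholder):
litellm-proxy keys list --team-id team123 --page 1 --size 10 --format json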
Generate Key
Generate a new API key:
litellm-proxy keys generate [options]
Options:
- --models: Comma-separated list of allowed models
- --aliases: JSON string of model alias mappings
- --spend: Maximum spend limit for this key
- --duration: Duration for which the key is valid (e.g. '24h', '7d')
- --key-alias: Alias/name for the key
- --team-id: Team ID to associate the key with
- --user-id: User ID to associate the key with
- --budget-id: Budget ID to associate the key with
- --config: JSON string of additional configuration parameters
Example:
litellm-proxy keys generate --models gpt-4,gpt-3.5-turbo --spend 100 --duration 24h --key-alias my-key --team-id team123
Delete Keys
Delete API keys by key or alias:
litellm-proxy keys delete [--keys <comma-separated-keys>] [--key-aliases <comma-separated-aliases>]
Options:
- --keys: Comma-separated list of API keys to delete
- --key-aliases: Comma-separated list of key aliases to delete
Example:
litellm-proxy keys delete --keys sk-key1,sk-key2 --key-aliases alias1,alias2
Get Key Info
Get information about a specific API key:
litellm-proxy keys info --key <key-hash>
Options:
- --key: The key hash to get information about
Example:
litellm-proxy keys info --key sk-key1
User Management
The CLI provides commands for managing users on your LiteLLM proxy server:
List Users
View all users:
litellm-proxy users list
Get User Info
Get information about a specific user:
litellm-proxy users get --id <user-id>
Create User
Create a new user:
litellm-proxy users create --email [email protected] --role internal_user --alias "Alice" --team team1 --max-budget 100.0
Delete User
Delete one or more users by user_id:
litellm-proxy users delete <user-id-1> <user-id-2>
Chat Commands
The CLI provides commands for interacting with chat models through your LiteLLM proxy server:
Chat Completions
Create a chat completion:
litellm-proxy chat completions <model> [options]
Arguments:
- model: The model to use (e.g., gpt-4, claude-2)
Options:
- --message, -m: Messages in 'role:content' format. Can be specified multiple times to create a conversation.
- --temperature, -t: Sampling temperature between 0 and 2
- --top-p: Nucleus sampling parameter between 0 and 1
- --n: Number of completions to generate
- --max-tokens: Maximum number of tokens to generate
- --presence-penalty: Presence penalty between -2.0 and 2.0
- --frequency-penalty: Frequency penalty between -2.0 and 2.0
- --user: Unique identifier for the end user
Examples:
- Simple completion:
litellm-proxy chat completions gpt-4 -m "user:Hello, how are you?"
- Multi-message conversation:
litellm-proxy chat completions gpt-4 \
-m "system:You are a helpful assistant" \
-m "user:What's the capital of France?" \
-m "assistant:The capital of France is Paris." \
-m "user:What's its population?"
- With generation parameters:
litellm-proxy chat completions gpt-4 \
-m "user:Write a story" \
--temperature 0.7 \
--max-tokens 500 \
--top-p 0.9
HTTP Commands
The CLI provides commands for making direct HTTP requests to your LiteLLM proxy server:
Make HTTP Request
Make an HTTP request to any endpoint:
litellm-proxy http request <method> <uri> [options]
Arguments:
- method: HTTP method (GET, POST, PUT, DELETE, etc.)
- uri: URI path (will be appended to base_url)
Options:
- --data, -d: Data to send in the request body (as JSON string)
- --json, -j: JSON data to send in the request body (as JSON string)
- --header, -H: HTTP headers in 'key:value' format. Can be specified multiple times.
Examples:
- List models:
litellm-proxy http request GET /models
- Create a chat completion:
litellm-proxy http request POST /chat/completions -j '{"model": "gpt-4", "messages": [{"role": "user", "content": "Hello"}]}'
- Test connection with custom headers:
litellm-proxy http request GET /health/test_connection -H "X-Custom-Header:value"
Environment Variables
The CLI respects the following environment variables:
- LITELLM_PROXY_URL: Base URL of the proxy server
- LITELLM_PROXY_API_KEY: API key for authentication
Examples
- List all models in table format:
litellm-proxy models list
- Add a new model with parameters:
litellm-proxy models add gpt-4 -p api_key=sk-123 -p max_tokens=2048
- Get model information in JSON format:
litellm-proxy models info --format json
- Update model parameters:
litellm-proxy models update model-123 -p temperature=0.7 -i description="Updated model"
- List all credentials in table format:
litellm-proxy credentials list
- Create a new credential for Azure:
litellm-proxy credentials create azure-prod \
--info '{"custom_llm_provider": "azure"}' \
--values '{"api_key": "sk-123", "api_base": "https://prod.azure.openai.com"}'
- Make a custom HTTP request:
litellm-proxy http request POST /chat/completions \
-j '{"model": "gpt-4", "messages": [{"role": "user", "content": "Hello"}]}' \
-H "X-Custom-Header:value"
- User management:
# List users
litellm-proxy users list
# Get user info
litellm-proxy users get --id u1
# Create a user
litellm-proxy users create --email [email protected] --role internal_user --alias "Alice" --team team1 --max-budget 100.0
# Delete users
litellm-proxy users delete u1 u2
- Import models from a YAML file (with filters):
# Only import models where the model name contains 'gpt'
litellm-proxy models import models.yaml --only-models-matching-regex gpt
# Only import models with access group containing 'beta'
litellm-proxy models import models.yaml --only-access-groups-matching-regex beta
# Combine both filters
litellm-proxy models import models.yaml --only-models-matching-regex gpt --only-access-groups-matching-regex beta
Error Handling
The CLI will display appropriate error messages when:
- The proxy server is not accessible
- Authentication fails
- Invalid parameters are provided
- The requested model or credential doesn't exist
- Invalid JSON is provided for credential creation
- Any other operation fails
For detailed debugging, use the --debug flag with any command.
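For example (appending the flag to an ordinary command; exact flag placement may vary by version):
litellm-proxy models list --debug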