	# Streaming Responses & Async Completion

	- [Streaming Responses](#streaming-responses)
	- [Async Completion](#async-completion)
	- [Streaming Token Usage](#streaming-token-usage)

	## Streaming Responses
	LiteLLM streams the model response back chunk by chunk when you pass `stream=True` to the `completion` function.
	### Usage
	```python
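	from litellm import completion

	messages = [{"content": "Hello, how are you?", "role": "user"}]
	response = completion(model="gpt-3.5-turbo", messages=messages, stream=True)
	for chunk in response:
	    # Each chunk carries an incremental delta of the reply.
	    print(chunk['choices'][0]['delta'])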
	```
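Printing each delta is useful for inspection, but most callers want the full reply as one string. The following is a minimal sketch of that step, not part of LiteLLM itself: `collect_stream_text` and `fake_chunks` are hypothetical names, and the sketch assumes each chunk supports the dict-style access used above and that the text lives under a `content` key of the delta.

```python
def collect_stream_text(chunks):
    """Join the incremental `content` pieces of streamed chunks into one string."""
    parts = []
    for chunk in chunks:
        choices = chunk["choices"]
        if not choices:
            # Some chunks (e.g. a usage-only chunk) may carry no choices.
            continue
        # `content` can be missing or None, e.g. on the final chunk.
        piece = choices[0]["delta"].get("content")
        if piece:
            parts.append(piece)
    return "".join(parts)


# Hypothetical chunks, shaped like the ones printed above.
fake_chunks = [
    {"choices": [{"delta": {"role": "assistant", "content": "Hel"}}]},
    {"choices": [{"delta": {"content": "lo!"}}]},
    {"choices": [{"delta": {}}]},
]
print(collect_stream_text(fake_chunks))  # Hello!
```

With a real stream, the same helper could be called as `collect_stream_text(response)`, provided the chunk objects behave like the dictionaries assumed here.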

	## Async Completion
	LiteLLM provides `acompletion`, an asynchronous version of the `completion` function.
	### Usage
	```python
	from litellm import acompletion
	import asyncio

	async def test_get_response():
	    user_message = "Hello, how are you?"
	    messages = [{"content": user_message, "role": "user"}]
	    # Await the coroutine; the event loop stays free while the request is in flight.
	    response = await acompletion(model="gpt-3.5-turbo", messages=messages)
	    return response

	response = asyncio.run(test_get_response())
	print(response)
	```
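The main benefit of an async call is that several requests can be in flight at once. Here is a rough sketch of that pattern using `asyncio.gather`. The helper `gather_responses` and the stand-in `fake_acompletion` are hypothetical and exist only so the example runs without an API key; the sketch assumes `acompletion` accepts the `model` and `messages` keyword arguments shown above.

```python
import asyncio


async def gather_responses(complete, prompts, model="gpt-3.5-turbo"):
    """Send one request per prompt concurrently and return results in prompt order."""
    calls = [
        complete(model=model, messages=[{"content": p, "role": "user"}])
        for p in prompts
    ]
    # gather() runs the coroutines concurrently and preserves input order.
    return await asyncio.gather(*calls)


async def fake_acompletion(model, messages):
    # Hypothetical stand-in for acompletion, so the sketch needs no network.
    await asyncio.sleep(0)
    return f"echo: {messages[0]['content']}"


results = asyncio.run(gather_responses(fake_acompletion, ["Hi", "Bye"]))
print(results)  # ['echo: Hi', 'echo: Bye']
```

Passing `acompletion` in place of `fake_acompletion` should issue the real requests concurrently, under the assumption stated above.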

	## Streaming Token Usage

	Token usage reporting for streamed responses is supported across all providers and works the same way as in the OpenAI API. Pass the following argument:

	`stream_options={"include_usage": True}`

	If set, an additional chunk is streamed before the `data: [DONE]` message. The `usage` field on this chunk shows the token usage statistics for the entire request, and its `choices` field is always an empty array. All other chunks also include a `usage` field, but with a null value.

	### SDK
	```python
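	from litellm import completion
	import os

	os.environ["OPENAI_API_KEY"] = ""

	messages = [{"content": "Hello, how are you?", "role": "user"}]
	response = completion(model="gpt-3.5-turbo", messages=messages, stream=True, stream_options={"include_usage": True})
	for chunk in response:
	    # The final usage chunk has an empty `choices` list, so guard the index.
	    if chunk['choices']:
	        print(chunk['choices'][0]['delta'])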
	```
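To make the chunk layout described above concrete, the sketch below separates the reply text from the usage statistics. It is an illustration under stated assumptions, not LiteLLM's own API: `split_stream` and `fake_chunks` are hypothetical names, chunks are modeled as plain dictionaries, and the token counts are made-up values.

```python
def split_stream(chunks):
    """Return (text, usage) from chunks streamed with include_usage enabled."""
    parts, usage = [], None
    for chunk in chunks:
        # Only the extra final chunk carries a non-null `usage` value.
        if chunk.get("usage") is not None:
            usage = chunk["usage"]
        # The usage chunk has an empty `choices` list, so this loop skips it.
        # Assumes a single choice per chunk; text from several would interleave.
        for choice in chunk["choices"]:
            piece = choice["delta"].get("content")
            if piece:
                parts.append(piece)
    return "".join(parts), usage


# Hypothetical chunks following the layout described above.
fake_chunks = [
    {"choices": [{"delta": {"content": "Hi"}}], "usage": None},
    {"choices": [{"delta": {"content": " there"}}], "usage": None},
    {"choices": [], "usage": {"prompt_tokens": 9, "completion_tokens": 2, "total_tokens": 11}},
]
text, usage = split_stream(fake_chunks)
print(text)                   # Hi there
print(usage["total_tokens"])  # 11
```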

	### PROXY

	```bash
	curl https://0.0.0.0:4000/v1/chat/completions \
	  -H "Content-Type: application/json" \
	  -H "Authorization: Bearer $OPENAI_API_KEY" \
	  -d '{
	    "model": "gpt-4o",
	    "messages": [
	      {
	        "role": "system",
	        "content": "You are a helpful assistant."
	      },
	      {
	        "role": "user",
	        "content": "Hello!"
	      }
	    ],
	    "stream": true,
	    "stream_options": {"include_usage": true}
	  }'
	```
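A client reading the proxy response without the SDK has to decode the stream itself. The sketch below assumes the proxy replies in the OpenAI-style server-sent-event format implied by the `data: [DONE]` remark above, with one JSON chunk per `data:` line. `parse_sse_lines` and the `sample` lines are hypothetical and only illustrate the shape of the stream.

```python
import json


def parse_sse_lines(lines):
    """Yield decoded JSON chunks from `data: ...` lines until `data: [DONE]`."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators and non-data lines
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel, not JSON
        yield json.loads(payload)


# Hypothetical response lines; the token count is a made-up value.
sample = [
    'data: {"choices": [{"delta": {"content": "Hello"}}], "usage": null}',
    "",
    'data: {"choices": [], "usage": {"total_tokens": 11}}',
    "",
    "data: [DONE]",
]
chunks = list(parse_sse_lines(sample))
print(chunks[-1]["usage"])  # {'total_tokens': 11}
```

The last decoded chunk before the sentinel is the one that carries the usage statistics, with an empty `choices` array.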