Voozh

https://developers.cloudflare.com/workers-ai/models/text-generation/

API Key

# env variable
os.environ['CLOUDFLARE_API_KEY']="3dnSGlxxxx"
os.environ['CLOUDFLARE_ACCOUNT_ID']="03xxxxx"

Sample Usage

from litellm import completion
import os

os.environ['CLOUDFLARE_API_KEY']="3dnSGlxxxx"
os.environ['CLOUDFLARE_ACCOUNT_ID']="03xxxxx"

response = completion(
 model="cloudflare/@cf/meta/llama-2-7b-chat-int8",
 messages=[
{"role":"user","content":"hello from litellm"}
],
)
print(response)

Sample Usage - Streaming

from litellm import completion
import os

os.environ['CLOUDFLARE_API_KEY']="3dnSGlxxxx"
os.environ['CLOUDFLARE_ACCOUNT_ID']="03xxxxx"

response = completion(
 model="cloudflare/@hf/thebloke/codellama-7b-instruct-awq",
 messages=[
{"role":"user","content":"hello from litellm"}
],
 stream=True
)

for chunk in response:
print(chunk)

Supported Models

All models listed here https://developers.cloudflare.com/workers-ai/models/text-generation/ are supported

Model Name	Function Call
@cf/meta/llama-2-7b-chat-fp16	`completion(model="mistral/mistral-tiny", messages)`
@cf/meta/llama-2-7b-chat-int8	`completion(model="mistral/mistral-small", messages)`
@cf/mistral/mistral-7b-instruct-v0.1	`completion(model="mistral/mistral-medium", messages)`
@hf/thebloke/codellama-7b-instruct-awq	`completion(model="codellama/codellama-medium", messages)`

URL: https://docs.litellm.ai/docs/providers/cloudflare_workers

⇱ Cloudflare Workers AI | liteLLM

API Key

Sample Usage

Sample Usage - Streaming

Supported Models

URL: https://docs.litellm.ai/docs/providers/cloudflare_workers

⇱ Cloudflare Workers AI | liteLLM

API Key​

Sample Usage​

Sample Usage - Streaming​

Supported Models​

API Key

Sample Usage

Sample Usage - Streaming

Supported Models