Overview
Provider Type
APIAPI Endpoint
https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run/
Free Tier Highlights
Why Choose Cloudflare Workers AI?
Cloudflare Workers AI stands out for its unique features and capabilities. With a developer-friendly API and comprehensive documentation, you can integrate AI capabilities into your applications within minutes.
Quick Start Guide
Sign up at https://dash.cloudflare.com/
Go to AI > Workers AI
Get your Account ID from the dashboard
Create an API Token with Workers AI permissions
Use the REST API or Workers bindings
Available Models
| Model Name | ID | Context | Capabilities |
|---|---|---|---|
| Llama 3.1 8B Instruct Free |
@cf/meta/llama-3.1-8b-instruct
|
128 000 |
Reasoning
|
| Llama 3.2 3B Instruct Free |
@cf/meta/llama-3.2-3b-instruct
|
128 000 |
- |
| Mistral 7B Instruct v0.2 Free |
@hf/mistral/mistral-7b-instruct-v0.2
|
32 000 |
Multilingual
|
| Qwen 1.5 7B Chat Free |
@cf/qwen/qwen1.5-7b-chat-awq
|
32 000 |
Chinese
|
| DeepSeek Coder 6.7B Free |
@hf/thebloke/deepseek-coder-6.7b-instruct-awq
|
16 000 |
Code
|
| Phi-2 Free |
@cf/microsoft/phi-2
|
2 000 |
Reasoning
|
Integration Examples
Ready-to-use code snippets for your applications.
Select Model
Free Tier Pricing & Limits
Rate Limit
Requests per minute
Daily Quota
Requests per day
Token Limit
Tokens per minute
Monthly Quota
Per month limit
Use Cases
Serverless AI applications
Edge-first AI products
Low-latency global chatbots
Image classification and generation
Content moderation at the edge
Limitations & Considerations
10,000 neurons/day cap (varies per model)
Larger models consume more neurons per request
No fine-tuning support
Some models have limited context windows
Beta models may change
Community Hub
LiveJoin the discussion, share tips, and rate Cloudflare Workers AI.
Quick Reactions
Add Discussion
Comments are moderated. Be helpful and respectful.