Cloudflare Workers AI

Verified Truly Free

Run AI models on Cloudflare's global network. Workers AI gives you a generous free tier of 10,000 neurons per day across dozens of open-source models including Llama, Mistral, and more. No credit card required.

Free Tier Edge Computing Global Network No Credit Card
Get API Key Suggest Edit
0

Overview

Provider Type

API

API Endpoint

https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run/

Free Tier Highlights

Varies by model

Why Choose Cloudflare Workers AI?

Cloudflare Workers AI stands out for its unique features and capabilities. With a developer-friendly API and comprehensive documentation, you can integrate AI capabilities into your applications within minutes.

Quick Start Guide

1

Sign up at https://dash.cloudflare.com/

2

Go to AI > Workers AI

3

Get your Account ID from the dashboard

4

Create an API Token with Workers AI permissions

5

Use the REST API or Workers bindings

Available Models

Model Name ID Context Capabilities
Llama 3.1 8B Instruct Free
@cf/meta/llama-3.1-8b-instruct
128 000
Reasoning
Llama 3.2 3B Instruct Free
@cf/meta/llama-3.2-3b-instruct
128 000
-
Mistral 7B Instruct v0.2 Free
@hf/mistral/mistral-7b-instruct-v0.2
32 000
Multilingual
Qwen 1.5 7B Chat Free
@cf/qwen/qwen1.5-7b-chat-awq
32 000
Chinese
DeepSeek Coder 6.7B Free
@hf/thebloke/deepseek-coder-6.7b-instruct-awq
16 000
Code
Phi-2 Free
@cf/microsoft/phi-2
2 000
Reasoning

Integration Examples

Ready-to-use code snippets for your applications.

main.py
import requests

ACCOUNT_ID = "YOUR_ACCOUNT_ID"
API_TOKEN = "YOUR_API_TOKEN"

response = requests.post(
    f"https://api.cloudflare.com/client/v4/accounts/{ACCOUNT_ID}/ai/run/@cf/meta/llama-3.1-8b-instruct",
    headers={"Authorization": f"Bearer {API_TOKEN}"},
    json={
        "messages": [
            {"role": "user", "content": "What is Workers AI?"}
        ]
    }
)

print(response.json()["result"]["response"])

Free Tier Pricing & Limits

Rate Limit

Requests per minute

Varies by model

Daily Quota

Requests per day

10000 neurons/day

Token Limit

Tokens per minute

Included in neuron budget

Monthly Quota

Per month limit

~300000 neurons/month

Use Cases

Serverless AI applications

Edge-first AI products

Low-latency global chatbots

Image classification and generation

Content moderation at the edge

Limitations & Considerations

10,000 neurons/day cap (varies per model)

Larger models consume more neurons per request

No fine-tuning support

Some models have limited context windows

Beta models may change

Community Hub

Live

Join the discussion, share tips, and rate Cloudflare Workers AI.

Quick Reactions

Add Discussion

Comments are moderated. Be helpful and respectful.

Recent Activity

0 comments

Ready to Get Started?

Join thousands of developers using Cloudflare Workers AI

Start Building Now