DeepInfra

Verified Truly Free

Cost-effective inference platform with $5 free credits on signup. Hosts 40+ open-source models with OpenAI-compatible API. Known for reliable uptime and competitive pricing after credits.

$5 Credits OpenAI Compatible 40+ Models Reliable
Get API Key Suggest Edit
0

Overview

Provider Type

Trial Credits

API Endpoint

https://api.deepinfra.com/v1/openai

Free Tier Highlights

60 RPM (varies by model)

Why Choose DeepInfra?

DeepInfra stands out for its unique features and capabilities. With a developer-friendly API and comprehensive documentation, you can integrate AI capabilities into your applications within minutes.

Quick Start Guide

1

Visit https://deepinfra.com/

2

Sign up with email or GitHub

3

Get $5 free credit automatically

4

Navigate to API Keys

5

Generate an API key

6

Use with OpenAI SDK (change base_url)

Available Models

Model Name ID Context Capabilities
Llama 3.1 405B Instruct
meta-llama/Meta-Llama-3.1-405B-Instruct
128 000
Reasoning
Llama 3.1 70B Instruct
meta-llama/Meta-Llama-3.1-70B-Instruct
128 000
-
Mixtral 8x22B Instruct
mistralai/Mixtral-8x22B-Instruct-v0.1
64 000
Reasoning Multilingual
Qwen 2.5 72B Instruct
Qwen/Qwen2.5-72B-Instruct
32 000
Chinese
DeepSeek V3
deepseek-ai/DeepSeek-V3
64 000
Reasoning

Integration Examples

Ready-to-use code snippets for your applications.

main.py
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPINFRA_KEY",
    base_url="https://api.deepinfra.com/v1/openai"
)

response = client.chat.completions.create(
    model="meta-llama/Meta-Llama-3.1-405B-Instruct",
    messages=[
        {"role": "user", "content": "Explain the MoE architecture"}
    ]
)

print(response.choices[0].message.content)

Free Tier Pricing & Limits

Rate Limit

Requests per minute

60 Requests per minute (varies by model)

Daily Quota

Requests per day

Credit-based (no daily cap)

Token Limit

Tokens per minute

$5 = ~5 000 000 tokens (varies by model)

Monthly Quota

Per month limit

One-time $5 credit

Free Credits

One-time (90 days expiry)

$5

Use Cases

Cost-sensitive production applications

Model comparison and evaluation

High-volume text generation

Code generation pipelines

RAG applications

Limitations & Considerations

$5 credit is one-time only

Credits expire after 90 days

Rate limits vary by model

Billing required for continued use

Free tier has lower request priority

Community Hub

Live

Join the discussion, share tips, and rate DeepInfra.

Quick Reactions

Add Discussion

Comments are moderated. Be helpful and respectful.

Recent Activity

0 comments

Ready to Get Started?

Join thousands of developers using DeepInfra

Start Building Now