Overview
Provider Type
APIAPI Endpoint
https://api-inference.huggingface.co/models
Free Tier Highlights
Why Choose Hugging Face Inference?
Hugging Face Inference stands out for its transparent, open-source approach. With a developer-friendly API and comprehensive documentation, you can integrate AI capabilities into your applications within minutes.
Quick Start Guide
Create Account
Get Access Token
Pick a Model
Available Models
| Model Name | ID | Context | Capabilities |
|---|---|---|---|
| Llama 3.2 11B Vision Free |
meta-llama/Llama-3.2-11B-Vision-Instruct
|
128 000 |
Text
Vision
|
| Llama 3.1 8B Instruct Free |
meta-llama/Meta-Llama-3.1-8B-Instruct
|
128 000 |
- |
| Qwen 2.5 72B Instruct Free |
Qwen/Qwen2.5-72B-Instruct
|
32 000 |
- |
| Gemma 2 9B Instruct Free |
google/gemma-2-9b-it
|
8 000 |
- |
| Flux.1 Dev Free |
black-forest-labs/FLUX.1-dev
|
Image |
- |
Integration Examples
Ready-to-use code snippets for your applications.
Select Model
Free Tier Pricing & Limits
Rate Limit
Requests per minute
Daily Quota
Requests per day
Token Limit
Tokens per minute
Monthly Quota
Per month limit
Use Cases
Prototyping & Testing
Learning NLP / ML
Lightweight Apps
Hackathons
Model Evaluation
Limitations & Considerations
Rate limited to ~300 request/hour for free users
Models larger than 10GB may not load
Cold starts can occur
No SLA on free tier
Community Hub
LiveJoin the discussion, share tips, and rate Hugging Face Inference.
Quick Reactions
Add Discussion
Comments are moderated. Be helpful and respectful.