Rate Limit
Max Context
Cost
$0
.00
Quick Start
API Endpoint
https://api.groq.com/openai/v1
Why Choose Groq?
-
Free Quotas - No hidden costs or credit card required
-
Instant Access - Get started in seconds with API key
-
Generous Limits - Perfect for development & production
-
16 Models - Choose the perfect model for your task
Example: Making Your First Request
import os
from groq import Groq
client = Groq(
api_key="YOUR_API_KEY",
)
chat_completion = client.chat.completions.create(
messages=[
{
"role": "user",
"content": "Explain the importance of fast language models",
}
],
model="allam-2-7b-instruct",
)
print(chat_completion.choices[0].message.content)
Free Models
| Model Name | ID | Context | Capabilities |
|---|---|---|---|
| Allam 2 7B |
allam-2-7b-instruct
|
7,000 Requests / day, 6,000 Tokens / min |
|
| Llama 3.1 8B |
llama-3.1-8b-instant
|
14,400 Requests / day, 6,000 Tokens / min |
|
| Llama 3.3 70B |
llama-3.3-70b-versatile
|
1,000 Requests / day, 12,000 Tokens / min |
|
| Llama 4 Maverick 17B |
llama-4-maverick-17b-128e-instruct
|
1,000 Requests / day, 6,000 Tokens / min |
|
| Llama 4 Scout |
llama-4-scout-instruct
|
1,000 Requests / day, 30,000 Tokens / min |
|
| Whisper Large v3 |
whisper-large-v3
|
7,200 audio-sec/min, 2,000 Requests / day |
|
| Whisper Large v3 Turbo |
whisper-large-v3-turbo
|
7,200 audio-sec/min, 2,000 Requests / day |
|
| Groq Compound |
groq/compound
|
250 Requests / day, 70,000 Tokens / min |
|
| Groq Compound Mini |
groq/compound-mini
|
250 Requests / day, 70,000 Tokens / min |
|
| Llama Guard 4 12B |
meta-llama/llama-guard-4-12b
|
14,400 Requests / day, 15,000 Tokens / min |
|
| Moonshot Kimi K2 |
moonshotai/kimi-k2-instruct
|
1,000 Requests / day, 10,000 Tokens / min |
|
| Moonshot Kimi K2 0905 |
moonshotai/kimi-k2-instruct-0905
|
1,000 Requests / day, 10,000 Tokens / min |
|
| GPT-OSS 120B |
openai/gpt-oss-120b
|
1,000 Requests / day, 8,000 Tokens / min |
|
| GPT-OSS 20B |
openai/gpt-oss-20b
|
1,000 Requests / day, 8,000 Tokens / min |
|
| GPT-OSS Safeguard 20B |
openai/gpt-oss-safeguard-20b
|
1,000 Requests / day, 8,000 Tokens / min |
|
| Qwen3 32B |
qwen/qwen3-32b
|
1,000 Requests / day, 6,000 Tokens / min |
|
Current Limits
Request Limits
-
Rate Limit30 Requests / min, 14,400 Requests / day
-
Daily Limit14,400 Requests/Day
Token Limits
Throughput
Monthly Cap
Key Features
Known Limitations
- Test
Frequently Asked Questions
How do I get an API key?
Getting your Groq API key is simple:
- Visit the official Groq website
- Sign up for a free account (no credit card required)
- Navigate to the API section in your dashboard
- Generate your API key and start using it immediately
Are there free quotas?
Yes! Groq offers a generous free tier with no credit card required. Check the provider's website for current terms and conditions regarding free tier availability and duration.
What happens if I exceed the free limits?
If you reach the rate limit, your requests will be temporarily throttled. The limits reset automatically after the specified time period. You can always monitor your usage in the provider's dashboard.
Is my data secure?
Groq takes security seriously. All API communications are encrypted via HTTPS. Check their privacy policy and terms of service for detailed information about data handling and retention policies.
Start Building with Groq Today
Join thousands of developers using Groq to build amazing AI-powered applications. Get started in less than 2 minutes.