Groq

Free Quotas

LPU Inference Engine.

16
Free Models
Community Verified
Truly Free Community Pick

Rate Limit

Maximum requests per time period
30 Requests / min, 14,400 Requests / day

Max Context

Maximum tokens the model can process
1,000 Requests / day, 6,000 Tokens / min

Cost

$0

.00

FREE Credit card not required

Quick Start

API Endpoint

https://api.groq.com/openai/v1

Why Choose Groq?

  • Free Quotas - No hidden costs or credit card required
  • Instant Access - Get started in seconds with API key
  • Generous Limits - Perfect for development & production
  • 16 Models - Choose the perfect model for your task

Example: Making Your First Request

import os
from groq import Groq

client = Groq(
    api_key="YOUR_API_KEY",
)

chat_completion = client.chat.completions.create(
    messages=[
        {
            "role": "user",
            "content": "Explain the importance of fast language models",
        }
    ],
    model="allam-2-7b-instruct",
)

print(chat_completion.choices[0].message.content)

Free Models

Model Name ID Context Capabilities
Allam 2 7B
allam-2-7b-instruct
7,000 Requests / day, 6,000 Tokens / min
Llama 3.1 8B
llama-3.1-8b-instant
14,400 Requests / day, 6,000 Tokens / min
Llama 3.3 70B
llama-3.3-70b-versatile
1,000 Requests / day, 12,000 Tokens / min
Llama 4 Maverick 17B
llama-4-maverick-17b-128e-instruct
1,000 Requests / day, 6,000 Tokens / min
Llama 4 Scout
llama-4-scout-instruct
1,000 Requests / day, 30,000 Tokens / min
Whisper Large v3
whisper-large-v3
7,200 audio-sec/min, 2,000 Requests / day
Whisper Large v3 Turbo
whisper-large-v3-turbo
7,200 audio-sec/min, 2,000 Requests / day
Groq Compound
groq/compound
250 Requests / day, 70,000 Tokens / min
Groq Compound Mini
groq/compound-mini
250 Requests / day, 70,000 Tokens / min
Llama Guard 4 12B
meta-llama/llama-guard-4-12b
14,400 Requests / day, 15,000 Tokens / min
Moonshot Kimi K2
moonshotai/kimi-k2-instruct
1,000 Requests / day, 10,000 Tokens / min
Moonshot Kimi K2 0905
moonshotai/kimi-k2-instruct-0905
1,000 Requests / day, 10,000 Tokens / min
GPT-OSS 120B
openai/gpt-oss-120b
1,000 Requests / day, 8,000 Tokens / min
GPT-OSS 20B
openai/gpt-oss-20b
1,000 Requests / day, 8,000 Tokens / min
GPT-OSS Safeguard 20B
openai/gpt-oss-safeguard-20b
1,000 Requests / day, 8,000 Tokens / min
Qwen3 32B
qwen/qwen3-32b
1,000 Requests / day, 6,000 Tokens / min

Current Limits

Request Limits

  • Rate Limit
    30 Requests / min, 14,400 Requests / day
  • Daily Limit
    14,400 Requests/Day

Token Limits

Throughput

40,000 Tokens / min (Varies by model)

Monthly Cap

Free Forever

Key Features

LPU Accelerated
Extremely Fast Inference
Open Source Models
API Access

Known Limitations

  • Test

Frequently Asked Questions

How do I get an API key?

Getting your Groq API key is simple:

  1. Visit the official Groq website
  2. Sign up for a free account (no credit card required)
  3. Navigate to the API section in your dashboard
  4. Generate your API key and start using it immediately
Are there free quotas?

Yes! Groq offers a generous free tier with no credit card required. Check the provider's website for current terms and conditions regarding free tier availability and duration.

What happens if I exceed the free limits?

If you reach the rate limit, your requests will be temporarily throttled. The limits reset automatically after the specified time period. You can always monitor your usage in the provider's dashboard.

Is my data secure?

Groq takes security seriously. All API communications are encrypted via HTTPS. Check their privacy policy and terms of service for detailed information about data handling and retention policies.

Start Building with Groq Today

Join thousands of developers using Groq to build amazing AI-powered applications. Get started in less than 2 minutes.

Credit card not required
Setup in 2 minutes
Forever free tier
Get API Key