Groq

Verified Truly Free

LPU Inference Engine.


Overview

Provider Type

API

API Endpoint

https://api.groq.com/openai/v1
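
The `/openai/v1` path suggests an OpenAI-style REST layout. The following is a minimal sketch, assuming the chat route is `/chat/completions` under that base URL and that the key is sent as a bearer token. `build_chat_request` is a hypothetical helper; it only builds the request and sends nothing over the network.

```python
import json
import urllib.request

BASE_URL = "https://api.groq.com/openai/v1"


def build_chat_request(api_key, model, prompt):
    """Build (but do not send) a chat completion request.

    Assumption: the chat route is BASE_URL + "/chat/completions" and the
    key is passed as a bearer token, as in OpenAI-style APIs.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=BASE_URL + "/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": "Bearer " + api_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_chat_request("YOUR_API_KEY", "llama-3.1-8b-instant", "Hello")
print(request.full_url)
```

To actually send it, the request object can be passed to `urllib.request.urlopen(request)` once a real key is in place.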

Free Tier Highlights

30 RPM, 14.4k RPD

Why Choose Groq?

Groq runs open-weight models such as Llama, Qwen, and GPT-OSS on its LPU Inference Engine and serves them through a single API endpoint. The free tier covers every model listed below, each with its own request and token limits, and a basic chat request takes only a few lines of Python.

Available Models

Model Name | ID | Free Tier Limits | Capabilities
Allam 2 7B | allam-2-7b-instruct | 7 000 Requests per day, 6 000 Tokens per minute | -
Llama 3.1 8B | llama-3.1-8b-instant | 14 400 Requests per day, 6 000 Tokens per minute | -
Llama 3.3 70B | llama-3.3-70b-versatile | 1 000 Requests per day, 12 000 Tokens per minute | -
Llama 4 Maverick 17B | llama-4-maverick-17b-128e-instruct | 1 000 Requests per day, 6 000 Tokens per minute | -
Llama 4 Scout | llama-4-scout-instruct | 1 000 Requests per day, 30 000 Tokens per minute | -
Whisper Large v3 | whisper-large-v3 | 2 000 Requests per day, 7 200 audio seconds per minute | -
Whisper Large v3 Turbo | whisper-large-v3-turbo | 2 000 Requests per day, 7 200 audio seconds per minute | -
Groq Compound | groq/compound | 250 Requests per day, 70 000 Tokens per minute | -
Groq Compound Mini | groq/compound-mini | 250 Requests per day, 70 000 Tokens per minute | -
Llama Guard 4 12B | meta-llama/llama-guard-4-12b | 14 400 Requests per day, 15 000 Tokens per minute | -
Moonshot Kimi K2 | moonshotai/kimi-k2-instruct | 1 000 Requests per day, 10 000 Tokens per minute | -
Moonshot Kimi K2 0905 | moonshotai/kimi-k2-instruct-0905 | 1 000 Requests per day, 10 000 Tokens per minute | -
GPT-OSS 120B | openai/gpt-oss-120b | 1 000 Requests per day, 8 000 Tokens per minute | -
GPT-OSS 20B | openai/gpt-oss-20b | 1 000 Requests per day, 8 000 Tokens per minute | -
GPT-OSS Safeguard 20B | openai/gpt-oss-safeguard-20b | 1 000 Requests per day, 8 000 Tokens per minute | -
Qwen3 32B | qwen/qwen3-32b | 1 000 Requests per day, 6 000 Tokens per minute | -
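
Because the limits differ per model, it can help to check a planned workload against them before picking one. The sketch below copies a handful of rows from the model list above into a dictionary; `FREE_TIER_LIMITS` and `models_that_fit` are hypothetical names, and the figures should be re-checked against the provider's current limits.

```python
# Free-tier limits copied from the model list: (requests/day, tokens/minute).
# Only a few text models are included to keep the sketch short.
FREE_TIER_LIMITS = {
    "llama-3.1-8b-instant": (14_400, 6_000),
    "llama-3.3-70b-versatile": (1_000, 12_000),
    "groq/compound": (250, 70_000),
    "openai/gpt-oss-120b": (1_000, 8_000),
    "qwen/qwen3-32b": (1_000, 6_000),
}


def models_that_fit(requests_per_day, tokens_per_minute):
    """Return model IDs whose free-tier limits cover the given workload."""
    return sorted(
        model_id
        for model_id, (rpd, tpm) in FREE_TIER_LIMITS.items()
        if rpd >= requests_per_day and tpm >= tokens_per_minute
    )


# A workload of 800 requests/day that peaks at 7 000 tokens/minute.
print(models_that_fit(requests_per_day=800, tokens_per_minute=7_000))
# ['llama-3.3-70b-versatile', 'openai/gpt-oss-120b']
```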

Integration Examples

Ready-to-use code snippets for your applications.

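main.py
import os
from groq import Groq

# Read the key from the GROQ_API_KEY environment variable
# instead of hardcoding it in source.
client = Groq(
    api_key=os.environ.get("GROQ_API_KEY"),
)

# Send a single-turn chat request to one of the free models.
chat_completion = client.chat.completions.create(
    messages=[
        {
            "role": "user",
            "content": "Explain the importance of fast language models",
        }
    ],
    model="allam-2-7b-instruct",
)

# Print the text of the first returned choice.
print(chat_completion.choices[0].message.content)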
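
On a rate-limited free tier, a request like the one above will occasionally be rejected. One common way to cope is to retry with exponential backoff. The helper below is a generic sketch, not part of the Groq client: `call_with_backoff` and `is_rate_limited` are hypothetical names, and how a rate-limit error is recognised is left to the caller.

```python
import random
import time


def call_with_backoff(call, is_rate_limited, max_attempts=5, base_delay=1.0,
                      sleep=time.sleep):
    """Call `call()`; on a rate-limit error, wait and retry with backoff.

    `is_rate_limited(exc)` decides whether an exception is a rate-limit
    error. Any other exception, or a failure on the final attempt, is
    re-raised unchanged.
    """
    for attempt in range(max_attempts):
        try:
            return call()
        except Exception as exc:
            last_attempt = attempt == max_attempts - 1
            if last_attempt or not is_rate_limited(exc):
                raise
            # Delay doubles each attempt (1s, 2s, 4s, ...) plus up to 10% jitter.
            delay = base_delay * (2 ** attempt)
            sleep(delay + random.uniform(0, delay * 0.1))
```

A possible use is `call_with_backoff(lambda: client.chat.completions.create(...), is_rate_limited=lambda exc: getattr(exc, "status_code", None) == 429)`. That predicate assumes the client's errors carry an HTTP `status_code` attribute, which should be confirmed against the client library's documentation.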

Free Tier Pricing & Limits

Rate Limit

Requests per minute

30 Requests per minute
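
Staying under a per-minute cap is easier when the client paces itself rather than waiting for rejections. Here is a minimal sliding-window sketch, assuming the limit is counted over any 60-second span (the page does not say how the window is measured). `RequestThrottle` is a hypothetical class, not part of any Groq library.

```python
import time
from collections import deque


class RequestThrottle:
    """Sliding-window limiter: at most `max_requests` per `window` seconds."""

    def __init__(self, max_requests=30, window=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_requests = max_requests
        self.window = window
        self.clock = clock
        self.sleep = sleep
        self.sent = deque()  # timestamps of requests inside the current window

    def wait(self):
        """Block until another request may be sent, then record it."""
        now = self.clock()
        # Forget requests that have already left the window.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()
        if len(self.sent) >= self.max_requests:
            # Sleep until the oldest recorded request leaves the window.
            self.sleep(self.window - (now - self.sent[0]))
            now = self.clock()
            self.sent.popleft()
        self.sent.append(now)
```

Calling `throttle.wait()` immediately before each API request keeps a single process under the cap; several processes sharing one key would need a shared counter instead.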

Daily Quota

Requests per day

14 400 Requests per day
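
The per-minute and per-day figures interact: sending at the full per-minute rate uses up the daily quota well before the day ends. A short worked calculation using only the two headline numbers on this page:

```python
REQUESTS_PER_MINUTE = 30
REQUESTS_PER_DAY = 14_400
MINUTES_PER_DAY = 24 * 60

# Rate that can be sustained around the clock without exhausting the daily quota.
sustained_rpm = REQUESTS_PER_DAY / MINUTES_PER_DAY

# Hours until the daily quota runs out when sending at the full per-minute limit.
hours_at_full_rate = REQUESTS_PER_DAY / REQUESTS_PER_MINUTE / 60

print(sustained_rpm)       # 10.0
print(hours_at_full_rate)  # 8.0
```

Models with a lower daily quota in the model list reach that point sooner.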

Token Limit

Tokens per minute

40 000 Tokens per minute (Varies by model)
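
For many workloads the token cap, not the request cap, is what binds. The sketch below estimates the achievable request rate when both apply. `effective_requests_per_minute` is a hypothetical helper, and it assumes each request's token cost (prompt plus completion) is roughly constant and counts in full against the per-minute token limit.

```python
def effective_requests_per_minute(tokens_per_minute, tokens_per_request,
                                  requests_per_minute=30):
    """Requests per minute allowed once both the RPM and TPM caps apply."""
    if tokens_per_request <= 0:
        raise ValueError("tokens_per_request must be positive")
    return min(requests_per_minute, tokens_per_minute // tokens_per_request)


# With a 6 000 tokens/minute model and ~500 tokens per request,
# the token cap allows fewer requests than the 30 RPM cap.
print(effective_requests_per_minute(6_000, 500))   # 12

# With a 70 000 tokens/minute model, the 30 RPM cap binds instead.
print(effective_requests_per_minute(70_000, 500))  # 30
```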

Monthly Quota

Per month limit

Free Forever

