Battle of the Models
Compare specific LLM models, context windows, and capabilities.
GPT-OSS 120B
Groq
Intelligence Score
65/100
Model Popularity
0 votes
Context Window
1k RPD, 8k TPM
Pricing Model
Free / Open
Phi-4
A-TIERGitHub Models
Intelligence Score
89/100
Context Window
128K
Pricing Model
Free / Open
Model Popularity
0 votes
FINAL VERDICT
Phi-4 Wins
With an intelligence score of 89/100 vs 65/100, Phi-4 outperforms GPT-OSS 120B by 24 points.
Clear Winner: Significant performance advantage for Phi-4.
HEAD-TO-HEAD
Detailed Comparison
| Feature |
GPT-OSS 120B
|
Phi-4
|
|---|---|---|
|
Context Window
|
1k RPD, 8k TPM | 128K |
|
Architecture
|
Transformer (Proprietary) | Transformer |
|
Est. MMLU Score
|
~60-64% | ~80-84% |
|
Release Date
|
2024 | Dec 2024 |
|
Pricing Model
|
Free Tier | Free Tier |
|
Rate Limit (RPM)
|
30 RPM, 14.4k RPD | Varies by Copilot Tier |
|
Daily Limit
|
14,400 Requests/Day | Low |
|
Capabilities
|
No specific data
|
Reasoning
|
|
Performance Tier
|
C-Tier (Good) | A-Tier (Excellent) |
|
Speed Estimate
|
Medium | Medium |
|
Primary Use Case
|
General Purpose | General Purpose |
|
Model Size
|
120B | Undisclosed |
|
Limitations
|
|
|
|
Key Strengths
|
|
|
Similar Comparisons
GPT-OSS 120B
vs
GPT-4o
Phi-4
vs
GPT-4o
GPT-OSS 120B
vs
Llama 3.3 70B Instruct
Phi-4
vs
Llama 3.3 70B Instruct
GPT-OSS 120B
vs
Mistral Large (24.11)
Phi-4
vs
Mistral Large (24.11)
Phi-4
vs
Cohere Command R+
Phi-4
vs
AI21 Jamba 1.5 Large
Phi-4
vs
Llama Guard 4 12B
Phi-4
vs
Moonshot Kimi K2
Phi-4
vs
Moonshot Kimi K2 0905
Phi-4
vs
GPT-OSS 20B