Skip to main content
gradually.ai logogradually.ai
  • Blog
  • About Us
AI Newsletter
AI Newsletter
  1. Home
  2. AI Blog

The 50 Best Open Source LLMs (and How to Use Them)

Discover the best open source LLMs of 2026, their licenses, and how to run free LLMs locally on your own computer.

FHFinn Hillebrandt
April 27, 2026
Auf Deutsch lesen
AI Technology
The 50 Best Open Source LLMs (and How to Use Them)
𝕏XShare on XFacebookShare on FacebookLinkedInShare on LinkedInPinterestShare on PinterestThreadsShare on ThreadsFlipboardShare on Flipboard
Links marked with * are affiliate links. If a purchase is made through such links, we receive a commission.

Open source LLMs are one of the most important AI trends of 2026.

And for good reason:

Open source models were long significantly weaker than proprietary models. But by spring 2026 they have caught up, especially out of Chinese labs:

DeepSeek V4 Pro (released April 24, 2026), GLM-5.1 from Z.ai, Kimi K2.6 from Moonshot AI, and Qwen3.5 from Alibaba can compete with the best proprietary LLMs like Claude Opus 4.7, GPT-5.5, or Gemini 3.1 Pro, and even beat them on specific benchmarks like SWE-Bench Pro and HumanEval.

In this article, you'll find an overview of the 50 best open source LLMs as of April 2026 with their key benchmark scores and licenses.

Additionally, I'll show you how to easily and freely use open LLMs on your own computer (without needing to program or use the terminal).

TL;DRKey Takeaways
  • DeepSeek V4 Pro (1.6T MoE, MIT, April 2026), Kimi K2.6 (1T MoE), and GLM-5.1 from Z.ai lead the April 2026 rankings, with GLM-5.1 topping SWE-Bench Pro at 58.4%
  • 50 open source LLMs available with various licenses, from MIT and Apache 2.0 through to restricted research-only licenses
  • Chinese labs (DeepSeek, Moonshot AI, Z.ai, Alibaba) hold most top positions; the 2025 leaders (GPT-OSS-120B, DeepSeek R1, Qwen3-235B, Llama 4) are still solid but no longer at the top
  • Local usage possible with tools like Ollama, LM Studio, or GPT4All, but the new top models need serious hardware (multi-GPU or quantized variants for consumer rigs)

Open Source LLMs Compared

#
Model
MMLU
Math
Code
Developer
License
1DeepSeek V4 Pro (1.6T MoE)87.5%90.1%93.5%DeepSeekMIT
2Kimi K2.6 (1T MoE)84.6%90.5%92.0%Moonshot AIModified MIT
3GLM-5.1 (754B MoE)91.7%85.7%58.4%Z.aiMIT
4GLM-5 Reasoning (744B MoE)96.0%94.0%94.2%Z.aiMIT
5Kimi K2.5 (1T MoE)92.0%87.6%99.0%Moonshot AIModified MIT
6DeepSeek V4 Flash (284B MoE)83.0%85.0%88.0%DeepSeekMIT
7DeepSeek V3.2 (671B MoE)85.0%79.9%89.3%DeepSeekMIT
8GPT-OSS-120B (117B MoE)90.0%80.1%96.6%OpenAIApache 2.0
9DeepSeek-R1 (671B MoE)90.8%97.3%71.5%DeepSeekMIT
10Qwen3-235B-A22B-Thinking87.0%92.3%74.1%AlibabaApache 2.0
11Llama 4 Maverick (400B MoE)80.5%69.8%43.4%MetaLlama 4 Community
12Kimi K2 (1T MoE)97.4%71.6%53.7%Moonshot AIMIT
13DeepSeek-V3 (671B MoE)88.5%90.2%85.0%DeepSeekMIT
14GPT-OSS-20B (20B MoE)85.3%96.0%69.0%OpenAIApache 2.0
15Llama 3.3 70B Instruct86.0%77.3%83.0%MetaLlama 3.3 Community
16Qwen2.5-72B-Instruct85.3%82.3%82.0%AlibabaQwen License
17Llama 3.1 405B Instruct88.6%81.1%73.8%MetaLlama 3.1 Community
18Gemma 3 27B67.5%42.4%69.0%GoogleGemma Terms of Use
19Command R+ (104B)88.2%85.0%92.0%CohereCC BY-NC-4.0
20Llama-3.1-Nemotron-70B85.0%57.6%8.98NVIDIALlama 3.1 Community
21Mixtral-8x22B (141B MoE)77.8%68.0%75.0%Mistral AIApache 2.0
22Mistral Large 2 (123B)84.0%76.9%82.0%Mistral AIMistral Research License
23Phi-4 (14B)56.1%82.6%80.4%MicrosoftMIT
24Qwen3-32B-Instruct83.5%77.0%78.0%AlibabaApache 2.0
25OLMo 2 32B74.0%78.6%84.0%Allen InstituteApache 2.0
26DBRX (132B MoE)73.7%70.1%66.9%DatabricksDatabricks Open Model
27DeepSeek Coder V2 (236B MoE)78.5%90.2%76.2%DeepSeekMIT
28Llama 3.1 70B Instruct79.3%68.0%80.5%MetaLlama 3.1 Community
29Yi-34B76.3%67.6%85.0%01.AIApache 2.0
30Falcon 3 10B73.1%42.5%58.0%TIIFalcon License
31Qwen2.5-32B-Instruct83.1%75.5%78.9%AlibabaApache 2.0
32Mistral NeMo 12B68.0%83.5%76.8%Mistral AI / NVIDIAApache 2.0
33InternLM3 8B-Instruct72.3%75.0%75.6%Shanghai AI LabApache 2.0
34Granite Code 34B75.4%68.3%67.5%IBMApache 2.0
35Falcon 180B70.4%85.3%77.6%TIIFalcon License
36WizardLM-2 8x22B77.2%83.0%73.2%MicrosoftApache 2.0
37Qwen2-72B-Instruct84.2%89.5%64.6%AlibabaApache 2.0
38Mixtral-8x7B (46.7B MoE)70.6%74.4%40.2%Mistral AIApache 2.0
39Llama 3.1 8B Instruct68.4%84.5%72.6%MetaLlama 3.1 Community
40Gemma 3 8B70.9%77.9%56.0%GoogleGemma Terms of Use
41Code Llama 70B Instruct62.0%67.8%62.0%MetaLlama 2 Community
42Falcon 3 7B67.4%39.2%70.8%TIIFalcon License
43SOLAR 10.7B v1.066.0%69.9%71.0%UpstageApache 2.0
44Mistral 7B v0.362.5%52.2%83.0%Mistral AIApache 2.0
45Yi-1.5 34B76.8%80.1%75.0%01.AIApache 2.0
46OLMo 2 13B68.2%71.4%82.1%Allen InstituteApache 2.0
47StarCoder2 15B46.0%36.6%49.6%BigCodeBigCode Open RAIL-M v1
48Phi-3 Medium (14B)78.0%91.0%62.2%MicrosoftMIT
49InternLM2-Chat-20B67.0%79.6%67.1%Shanghai AI LabApache 2.0
50DeepSeek LLM 67B71.3%63.4%40.0%DeepSeekDeepSeek License

Benchmark score color coding:

ExcellentTop tier
GoodAbove average
AverageSolid
PoorBelow average

1. Key Benchmarks Explained

To objectively compare open source LLMs, I use three central benchmark categories:

MMLU / MMLU-Pro: The Massive Multitask Language Understanding Benchmark tests general knowledge across 57 subjects (STEM, social sciences, humanities). MMLU-Pro is the more challenging variant with less contamination. Top models score 85-90% here.

MATH / GPQA: These benchmarks test mathematical and scientific reasoning. MATH-500 contains challenging math problems, while GPQA (Graduate-Level Physics Questions Answers) tests expert knowledge in biology, physics, and chemistry. Top models score 70-97% here.

HumanEval / LiveCodeBench: These benchmarks test code generation. HumanEval contains Python programming tasks, LiveCodeBench tests code performance with current, uncontaminated tasks. Top models score 60-90% here.

The table shows three benchmark scores for each model, which vary depending on the model's strengths (e.g., code-focused models have higher HumanEval scores).

2. Top Models of April 2026

DeepSeek V4 Pro (released April 24, 2026) is the new leader. The 1.6 trillion parameter MoE activates only 49B per token, scores 87.5% on MMLU-Pro, 90.1% on GPQA Diamond, and 93.5% on LiveCodeBench. Same MIT license as the rest of the DeepSeek lineup, and it ships with native 1M-token context at roughly 27% of the inference FLOPs of V3.2.

Kimi K2.6 from Moonshot AI is the second-best open weight overall: 92% on HumanEval, 90.5% on GPQA Diamond, 96.4% on AIME 2026, with a 256K context window and native video input. Modified MIT license, 1T parameters MoE.

GLM-5.1 from Z.ai (formerly Zhipu) tops SWE-Bench Pro with 58.4%, beating GPT-5.4 (57.7%) and Claude Opus 4.6 (57.3%). The 754B-parameter MoE was trained entirely on Huawei Ascend chips and ships under the MIT license. The reasoning sibling, GLM-5, hits 96% on MMLU and 94% on GPQA, the highest knowledge scores in the open-source space.

Kimi K2.5 still posts the highest HumanEval score on any leaderboard (99.0) and leads on MATH-500 (98.0). It is the best open weight purely for code generation when latency matters less than peak quality.

DeepSeek V4 Flash (284B / 13B active) is the cost-efficient sibling of V4 Pro and the most practical choice when you want frontier-class quality on a single high-end GPU.

The previous generation is still very usable: GPT-OSS-120B (OpenAI's first open-weight model since GPT-2), DeepSeek R1, Qwen3-235B-A22B-Thinking, and Llama 4 Maverick all remain strong, just no longer state-of-the-art.

3. LLM Licenses Explained

Here's an overview of the most commonly used licenses for open source LLMs.

Warning
Note: Please always review the current license terms of LLMs yourself before using them. License conditions can change at any time.

MIT License

A very permissive open source license, similar to Apache 2.0. It allows unrestricted use, modification, and distribution of the LLM, including in proprietary programs, as long as the copyright notice is retained. DeepSeek V3 uses MIT with some restrictions for military use.

Llama 2 Community / Llama 3 Community

Meta released Llama 2 and Llama 3 under these licenses. They allow free use of the LLMs for research and commercial applications with up to 700 million monthly active users. The source code and model weights are freely available.

Qwen License / Qianwen LICENSE

Qwen models are released under various licenses. While smaller models are often licensed under Apache 2.0, larger models like Qwen2.5-72B have special license terms that allow commercial use with certain restrictions.

Apache 2.0

A very permissive open source license with minimal restrictions. It allows use, modification, and distribution of the LLM, including in proprietary programs, as long as the copyright notice is retained. It contains no copyleft clause.

CC BY-NC-4.0

A Creative Commons license that allows editing and sharing the LLM in any form, but not for commercial purposes. The author's name must be credited.

CC BY-NC-SA-4.0

Similar to CC BY-NC-4.0, but with the additional Share-Alike condition. This means forks or modified versions of an LLM must be distributed under the same conditions.

Non-Commercial

Here, using the LLM for commercial purposes is prohibited. However, what exactly counts as "commercial" is not always clearly defined or delimited.

Usually, "non-commercial" models are only released for research purposes or private use.

4. Using Open Source LLMs Locally on Your Own Computer

Using open source LLMs locally on your own computer is easier than you might think:

1. Download LM Studio

Download LM Studio from the website. It's free and available for Mac, Windows, and Linux:

LM Studio

2. Install and Open LM Studio

Next, install LM Studio on your computer and open it.

3. Download Your Desired Open Source LLMs

Now you need to download the open source LLMs you want to use in LM Studio.

Many popular LLMs are already on the home screen. To download an LLM, simply click the blue download button:

Download open source LLMs

To find specific open source LLMs, you can also use the search function:

Search open source LLMs

4. Important: Check System Requirements Before Downloading

Before downloading an LLM, you should check the system requirements.

Llama 3, for example, requires more than 8 GB RAM and 4.92 GB of free storage:

Open source LLM system requirements

5. Chat with the Open Source LLM

After downloading an open source LLM, you can use it directly in LM Studio.

Simply click on the speech bubble icon (?) in the left sidebar.

The user interface and settings options are reminiscent of the OpenAI Playground:

Chat with open source LLM

Frequently Asked Questions About Open Source LLMs

𝕏XShare on XFacebookShare on FacebookLinkedInShare on LinkedInPinterestShare on PinterestThreadsShare on ThreadsFlipboardShare on Flipboard
FH

Finn Hillebrandt

AI Expert & Blogger

Finn Hillebrandt is the founder of Gradually AI, an SEO and AI expert. He helps online entrepreneurs simplify and automate their processes and marketing with AI. Finn shares his knowledge here on the blog in 50+ articles as well as through his ChatGPT Course and the AI Business Club.

Learn more about Finn and the team, follow Finn on LinkedIn, join his Facebook group for ChatGPT, OpenAI & AI Tools or do like 17,500+ others and subscribe to his AI Newsletter with tips, news and offers about AI tools and online business. Also visit his other blog, Blogmojo, which is about WordPress, blogging and SEO.

Similar Articles

The 9 Best AI Image Generation Models in 2026
AI Technology

The 9 Best AI Image Generation Models in 2026

May 21, 2026
FHFinn Hillebrandt
ChatGPT Statistics 2026: Fascinating Numbers, Data & Facts
AI Technology

ChatGPT Statistics 2026: Fascinating Numbers, Data & Facts

May 21, 2026
FHFinn Hillebrandt
ChatGPT Versions: All 34 GPT Models at a Glance
AI Technology

ChatGPT Versions: All 34 GPT Models at a Glance

May 21, 2026
FHFinn Hillebrandt
Claude Statistics 2026: Numbers, Data & Facts About Anthropic
AI Technology

Claude Statistics 2026: Numbers, Data & Facts About Anthropic

May 21, 2026
FHFinn Hillebrandt
DeepSeek Statistics 2026: Key Numbers, Data & Facts
AI Technology

DeepSeek Statistics 2026: Key Numbers, Data & Facts

May 21, 2026
FHFinn Hillebrandt
Gemini Models: All Google Models at a Glance
AI Technology

Gemini Models: All Google Models at a Glance

May 21, 2026
FHFinn Hillebrandt

Stay Updated with the AI Newsletter

Get the latest AI tools, tutorials, and exclusive tips delivered to your inbox weekly

Unsubscribe anytime. About 4 to 8 emails per month. Consent includes notes on revocation, service provider, and statistics according to our Privacy Policy.

gradually.ai logogradually.ai

Germany's leading platform for AI tools and knowledge for online entrepreneurs.

AI Tools

  • AI Chat
  • ChatGPT in German
  • Text Generator
  • Prompt Enhancer
  • Prompt Link Generator
  • FLUX AI Image Generator
  • AI Art Generator
  • Midjourney Prompt Generator
  • Veo 3 Prompt Generator
  • AI Humanizer
  • AI Text Detector
  • Gemini Watermark Remover
  • All Tools →

Creative Tools

  • Blog Name Generator
  • AI Book Title Generator
  • Song Lyrics Generator
  • Artist Name Generator
  • Team Name Generator
  • AI Mindmap Generator
  • Headline Generator
  • Company Name Generator
  • AI Slogan Generator
  • Brand Name Generator
  • Newsletter Name Generator
  • YouTube Channel Name Generator

Business Tools

  • API Cost Calculator
  • Token Counter
  • AI Ad Generator
  • AI Copy Generator
  • Essay Generator
  • Story Generator
  • AI Rewrite Generator
  • Blog Post Generator
  • Meta Description Generator
  • AI Email Generator
  • Email Subject Line Generator
  • Instagram Bio Generator
  • AI Hashtag Generator

Resources

  • Claude Code MCP Servers
  • Claude Code Skills
  • n8n Hosting Comparison
  • OpenClaw Hosting Comparison
  • Claude Code Plugins
  • Claude Code Use Cases
  • Claude Cowork Use Cases
  • OpenClaw Use Cases
  • Changelogs

© 2026 Gradually AI. All rights reserved.

  • Blog
  • About Us
  • Legal Notice
  • Privacy Policy