🌐
Artificial Analysis
artificialanalysis.ai › models › deepseek-r1-qwen3-8b
DeepSeek R1 0528 Qwen3 8B - Intelligence, Performance & Price Analysis
Analysis of DeepSeek's DeepSeek R1 0528 Qwen3 8B and comparison to other AI models across key metrics including quality, price, performance (tokens per second & time to first token), context window & more.
Top answer
1 of 5
65
Those benchmarks are a meme. Artificial Analysis uses benchmarks established by other research groups, which are often old and overtrained, so they aren't reliable. They carefully show or hide models on the default list to paint a picture of bigger models doing better, but when you enable Qwen3 8B and 32B with reasoning to be shown, this all falls apart. It's nice enough to brag about a model on LinkedIn, and they are somewhat useful - they seem to be independent, and the image and video arenas are great - but they're not capable of maintaining leak-proof expert benchmarks.

Look at math reasoning:
DeepSeek R1 0528 (May '25) - 94
Qwen3 14B (reasoning) - 86
Qwen3 8B (reasoning) - 83
DeepSeek R1 (Jan '25) - 82
DeepSeek R1 0528 Qwen3 8B - 79
Claude 3.7 Sonnet (thinking) - 72

Overall bench (Intelligence Index):
DeepSeek R1 (Jan '25) - 60
Qwen3 32B (reasoning) - 59

Do you believe it makes sense for Qwen3 8B to score above DeepSeek R1, or for Claude 3.7 Sonnet to be outclassed by DeepSeek R1 0528 Qwen3 8B by a big margin?

Another bench - LiveCodeBench:
Qwen3 14B (reasoning) - 52
Claude 3.7 Sonnet (thinking) - 47

Why are devs using Claude 3.7/4 in Windsurf/Cursor/Roo/Cline/Aider and not Qwen3 14B? Qwen3 14B is apparently a much better coder lmao. I can't call it benchmark contamination, but it's definitely overfit to benchmarks. For god's sake, when you let base Qwen 2.5 32B non-Instruct generate random tokens from a trash prompt, it will often produce MMLU-style question-and-answer pairs on its own. It's trained to do well at the benchmarks they test on.
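The "trash prompt" probe the commenter describes is easy to reproduce. A minimal sketch, assuming the Qwen/Qwen2.5-32B base checkpoint on Hugging Face and enough VRAM to load it; the generation settings and the crude multiple-choice detection heuristic are illustrative assumptions, not from the comment:

```python
# Sample freely from the *base* (non-Instruct) model with a meaningless prompt
# and eyeball how often MMLU-style multiple-choice Q&A pairs emerge.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-32B"  # base model, per the comment
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

prompt = "asdf qwerty"  # deliberately meaningless "trash prompt"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
for _ in range(5):
    out = model.generate(**inputs, max_new_tokens=200, do_sample=True, temperature=1.0)
    text = tokenizer.decode(out[0], skip_special_tokens=True)
    # Crude heuristic: benchmark-style continuations tend to contain lettered options.
    flagged = all(opt in text for opt in ("A.", "B.", "C.", "D."))
    print("possible MMLU-style output" if flagged else "ordinary text", "|", text[:120])
```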
2 of 5
13
I really don't trust Artificial Analysis rankings these days, since they just aggregate other people's old benchmarks - and they still use SciCode or whatever, which is literally beyond saturated by now; all models score 99% on it.
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-0528-Qwen3-8B
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · Hugging Face
This model achieves state-of-the-art (SOTA) performance among open-source models on the AIME 2024, surpassing Qwen3 8B by +10.0% and matching the performance of Qwen3-235B-thinking.
🌐
LM Studio
lmstudio.ai › models › deepseek › deepseek-r1-0528-qwen3-8b
deepseek/deepseek-r1-0528-qwen3-8b
This model achieves state-of-the-art (SOTA) performance among open-source models on the AIME 2024, surpassing Qwen3 8B by +10.0% and matching the performance of Qwen3-235B-thinking.
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-0528
deepseek-ai/DeepSeek-R1-0528 · Hugging Face
This model achieves state-of-the-art (SOTA) performance among open-source models on the AIME 2024, surpassing Qwen3 8B by +10.0% and matching the performance of Qwen3-235B-thinking.
🌐
OpenRouter
openrouter.ai › deepseek › deepseek-r1-0528-qwen3-8b:free
DeepSeek R1 0528 Qwen3 8B - API, Providers, Stats | OpenRouter
The distilled variant, DeepSeek-R1-0528-Qwen3-8B, transfers this chain-of-thought into an 8 B-parameter form, beating standard Qwen3 8B by +10 pp and tying the 235 B “thinking” giant on AIME 2024.
🌐
Skywork
skywork.ai › home › models › deepseek: deepseek r1 0528 qwen3 8b free chat online
DeepSeek: DeepSeek R1 0528 Qwen3 8B Free Chat Online - Skywork ai
November 1, 2025 - The 8B model excels at code generation, debugging, explanation, and multi-language programming tasks. Its hybrid reasoning system is particularly effective for algorithmic problem-solving and complex code refactoring tasks that require deep ...
🌐
Artificial Analysis
artificialanalysis.ai › models › deepseek-r1-qwen3-8b › providers
DeepSeek R1 0528 Qwen3 8B: API Provider Performance Benchmarking & Price Analysis | Artificial Analysis
Analysis of API providers for DeepSeek R1 0528 Qwen3 8B across performance metrics including latency (time to first token), output speed (output tokens per second), price and others. API providers benchmarked include Novita.
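For reference, metrics of the kind this page reports - time to first token (TTFT) and output tokens per second - can be approximated against any OpenAI-compatible endpoint. A minimal sketch, assuming the openai Python client and an OpenRouter API key in the environment; the model slug is the one from the OpenRouter listing above, and counting streamed chunks only approximates tokens (a tokenizer would be more precise):

```python
import os, time
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

start = time.perf_counter()
first_token_at = None
chunks = []
stream = client.chat.completions.create(
    model="deepseek/deepseek-r1-0528-qwen3-8b:free",
    messages=[{"role": "user", "content": "Explain chain-of-thought distillation in one paragraph."}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        if first_token_at is None:
            first_token_at = time.perf_counter()  # first content received = TTFT
        chunks.append(delta)
elapsed = time.perf_counter() - start

ttft = first_token_at - start
print(f"TTFT: {ttft:.2f}s, output speed: {len(chunks) / (elapsed - ttft):.1f} chunks/s")
```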
🌐
Reddit
reddit.com › r/localllama › deepseek-r1-0528 official benchmarks released!!!
r/LocalLLaMA on Reddit: DeepSeek-R1-0528 Official Benchmarks Released!!!
May 29, 2025 - AIME 2024, surpassing Qwen3 8B by +10.0% and matching the performance of Qwen3-235B-thinking. We believe that the chain-of-thought from DeepSeek-R1-0528 will hold significant importance for both academic research on reasoning models and industrial ...
Find elsewhere
🌐
Ollama
ollama.com › library › deepseek-r1:8b-0528-qwen3-q4_K_M
deepseek-r1:8b-0528-qwen3-q4_K_M
In this update, DeepSeek R1 has significantly improved its reasoning and inference capabilities. The model has demonstrated outstanding performance across various benchmark evaluations, including mathematics, programming, and general logic.
🌐
Ollama
ollama.com › library › deepseek-r1:8b
deepseek-r1:8b
In this update, DeepSeek R1 has significantly improved its reasoning and inference capabilities. The model has demonstrated outstanding performance across various benchmark evaluations, including mathematics, programming, and general logic.
🌐
Hugging Face
huggingface.co › unsloth › DeepSeek-R1-0528-Qwen3-8B-GGUF
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF · Hugging Face
This model achieves state-of-the-art (SOTA) performance among open-source models on the AIME 2024, surpassing Qwen3 8B by +10.0% and matching the performance of Qwen3-235B-thinking.
🌐
Clarifai
clarifai.com › deepseek-ai › deepseek-chat › models › DeepSeek-R1-0528-Qwen3-8B
DeepSeek-R1-0528-Qwen3-8B model | Clarifai - The World's AI
DeepSeek-R1-0528 improves reasoning and logic via better computation and optimization, nearing the performance of top models like o3 and Gemini 2.5 Pro.
🌐
LM Studio
lmstudio.ai › home › models › deepseek-r1
deepseek-r1
This model achieves state-of-the-art (SOTA) performance among open-source models on the AIME 2024, surpassing Qwen3 8B by +10.0% and matching the performance of Qwen3-235B-thinking.
🌐
Cordatus
blog.cordatus.ai › home › deepseek-r1-0528-qwen3-8b: think like a 235b model
DeepSeek-R1-0528-Qwen3-8B: Think Like a 235B Model - Cordatus
July 25, 2025 - On the AIME 2024 benchmark, the model achieves state-of-the-art (SOTA) performance among all open-source models, surpassing the standard Qwen3 8B by an impressive +10.0% in accuracy.
🌐
Unsloth
unsloth.ai › blog › deepseek-r1-0528
How to Run Deepseek-R1-0528 Locally
The distill achieves the same performance as Qwen3 (235B). Qwen3 GGUF: DeepSeek-R1-0528-Qwen3-8B-GGUF. You can also fine-tune the Qwen3 model with Unsloth, and you can run the model using Unsloth's 1.78-bit Dynamic 2.0 GGUFs on your favorite inference frameworks. We quantized DeepSeek's R1 671B-parameter model from 720 GB down to 185 GB - a 75% size reduction.
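As a quick sanity check on the quoted figures, 720 GB down to 185 GB works out to roughly the claimed 75%:

```python
# Verify Unsloth's quoted size reduction for the 671B R1 model.
original_gb = 720   # full checkpoint size quoted by Unsloth
quantized_gb = 185  # 1.78-bit Dynamic 2.0 GGUF size quoted by Unsloth

reduction = 1 - quantized_gb / original_gb
print(f"{reduction:.1%}")  # -> 74.3%, i.e. roughly the "75% size reduction" claimed
```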
🌐
Runpod
runpod.io › home › blog › the 'minor upgrade' that’s anything but: deepseek r1 0528 deep dive
The 'Minor Upgrade' That’s Anything But: DeepSeek R1 0528 Deep Dive | Runpod Blog
Perhaps most impressively, the chain-of-thought from DeepSeek-R1-0528 was distilled to post-train Qwen3 8B Base, obtaining DeepSeek-R1-0528-Qwen3-8B. This model achieves state-of-the-art (SOTA) performance among open-source models on the AIME 2024, surpassing Qwen3 8B by +10.0% and matching ...
🌐
Ollama
ollama.com › sam860 › deepseek-r1-0528-qwen3:8b
sam860/deepseek-r1-0528-qwen3:8b
For benchmarks requiring sampling, the model uses:
- Temperature: 0.6
- Top-p: 0.95
- 16 responses per query to estimate pass@1
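Those settings match the common pass@k protocol: pass@1 from n samples is the fraction of correct completions, which is the k=1 special case of the unbiased pass@k estimator from the HumanEval paper. A minimal sketch of that estimator; the 9-of-16 correct count is invented purely for illustration:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Probability that at least one of k draws from n samples (c correct) is correct."""
    if n - c < k:
        return 1.0  # too few incorrect samples to fill k draws: guaranteed hit
    return 1.0 - comb(n - c, k) / comb(n, k)

# Example: 16 responses per query at temperature 0.6 / top-p 0.95, 9 correct.
print(pass_at_k(n=16, c=9, k=1))  # -> 0.5625, i.e. c/n for k=1
```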
🌐
Reddit
reddit.com › r/localllama › deepseek-r1-0528-qwen3-8b
r/LocalLLaMA on Reddit: DeepSeek-R1-0528-Qwen3-8B
April 10, 2025 - (Edit: source: https://livebench.ai/#/) But I got the impression QwQ 32B is worth trying, because based on https://fiction.live/stories/Fiction-liveBench-Feb-21-2025/oQdzQvKHw8JyXbN87 it performs almost at the level of DeepSeek R1 at larger context; it is even better than DeepSeek R1 0528 at larger context... You should try it yourself and compare the results for your use case. ... I don't see the R1 Qwen3 8B distill on that site, and neither do I find Data Analysis... so I am not sure what you are talking about here? ... The site I gave the link to is the fiction benchmark; it basically tests coherence at various context lengths.