Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1
deepseek-ai/DeepSeek-R1 · Hugging Face
DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...
Videos
03:09
Deploying Deepseek R1 on Hugging Face - YouTube
05:51
How to Run the Deepseek R1 LLM on Windows (7b mini) #ollama ...
43:26
Hugging Face Journal Club - DeepSeek R1 - YouTube
13:11
ArrrZero: Why DeepSeek R1 is less important than R1-Zero - YouTube
25:36
DeepSeek R1 Theory Overview | GRPO + RL + SFT - YouTube
03:31
DeepSeek-R1 vs DeepSeek-R1-Zero - YouTube
GitHub
github.com › deepseek-ai › DeepSeek-R1
GitHub - deepseek-ai/DeepSeek-R1
DeepSeek-R1-Zero · 671B · 37B · 128K · 🤗 HuggingFace · DeepSeek-R1 · 671B · 37B · 128K · 🤗 HuggingFace · DeepSeek-R1-Zero & DeepSeek-R1 are trained based on DeepSeek-V3-Base. For more details regarding the model architecture, please refer to DeepSeek-V3 repository.
Starred by 91.6K users
Forked by 11.8K users
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-Zero › tree › main
deepseek-ai/DeepSeek-R1-Zero at main
DeepSeek-R1-Zero · 689 GB · 5 contributors History: 27 commits · msr2000 · Small fix 7223428 9 months ago · figures · Release DeepSeek-R1 11 months ago · .gitattributes 1.52 kB · initial commit 11 months ago · LICENSE 1.06 kB · Release DeepSeek-R1 11 months ago ·
GitHub
github.com › huggingface › open-r1
GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1
Starred by 25.7K users
Forked by 2.4K users
Languages Python 89.4% | Shell 10.0% | Makefile 0.6%
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-Zero › commit › cfa4541c2f577dafd72f51f2b14119bde0f3c812
Release DeepSeek-R1 · deepseek-ai/DeepSeek-R1-Zero at cfa4541
deepseek_v3 · conversational · custom_code · text-generation-inference · fp8 · arXiv: 2501.12948 · License: mit · Model card Files Files and versions · xet Community · 24 · Train · Deploy · Use this model · msr2000 commited on Jan 20 · Commit · cfa4541 ·
Unsloth
unsloth.ai › blog › deepseek-r1
Run Deepseek-R1 / R1 Zero
DeepSeek also distilled from R1 and fine-tuned it on Llama 3 and Qwen 2.5 models, meaning you can now also fine-tune the models out of the box with Unsloth. See our collection for all versions of the R1 model series including GGUF's, 4-bit and more! huggingface.co/collections/unsloth/deepseek-r1 Jan 27, 2025 update: We've released 1.58-bit Dynamic GGUFs for DeepSeek-R1 allowing you to run R1 even better with a 80% size reduction: 1.58-bit Dynamic R1 Feb 6, 2025 update: You can now train your own reasoning model like R1 using: GRPO + Unsloth
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-Zero › tree › 945a40ef65921ea791b3d8232f7a8bb8c35a67f0 › figures
deepseek-ai/DeepSeek-R1-Zero at 945a40ef65921ea791b3d8232f7a8bb8c35a67f0
DeepSeek-R1-Zero / figures · Ctrl+K · Ctrl+K · 5 contributors History: 1 commit · msr2000 · Release DeepSeek-R1 cfa4541 5 months ago · benchmark.jpg · Safe 777 kB ·
Reddit
reddit.com › r/localllama › deepseek r1 / r1 zero
r/LocalLLaMA on Reddit: Deepseek R1 / R1 Zero
January 20, 2025 - When asked for its name in chat.deepseek.com, it says DeepSeek R1. ... I use it from Open-WebUI via OpenRouter. Continue this thread ... Continue this thread Continue this thread Continue this thread Continue this thread Continue this thread ... There seems to be distilled versions as well: https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B