🌐
GitHub
github.com › deepseek-ai › DeepSeek-R1
GitHub - deepseek-ai/DeepSeek-R1
DeepSeek-R1 achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks. To support the research community, we have open-sourced DeepSeek-R1-Zero, DeepSeek-R1, and six dense models distilled from DeepSeek-R1 based on ...
Starred by 91.6K users
Forked by 11.8K users
🌐
GitHub
github.blog › home › changelogs › deepseek-r1 is now available in github models (public preview)
DeepSeek-R1 is now available in GitHub Models (Public Preview) - GitHub Changelog
April 30, 2025 - The latest trending AI model DeepSeek-R1 is now available in GitHub Models. DeepSeek-R1 is a 671B parameter AI model designed to enhance deep learning, natural language processing, and computer vision
🌐
GitHub
github.com › huggingface › open-r1
GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1
We release Mixture-of-Thoughts--a curated reasoning dataset of 350k verified traces distilled from R1. The dataset spans tasks in mathematics, coding, and science, and is designed to teach language models to reason step-by-step. We also provide a recipe to train OpenR1-Distill-7B, which replicates the reasoning capabilities of deepseek-ai/DeepSeek-R1-Distill-Qwen-7B and marks the completion of step 1 in the Open R1 project.
Starred by 25.7K users
Forked by 2.4K users
Languages   Python 89.4% | Shell 10.0% | Makefile 0.6%
🌐
DeepSeek
api-docs.deepseek.com › deepseek-r1 release 2025/01/20
DeepSeek-R1 Release | DeepSeek API Docs
🛠️ DeepSeek-R1: Technical Highlights · 📈 Large-scale RL in post-training · 🏆 Significant performance boost with minimal labeled data · 🔢 Math, code, and reasoning tasks on par with OpenAI-o1 · 📄 More details: https://gi...
🌐
GitHub
github.com › iamgmujtaba › deepseek-r1
GitHub - iamgmujtaba/deepseek-r1: DeepSeek-R1: WebGPU-based Local Reasoning Model
Everything runs locally with no data sent to servers, ensuring privacy and performance. Built with 🤗 Transformers.js and ONNX Runtime Web, it’s lightweight, offline-capable, and blazing-fast at 60 tokens per second.
Starred by 4 users
Forked by 3 users
Languages   JavaScript 85.3% | CSS 8.0% | Shell 5.6% | HTML 1.1%
🌐
GitHub
github.com › deepseek-ai › DeepSeek-R1 › blob › main › DeepSeek_R1.pdf
DeepSeek-R1/DeepSeek_R1.pdf at main · deepseek-ai/DeepSeek-R1
deepseek-ai / DeepSeek-R1 Public · Fork 11.8k · Star 91.6k
Author   deepseek-ai
🌐
GitHub
github.com › echohive42 › deepdeek-r1-experiments
GitHub - echohive42/deepdeek-r1-experiments: DeepSeek & o1 auto coders which write Python code from a simple description, then iteratively improve it and fix errors
Choose between DeepSeek R1 or OpenAI O1 models · ⚠️ EXECUTE CODE ON YOUR SYSTEM · Automatically test generated code · Detect and fix runtime errors · 5-second timeout for each execution · Include error correction agent · Only save ...
Starred by 95 users
Forked by 16 users
Languages   Python
🌐
GitHub
github.com › deepseek-ai
DeepSeek · GitHub
DeepSeek-R1 (Public) · 91.6k stars, 11.8k forks · awesome-deepseek-integration (Public) · Integrate the DeepSeek API into popular software · 34.8k stars, 3.9k forks · DeepSeek-VL2 (Public) · DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
🌐
GitHub
github.com › DeepSeek-R1-Fully-Open-Reproduction
DeepSeek-R1: Fully Open Reproduction · GitHub
DeepSeek-R1 is an open-source project by Hugging Face that replicates the R1 pipeline for model training, evaluation, and synthetic data generation. This initiative empowers researchers and developers to reproduce and extend cutting-edge AI ...
🌐
Microsoft Azure
azure.microsoft.com › blog home › ai + machine learning › deepseek r1 is now available on azure ai foundry and github
DeepSeek R1 is now available on Azure AI Foundry and GitHub | Microsoft Azure Blog
March 26, 2025 - Read the GitHub Models blog post. Customers will be able to use distilled flavors of the DeepSeek R1 model to run locally on their Copilot+ PCs.
🌐
GitHub
github.com › FareedKhan-dev › train-deepseek-r1
GitHub - FareedKhan-dev/train-deepseek-r1: Building DeepSeek R1 from Scratch
We are using the same thinking prompt template that DeepSeek uses for the GRPO algorithm to build R1 Zero, so let’s define that: # DeepSeek system prompt for GRPO based training SYSTEM_PROMPT = ( f"""A conversation between User and Assistant. The user asks a question, and the Assistant solves it.
Starred by 730 users
Forked by 118 users
Languages   Jupyter Notebook
🌐
GitHub
github.com › topics › deepseek-r1
deepseek-r1 · GitHub Topics · GitHub
A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.
🌐
GitHub
github.com › qunash › r1-overthinker
GitHub - qunash/r1-overthinker: Force DeepSeek r1 models to think for as long as you wish
Using this app you can force DeepSeek R1 models to think more deeply by extending their reasoning process.
Starred by 372 users
Forked by 23 users
Languages   Jupyter Notebook
🌐
GitHub
github.blog › home › changelogs › deepseek-r1-0528 is now generally available in github models
DeepSeek-R1-0528 is now generally available in GitHub Models - GitHub Changelog
June 4, 2025 - The latest version of DeepSeek-R1, DeepSeek-R1-0528, is now available on GitHub Models. DeepSeek-R1-0528 is an updated version of DeepSeek-R1 with improved reasoning, inference, and performance via optimizations and enhanced computational…
🌐
GitHub
github.com › Rizwankaka › deepseek-r1-chat
GitHub - Rizwankaka/deepseek-r1-chat: chatbot made with gradio using opensource deepseek-r1 running locally
Built with Gradio and powered by the DeepSeek-r1 language model through Ollama, it provides intelligent coding assistance, debugging help, and programming guidance. ... Get instant, locally-processed responses for your coding queries!
Starred by 315 users
Forked by 48 users
Languages   Python
🌐
GitHub
github.com › agentsea › r1-computer-use
GitHub - agentsea/r1-computer-use: Applying the ideas of Deepseek R1 to computer use
DeepSeek-R1 has shown that large language models can develop powerful reasoning skills through iterative reward optimization. Traditionally, such projects rely on hard verifiers or rule-based scripts to determine correctness in tasks like math ...
Starred by 216 users
Forked by 11 users
Languages   Python
🌐
GitHub
github.com › jianzhnie › Open-R1
GitHub - jianzhnie/Open-R1: The open source implementation of DeepSeek-R1. 开源复现 DeepSeek-R1
Open-R1 is an open-source library that allows you to train a hyper-personalized DeepSeek-R1-like model using your own data and the least amount of compute possible.
Starred by 273 users
Forked by 52 users
Languages   Python 99.4% | Shell 0.6%
🌐
GitHub
gist.github.com › awni › ec071fd27940698edd14a4191855bba6
Run DeepSeek R1 or V3 with MLX Distributed · GitHub
Can two of the new Mac Studios with M3 Ultra and max 512GB of unified memory, networked using Thunderbolt 5, run the non-quantized R1 version? (saw the news and got curious) ... You could run the 8-bit model with 1TB of RAM. That's quantized, but perf should be about the same as the original fp8. ... Is there a limit on how much RAM can be made available to the GPU? Just wondering, for the coming 512GB Mac Studio, how much I can squeeze out for the GPU alone. I assume that if I leave only something like 16GB for the OS on each of 2 machines and get 496x2 GB of VRAM for DeepSeek R1, I can run the full version with fp16 on core attention and fp8 on the rest of the params?
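As a rough check on the memory question in this thread, the weights-only footprint can be estimated as parameter count times bits per parameter. The sketch below is not from the gist. It assumes the 671B parameter count quoted in the GitHub Models changelog result above, uses decimal gigabytes, and ignores KV cache, activations, and OS overhead. The function names `weight_gb` and `fits` are hypothetical.

```python
# Assumption: 671B parameters, as quoted in the GitHub Models changelog snippet.
PARAMS = 671e9


def weight_gb(params: float, bits_per_param: int) -> float:
    """Weights-only footprint in decimal gigabytes (no KV cache or activations)."""
    return params * bits_per_param / 8 / 1e9


def fits(params: float, bits_per_param: int, machines: int, gb_per_machine: float) -> bool:
    """True if the weights alone fit in the combined memory of all machines."""
    return weight_gb(params, bits_per_param) <= machines * gb_per_machine


if __name__ == "__main__":
    for bits in (8, 16):
        size = weight_gb(PARAMS, bits)
        ok = fits(PARAMS, bits, machines=2, gb_per_machine=512)
        print(f"{bits}-bit weights: {size:.0f} GB, fits in 2 x 512 GB: {ok}")
```

Under these assumptions the 8-bit weights (about 671 GB) fit within 2 x 512 GB, while uniform 16-bit weights (about 1342 GB) do not. The mixed fp16/fp8 layout asked about in the thread falls somewhere between, depending on how many parameters sit in the attention layers, which the snippet does not state.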
🌐
GitHub
github.com › deepseek-ai › DeepSeek-R1 › issues
deepseek-ai/DeepSeek-R1
deepseek-ai / DeepSeek-R1 Public · Fork 11.8k · Star 91.6k · Issues · is:issue state:open
Author   deepseek-ai