Chinese artificial intelligence company

Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence (AI) company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, DeepSeek is … Wikipedia
Factsheet
Native name 杭州深度求索人工智能基础技术研究有限公司
Company type Private
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1
deepseek-ai/DeepSeek-R1 · Hugging Face
DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...
🌐
GitHub
github.com › huggingface › open-r1
GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1
Fully open reproduction of DeepSeek-R1. Contribute to huggingface/open-r1 development by creating an account on GitHub.
Starred by 25.7K users
Forked by 2.4K users
Languages   Python 89.4% | Shell 10.0% | Makefile 0.6%
🌐
Hugging Face
huggingface.co › deepseek-ai
deepseek-ai (DeepSeek)
Org profile for DeepSeek on Hugging Face, the AI community building the future.
🌐
Hugging Face
huggingface.co › blog › open-r1
Open-R1: a fully open reproduction of DeepSeek-R1
DeepSeek-R1 is a reasoning model built on the foundation of DeepSeek-V3. Like any good reasoning model, it starts with a strong base model, and DeepSeek-V3 is exactly that. This 671B Mixture of Experts (MoE) model performs on par with heavyweights ...
🌐
Hugging Face
huggingface.co › unsloth › DeepSeek-R1-GGUF
unsloth/DeepSeek-R1-GGUF · Hugging Face
DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-0528
deepseek-ai/DeepSeek-R1-0528 · Hugging Face
In the latest update, DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training. The model has demonstrated outstanding performance across ...
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1 › tree › main
deepseek-ai/DeepSeek-R1 at main
DeepSeek-R1 · 689 GB · 5 contributors History: 21 commits · msr2000 · Small fix 56d4cbb 9 months ago · figures · Release DeepSeek-R1 11 months ago · .gitattributes 1.52 kB · initial commit 11 months ago · LICENSE 1.06 kB · Release DeepSeek-R1 11 months ago ·
🌐
Hugging Face
huggingface.co › learn › llm-course › en › chapter12 › 3
Understanding the DeepSeek R1 Paper - Hugging Face LLM Course
DeepSeek R1 represents a significant advancement in language model training, particularly in developing reasoning capabilities through reinforcement learning.
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-0528-Qwen3-8B
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · Hugging Face
In the latest update, DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training. The model has demonstrated outstanding performance across ...
🌐
GitHub
github.com › deepseek-ai › DeepSeek-R1
GitHub - deepseek-ai/DeepSeek-R1
DeepSeek-R1-Distill-Qwen-32B · Qwen2.5-32B · 🤗 HuggingFace · DeepSeek-R1-Distill-Llama-70B · Llama-3.3-70B-Instruct · 🤗 HuggingFace · DeepSeek-R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. We slightly change their configs and tokenizers.
Starred by 91.6K users
Forked by 11.8K users
🌐
Hugging Face
huggingface.co › nvidia › DeepSeek-R1-FP4
nvidia/DeepSeek-R1-FP4 · Hugging Face
The NVIDIA DeepSeek R1 FP4 model is the quantized version of the DeepSeek AI's DeepSeek R1 model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here.
🌐
Hugging Face
huggingface.co › collections › unsloth › deepseek-r1-all-versions-678e1c48f5d2fce87892ace5
DeepSeek R1 (All Versions) - a unsloth Collection
DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-0528 › tree › main
deepseek-ai/DeepSeek-R1-0528 at main
Release DeepSeek-R1-0528 6 months ago · config.json · Safe 1.66 kB · Add files using upload-large-folder tool 6 months ago · configuration_deepseek.py · Safe 9.9 kB · Release DeepSeek-R1-0528 6 months ago · generation_config.json · Safe 171 Bytes · Add files using upload-large-folder tool 6 months ago ·
🌐
Hugging Face
huggingface.co › unsloth › DeepSeek-R1
unsloth/DeepSeek-R1 · Hugging Face
DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...
🌐
Hugging Face
huggingface.co › nvidia › DeepSeek-R1-NVFP4
nvidia/DeepSeek-R1-NVFP4 · Hugging Face
The NVIDIA DeepSeek R1 FP4 model is the quantized version of the DeepSeek AI's DeepSeek R1 model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here.
🌐
Hugging Face
huggingface.co › collections › deepseek-ai › deepseek-r1-678e1e131c0169c0bc89728d
DeepSeek-R1 - a deepseek-ai Collection
DeepSeek-R1 · DeepSeek-V3 · DeepSeek-VL2 · Janus · DeepSeek-Prover · DeepSeek-V2 · DeepSeekCoder-V2 · DeepSeek-Math · ESFT · DeepSeek-VL · DeepSeek-Coder · DeepSeek-LLM · DeepSeek-MoE · DeepSeek-V2.5 · updated May 29 · Upvote · 807 · +797 · Text Generation • 685B • Updated Mar 27 • 518k • • 12.8k ·
🌐
YouTube
youtube.com › watch
Hugging Face Journal Club - DeepSeek R1 - YouTube
The post-training team at Hugging Face discusses the tech report behind DeepSeek's groundbreaking R1 models. Report: https://github.com/deepseek-ai/DeepSeek-...
Published   January 22, 2025
🌐
Hugging Face
huggingface.co › bartowski › DeepSeek-R1-GGUF
bartowski/DeepSeek-R1-GGUF · Hugging Face
huggingface-cli download bartowski/DeepSeek-R1-GGUF --include "DeepSeek-R1-Q8_0/*" --local-dir ./
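The same selective download can probably be done from Python. The sketch below assumes the `huggingface_hub` package is installed and uses its `snapshot_download` function, whose `allow_patterns` argument plays the role of the CLI's `--include` flag. The helper names (`matches_include`, `download_q8`) and the file paths in the comments are hypothetical, chosen only for illustration.

```python
from fnmatch import fnmatch

# Repo and pattern taken from the CLI command above.
REPO_ID = "bartowski/DeepSeek-R1-GGUF"
INCLUDE = ["DeepSeek-R1-Q8_0/*"]


def matches_include(path: str, patterns: list[str]) -> bool:
    """Return True if a repo-relative path matches any glob pattern.

    Hypothetical helper for previewing which files a pattern would
    select; it does not contact the Hub.
    """
    return any(fnmatch(path, pattern) for pattern in patterns)


def download_q8(local_dir: str = "./") -> str:
    """Download only the files matching INCLUDE into local_dir.

    Assumes huggingface_hub is installed; imported lazily so that
    matches_include() stays usable without it.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=INCLUDE,  # counterpart of --include
        local_dir=local_dir,     # counterpart of --local-dir
    )
```

Calling `download_q8()` would start a very large download, so it may be worth checking a few candidate paths with `matches_include` first, for example `matches_include("DeepSeek-R1-Q8_0/example.gguf", INCLUDE)` (a made-up file name).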
🌐
Hugging Face
huggingface.co › spaces › webml-community › deepseek-r1-webgpu
DeepSeek-R1 WebGPU - a Hugging Face Space by webml-community
Enter text to find images that match your description. Provide a text query, and the app will return relevant images for you.
🌐
Reddit
reddit.com › r/deeplearning › hugging face releases fully open source version of deepseek r1 called open-r1
r/deeplearning on Reddit: hugging face releases fully open source version of deepseek r1 called open-r1
November 2, 2024

For those wary of using a Chinese AI, or who want to more easily build more powerful AIs based on DeepSeek's R1:

"The release of DeepSeek-R1 is an amazing boon for the community, but they didn’t release everything—although the model weights are open, the datasets and code used to train the model are not.

The goal of Open-R1 is to build these last missing pieces so that the whole research and industry community can build similar or better models using these recipes and datasets. And by doing this in the open, everybody in the community can contribute!

As shown in the figure below, here’s our plan of attack:

Step 1: Replicate the R1-Distill models by distilling a high-quality reasoning dataset from DeepSeek-R1.

Step 2: Replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

Step 3: Show we can go from base model → SFT → RL via multi-stage training.

The synthetic datasets will allow everybody to fine-tune existing or new LLMs into reasoning models by simply fine-tuning on them. The training recipes involving RL will serve as a starting point for anybody to build similar models from scratch and will allow researchers to build even more advanced methods on top."

https://huggingface.co/blog/open-r1?utm_source=tldrai#what-is-deepseek-r1