DeepSeek

Chinese artificial intelligence company

Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence (AI) company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, DeepSeek is … Wikipedia

Factsheet

Native name 杭州深度求索人工智能基础技术研究有限公司

Company type Private

Industry Information technology
Artificial intelligence

huggingface.co › deepseek-ai › DeepSeek-R1

deepseek-ai/DeepSeek-R1 · Hugging Face

DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...

github.com › huggingface › open-r1

GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1

Fully open reproduction of DeepSeek-R1. Contribute to huggingface/open-r1 development by creating an account on GitHub.

Starred by 25.7K users

Forked by 2.4K users

Languages Python 89.4% | Shell 10.0% | Makefile 0.6%

Videos

Deploying Deepseek R1 on Hugging Face - YouTube

DeepSeek V3.1 First Test – Is This The BEST Open Source LLM? ...

August 20, 2025

You HAVE to Try Agentic RAG with DeepSeek R1 (Insane Results) - ...

February 3, 2025

DeepSeek's R1 model will be replicated by researchers at Hugging ...

January 29, 2025

How to Run the Deepseek R1 LLM on Windows (7b mini) #ollama ...

January 29, 2025

huggingface.co › deepseek-ai

deepseek-ai (DeepSeek)

Org profile for DeepSeek on Hugging Face, the AI community building the future.

huggingface.co › blog › open-r1

Open-R1: a fully open reproduction of DeepSeek-R1

DeepSeek-R1 is a reasoning model built on the foundation of DeepSeek-V3. Like any good reasoning model, it starts with a strong base model, and DeepSeek-V3 is exactly that. This 671B Mixture of Experts (MoE) model performs on par with heavyweights ...

huggingface.co › unsloth › DeepSeek-R1-GGUF

unsloth/DeepSeek-R1-GGUF · Hugging Face

DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...

huggingface.co › deepseek-ai › DeepSeek-R1-0528

deepseek-ai/DeepSeek-R1-0528 · Hugging Face

In the latest update, DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training. The model has demonstrated outstanding performance across ...

huggingface.co › deepseek-ai › DeepSeek-R1 › tree › main

deepseek-ai/DeepSeek-R1 at main

DeepSeek-R1 · 689 GB · 5 contributors History: 21 commits · msr2000 · Small fix 56d4cbb 9 months ago · figures · Release DeepSeek-R1 11 months ago · .gitattributes 1.52 kB · initial commit 11 months ago · LICENSE 1.06 kB · Release DeepSeek-R1 11 months ago ·

huggingface.co › learn › llm-course › en › chapter12 › 3

Understanding the DeepSeek R1 Paper - Hugging Face LLM Course

DeepSeek R1 represents a significant advancement in language model training, particularly in developing reasoning capabilities through reinforcement learning.

huggingface.co › deepseek-ai › DeepSeek-R1-0528-Qwen3-8B

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · Hugging Face

In the latest update, DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training. The model has demonstrated outstanding performance across ...

github.com › deepseek-ai › DeepSeek-R1

GitHub - deepseek-ai/DeepSeek-R1

DeepSeek-R1-Distill-Qwen-32B · Qwen2.5-32B · 🤗 HuggingFace · DeepSeek-R1-Distill-Llama-70B · Llama-3.3-70B-Instruct · 🤗 HuggingFace · DeepSeek-R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. We slightly change their configs and tokenizers.

Starred by 91.6K users

Forked by 11.8K users

huggingface.co › nvidia › DeepSeek-R1-FP4

nvidia/DeepSeek-R1-FP4 · Hugging Face

The NVIDIA DeepSeek R1 FP4 model is the quantized version of the DeepSeek AI's DeepSeek R1 model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here.

huggingface.co › collections › unsloth › deepseek-r1-all-versions-678e1c48f5d2fce87892ace5

DeepSeek R1 (All Versions) - a unsloth Collection

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.

huggingface.co › deepseek-ai › DeepSeek-R1-0528 › tree › main

deepseek-ai/DeepSeek-R1-0528 at main

Release DeepSeek-R1-0528 6 months ago · config.json · Safe 1.66 kB · Add files using upload-large-folder tool 6 months ago · configuration_deepseek.py · Safe 9.9 kB · Release DeepSeek-R1-0528 6 months ago · generation_config.json · Safe 171 Bytes · Add files using upload-large-folder tool 6 months ago ·

huggingface.co › unsloth › DeepSeek-R1

unsloth/DeepSeek-R1 · Hugging Face

DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...

huggingface.co › nvidia › DeepSeek-R1-NVFP4

nvidia/DeepSeek-R1-NVFP4 · Hugging Face

The NVIDIA DeepSeek R1 FP4 model is the quantized version of the DeepSeek AI's DeepSeek R1 model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here.

huggingface.co › collections › deepseek-ai › deepseek-r1-678e1e131c0169c0bc89728d

DeepSeek-R1 - a deepseek-ai Collection

DeepSeek-R1 · DeepSeek-V3 · DeepSeek-VL2 · Janus · DeepSeek-Prover · DeepSeek-V2 · DeepSeekCoder-V2 · DeepSeek-Math · ESFT · DeepSeek-VL · DeepSeek-Coder · DeepSeek-LLM · DeepSeek-MoE · DeepSeek-V2.5 · updated May 29

youtube.com › watch

Hugging Face Journal Club - DeepSeek R1 - YouTube

The post-training team at Hugging Face discuss the tech report behind DeepSeek's ground breaking R1 models.- Report: https://github.com/deepseek-ai/DeepSeek-...

Published January 22, 2025

huggingface.co › bartowski › DeepSeek-R1-GGUF

bartowski/DeepSeek-R1-GGUF · Hugging Face

huggingface-cli download bartowski/DeepSeek-R1-GGUF --include "DeepSeek-R1-Q8_0/*" --local-dir ./
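
The same selective download can probably be scripted from Python. The sketch below is a rough equivalent of the command above, not something taken from the model card: it assumes the `huggingface_hub` package is installed and that its `snapshot_download` function accepts `repo_id`, `allow_patterns`, and `local_dir`. The helper `matches_include` and the example file name in the comment are hypothetical, added only to illustrate how a glob-style include pattern narrows the download to one quantization folder.

```python
from fnmatch import fnmatch

# Values copied from the CLI command above.
REPO_ID = "bartowski/DeepSeek-R1-GGUF"
INCLUDE = ["DeepSeek-R1-Q8_0/*"]


def matches_include(path, patterns=INCLUDE):
    """Hypothetical helper: True if a repo-relative path matches any include glob.

    For example, a made-up path such as "DeepSeek-R1-Q8_0/example.gguf"
    matches, while "README.md" does not.
    """
    return any(fnmatch(path, pattern) for pattern in patterns)


def download_q8(local_dir="./"):
    """Fetch only the files under DeepSeek-R1-Q8_0/ (assumes huggingface_hub is installed)."""
    # Imported lazily so matches_include() can be used without the package.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=INCLUDE,
        local_dir=local_dir,
    )
```

Calling `download_q8()` starts the actual transfer, which is very large for a Q8_0 quantization of this model, so it is worth checking available disk space first.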

huggingface.co › spaces › webml-community › deepseek-r1-webgpu

DeepSeek-R1 WebGPU - a Hugging Face Space by webml-community

Enter text to find images that match your description. Provide a text query, and the app will return relevant images for you.

reddit.com › r/deeplearning › hugging face releases fully open source version of deepseek r1 called open-r1

r/deeplearning on Reddit: hugging face releases fully open source version of deepseek r1 called open-r1

November 2, 2024

For those wary of using a Chinese AI, or who want to more easily build more powerful AIs based on DeepSeek's R1:

"The release of DeepSeek-R1 is an amazing boon for the community, but they didn’t release everything—although the model weights are open, the datasets and code used to train the model are not.

The goal of Open-R1 is to build these last missing pieces so that the whole research and industry community can build similar or better models using these recipes and datasets. And by doing this in the open, everybody in the community can contribute!

As shown in the figure below, here’s our plan of attack:

Step 1: Replicate the R1-Distill models by distilling a high-quality reasoning dataset from DeepSeek-R1.

Step 2: Replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

Step 3: Show we can go from base model → SFT → RL via multi-stage training.

The synthetic datasets will allow everybody to fine-tune existing or new LLMs into reasoning models by simply fine-tuning on them. The training recipes involving RL will serve as a starting point for anybody to build similar models from scratch and will allow researchers to build even more advanced methods on top."

https://huggingface.co/blog/open-r1?utm_source=tldrai#what-is-deepseek-r1

There's not much to comment on! Thank God we have initiatives like this that allow us, the curious mortals, to learn, play, and enjoy the latest trends and research in this fascinating world of AI.

An important correction: they haven't released it yet; they're still working on it. But I doubt it will take them more than a couple of weeks.