DeepSeek

Chinese artificial intelligence company

Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence (AI) company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, DeepSeek is … Wikipedia

Factsheet

Native name 杭州深度求索人工智能基础技术研究有限公司

Company type Private

Industry Information technology
Artificial intelligence

Factsheet

Native name 杭州深度求索人工智能基础技术研究有限公司

Company type Private

Industry Information technology
Artificial intelligence

Hugging Face

huggingface.co › deepseek-ai › DeepSeek-R1

deepseek-ai/DeepSeek-R1 · Hugging Face

1 month ago - DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...

Hugging Face

huggingface.co › deepseek-ai

deepseek-ai (DeepSeek)

3 weeks ago - Org profile for DeepSeek on Hugging Face, the AI community building the future.

Discussions

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · Hugging Face

Made some Unsloth dynamic GGUFs which retain accuracy: https://huggingface.co/unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF More on reddit.com

r/LocalLLaMA

298

February 27, 2025

Hugging Face wants to reverse-engineer DeepSeek’s R1

Begging of twentieth century: soviets reverse engineering British tanks Second half of twentieth century: Chinese reverse engineering soviet planes Twenty first century: USA reverse engineering Chinese ai models More on reddit.com

r/LocalLLaMA

104

November 16, 2024

Videos

03:09

YouTube

Deploying Deepseek R1 on Hugging Face - YouTube

April 14, 2025

05:51

YouTube

How to Run the Deepseek R1 LLM on Windows (7b mini) #ollama ...

January 29, 2025

02:30

YouTube

AI News: DeepSeek-R1 V2, the new open source SoTA! - YouTube

May 29, 2025

View all

Hugging Face

huggingface.co › deepseek-ai › DeepSeek-R1-0528 › tree › main

deepseek-ai/DeepSeek-R1-0528 at main

May 29, 2025 - Release DeepSeek-R1-0528 6 months ago · config.json · Safe 1.66 kB · Add files using upload-large-folder tool 6 months ago · configuration_deepseek.py · Safe 9.9 kB · Release DeepSeek-R1-0528 6 months ago · generation_config.json · Safe 171 Bytes · Add files using upload-large-folder tool 6 months ago ·

Hugging Face

huggingface.co › learn › llm-course › en › chapter12 › 3

Understanding the DeepSeek R1 Paper - Hugging Face LLM Course

DeepSeek R1 represents a significant advancement in language model training, particularly in developing reasoning capabilities through reinforcement learning.

Hugging Face

huggingface.co › nvidia › DeepSeek-R1-NVFP4

nvidia/DeepSeek-R1-NVFP4 · Hugging Face

1 week ago - The NVIDIA DeepSeek R1 FP4 model is the quantized version of the DeepSeek AI's DeepSeek R1 model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here.

GitHub

github.com › huggingface › open-r1

GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1

Fully open reproduction of DeepSeek-R1. Contribute to huggingface/open-r1 development by creating an account on GitHub.

Starred by 25.7K users

Forked by 2.4K users

Languages Python 89.4% | Shell 10.0% | Makefile 0.6%

Find elsewhere

Google Bing Mojeek

Hugging Face

huggingface.co › collections › unsloth › deepseek-r1-all-versions

DeepSeek R1 (All Versions) - a unsloth Collection

2 weeks ago - DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.

Hugging Face

huggingface.co › nvidia › DeepSeek-R1-FP4

nvidia/DeepSeek-R1-FP4 · Hugging Face

November 15, 2025 - The NVIDIA DeepSeek R1 FP4 model is the quantized version of the DeepSeek AI's DeepSeek R1 model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here.

Hugging Face

huggingface.co › blog › open-r1

Open-R1: a fully open reproduction of DeepSeek-R1

January 28, 2025 - DeepSeek-R1 is a reasoning model built on the foundation of DeepSeek-V3. Like any good reasoning model, it starts with a strong base model, and DeepSeek-V3 is exactly that. This 671B Mixture of Experts (MoE) model performs on par with heavyweights ...

Hugging Face

huggingface.co › deepseek-ai › DeepSeek-V3.1

deepseek-ai/DeepSeek-V3.1 · Hugging Face

1 month ago - For complex questions that require ... DeepSeek-V3.1 can leverage a user-provided search tool through a multi-turn tool-calling process. Please refer to the assets/search_tool_trajectory.html and assets/search_python_tool_trajectory.html for the detailed template. ... Search agents are evaluated with our internal search framework, which uses a commercial search API + webpage filter + 128K context window. Seach agent results of R1-0528 are evaluated ...

Hugging Face

huggingface.co › unsloth › DeepSeek-R1-GGUF

unsloth/DeepSeek-R1-GGUF · Hugging Face

2 weeks ago - DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...

Hugging Face

huggingface.co › deepseek-ai › DeepSeek-R1-0528

deepseek-ai/DeepSeek-R1-0528 · Hugging Face

1 month ago - In the latest update, DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training. The model has demonstrated outstanding performance across various benchmark evaluations, including mathematics, programming, and general logic.

Hugging Face

huggingface.co › microsoft › MAI-DS-R1

microsoft/MAI-DS-R1 · Hugging Face

May 1, 2025 - MAI-DS-R1 is a DeepSeek-R1 reasoning model that has been post-trained by the Microsoft AI team to improve its responsiveness on blocked topics and its risk profile, while maintaining its reasoning capabilities and competitive performance.

reddit.com › r/localllama › deepseek-ai/deepseek-r1-0528-qwen3-8b · hugging face

r/LocalLLaMA on Reddit: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · Hugging Face

February 27, 2025 - It sounded like you could just go to the DeepSeek page in HF and grab the GGUF from there. I looked into it and found that you can't do that, and that the only GGUFs available are through 3rd parties. Ollama also has their pages up if you google r1-0528 + the quantization annotation

YouTube

youtube.com › watch

Hugging Face Journal Club - DeepSeek R1 - YouTube

43:26

The post-training team at Hugging Face discuss the tech report behind DeepSeek's ground breaking R1 models.- Report: https://github.com/deepseek-ai/DeepSeek-...

Published January 22, 2025

Theaisignal

theaisignal.com › p › hugging-face-takes-on-deepseeks-r1

Hugging Face Takes on DeepSeek’s R1 with Open-R1 Project

January 29, 2025 - In response to DeepSeek’s “black box” release of its R1 reasoning model, Hugging Face has launched Open-R1 to fully open-source its replication. Backed by its Science Cluster and community support, the project aims to unlock AI transparency ...

C# Corner

c-sharpcorner.com › home › latest technology news › deepseek updates r1 ai model, now on hugging face

DeepSeek Updates R1 AI Model, Now on Hugging Face

June 30, 2025 - Chinese AI firm DeepSeek has unveiled an updated version of its flagship reasoning model R1, now available on the popular machine learning platform Hugging Face.

Venturebeat

venturebeat.com › ai › holy-smokes-a-new-200-faster-deepseek-r1-0528-variant-appears-from-german-lab-tng-technology-consulting-gmbh

HOLY SMOKES! A new, 200% faster DeepSeek R1-0528 variant appears from German lab TNG Technology Consulting GmbH | VentureBeat

August 27, 2025 - On the model card TNG released for its new R1T2 on the AI code sharing community Hugging Face, the company states that it is "about 20% faster than the regular R1" (the one released back in January) "and more than twice as fast as R1-0528" (the ...

OpenRouter

openrouter.ai › deepseek › deepseek-r1-0528-qwen3-8b:free

DeepSeek R1 0528 Qwen3 8B - API, Providers, Stats | OpenRouter

May 29, 2025 - DeepSeek-R1-0528 is a lightly upgraded release of DeepSeek R1 that taps more compute and smarter post-training tricks, pushing its reasoning and inference to the brink of flagship models like O3 and Gemini 2.5 Pro. Run DeepSeek R1 0528 Qwen3 8B with API

reddit.com › r/localllama › hugging face wants to reverse-engineer deepseek’s r1

r/LocalLLaMA on Reddit: Hugging Face wants to reverse-engineer DeepSeek’s R1

November 16, 2024 - Guys I reverse engineered deepseeks r1.