Chinese artificial intelligence company

Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence (AI) company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, DeepSeek is … Wikipedia
Factsheet
Native name 杭州深度求索人工智能基础技术研究有限公司
Company type Private
Factsheet
Native name 杭州深度求索人工智能基础技术研究有限公司
Company type Private
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1
deepseek-ai/DeepSeek-R1 · Hugging Face
1 month ago - DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...
🌐
Hugging Face
huggingface.co › deepseek-ai
deepseek-ai (DeepSeek)
3 weeks ago - Org profile for DeepSeek on Hugging Face, the AI community building the future.
Discussions

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · Hugging Face
Made some Unsloth dynamic GGUFs which retain accuracy: https://huggingface.co/unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF More on reddit.com
🌐 r/LocalLLaMA
68
298
February 27, 2025
Hugging Face wants to reverse-engineer DeepSeek’s R1
Begging of twentieth century: soviets reverse engineering British tanks Second half of twentieth century: Chinese reverse engineering soviet planes Twenty first century: USA reverse engineering Chinese ai models More on reddit.com
🌐 r/LocalLLaMA
20
104
November 16, 2024
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-0528 › tree › main
deepseek-ai/DeepSeek-R1-0528 at main
May 29, 2025 - Release DeepSeek-R1-0528 6 months ago · config.json · Safe 1.66 kB · Add files using upload-large-folder tool 6 months ago · configuration_deepseek.py · Safe 9.9 kB · Release DeepSeek-R1-0528 6 months ago · generation_config.json · Safe 171 Bytes · Add files using upload-large-folder tool 6 months ago ·
🌐
Hugging Face
huggingface.co › learn › llm-course › en › chapter12 › 3
Understanding the DeepSeek R1 Paper - Hugging Face LLM Course
DeepSeek R1 represents a significant advancement in language model training, particularly in developing reasoning capabilities through reinforcement learning.
🌐
Hugging Face
huggingface.co › nvidia › DeepSeek-R1-NVFP4
nvidia/DeepSeek-R1-NVFP4 · Hugging Face
1 week ago - The NVIDIA DeepSeek R1 FP4 model is the quantized version of the DeepSeek AI's DeepSeek R1 model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here.
🌐
GitHub
github.com › huggingface › open-r1
GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1
Fully open reproduction of DeepSeek-R1. Contribute to huggingface/open-r1 development by creating an account on GitHub.
Starred by 25.7K users
Forked by 2.4K users
Languages   Python 89.4% | Shell 10.0% | Makefile 0.6%
Find elsewhere
🌐
Hugging Face
huggingface.co › collections › unsloth › deepseek-r1-all-versions
DeepSeek R1 (All Versions) - a unsloth Collection
2 weeks ago - DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
🌐
Hugging Face
huggingface.co › nvidia › DeepSeek-R1-FP4
nvidia/DeepSeek-R1-FP4 · Hugging Face
November 15, 2025 - The NVIDIA DeepSeek R1 FP4 model is the quantized version of the DeepSeek AI's DeepSeek R1 model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here.
🌐
Hugging Face
huggingface.co › blog › open-r1
Open-R1: a fully open reproduction of DeepSeek-R1
January 28, 2025 - DeepSeek-R1 is a reasoning model built on the foundation of DeepSeek-V3. Like any good reasoning model, it starts with a strong base model, and DeepSeek-V3 is exactly that. This 671B Mixture of Experts (MoE) model performs on par with heavyweights ...
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-V3.1
deepseek-ai/DeepSeek-V3.1 · Hugging Face
1 month ago - For complex questions that require ... DeepSeek-V3.1 can leverage a user-provided search tool through a multi-turn tool-calling process. Please refer to the assets/search_tool_trajectory.html and assets/search_python_tool_trajectory.html for the detailed template. ... Search agents are evaluated with our internal search framework, which uses a commercial search API + webpage filter + 128K context window. Seach agent results of R1-0528 are evaluated ...
🌐
Hugging Face
huggingface.co › unsloth › DeepSeek-R1-GGUF
unsloth/DeepSeek-R1-GGUF · Hugging Face
2 weeks ago - DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community. Notably, it is the first open research to validate that reasoning capabilities ...
🌐
Hugging Face
huggingface.co › deepseek-ai › DeepSeek-R1-0528
deepseek-ai/DeepSeek-R1-0528 · Hugging Face
1 month ago - In the latest update, DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training. The model has demonstrated outstanding performance across various benchmark evaluations, including mathematics, programming, and general logic.
🌐
Hugging Face
huggingface.co › microsoft › MAI-DS-R1
microsoft/MAI-DS-R1 · Hugging Face
May 1, 2025 - MAI-DS-R1 is a DeepSeek-R1 reasoning model that has been post-trained by the Microsoft AI team to improve its responsiveness on blocked topics and its risk profile, while maintaining its reasoning capabilities and competitive performance.
🌐
Reddit
reddit.com › r/localllama › deepseek-ai/deepseek-r1-0528-qwen3-8b · hugging face
r/LocalLLaMA on Reddit: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · Hugging Face
February 27, 2025 - It sounded like you could just go to the DeepSeek page in HF and grab the GGUF from there. I looked into it and found that you can't do that, and that the only GGUFs available are through 3rd parties. Ollama also has their pages up if you google r1-0528 + the quantization annotation
🌐
YouTube
youtube.com › watch
Hugging Face Journal Club - DeepSeek R1 - YouTube
The post-training team at Hugging Face discuss the tech report behind DeepSeek's ground breaking R1 models.- Report: https://github.com/deepseek-ai/DeepSeek-...
Published   January 22, 2025
🌐
Theaisignal
theaisignal.com › p › hugging-face-takes-on-deepseeks-r1
Hugging Face Takes on DeepSeek’s R1 with Open-R1 Project
January 29, 2025 - In response to DeepSeek’s “black box” release of its R1 reasoning model, Hugging Face has launched Open-R1 to fully open-source its replication. Backed by its Science Cluster and community support, the project aims to unlock AI transparency ...
🌐
C# Corner
c-sharpcorner.com › home › latest technology news › deepseek updates r1 ai model, now on hugging face
DeepSeek Updates R1 AI Model, Now on Hugging Face
June 30, 2025 - Chinese AI firm DeepSeek has unveiled an updated version of its flagship reasoning model R1, now available on the popular machine learning platform Hugging Face.
🌐
Venturebeat
venturebeat.com › ai › holy-smokes-a-new-200-faster-deepseek-r1-0528-variant-appears-from-german-lab-tng-technology-consulting-gmbh
HOLY SMOKES! A new, 200% faster DeepSeek R1-0528 variant appears from German lab TNG Technology Consulting GmbH | VentureBeat
August 27, 2025 - On the model card TNG released for its new R1T2 on the AI code sharing community Hugging Face, the company states that it is "about 20% faster than the regular R1" (the one released back in January) "and more than twice as fast as R1-0528" (the ...
🌐
OpenRouter
openrouter.ai › deepseek › deepseek-r1-0528-qwen3-8b:free
DeepSeek R1 0528 Qwen3 8B - API, Providers, Stats | OpenRouter
May 29, 2025 - DeepSeek-R1-0528 is a lightly upgraded release of DeepSeek R1 that taps more compute and smarter post-training tricks, pushing its reasoning and inference to the brink of flagship models like O3 and Gemini 2.5 Pro. Run DeepSeek R1 0528 Qwen3 8B with API