DeepSeek

Path:	`/datasets/ai/deepseek`
URL:	https://huggingface.co/deepseek-ai
Downloaded:	2025-02-10
Cite:	Guo, Daya, et al. “Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning.” arXiv preprint arXiv:2501.12948 (2025) Liu, Aixin, et al. “Deepseek-v3 technical report.” arXiv preprint arXiv:2412.19437 (2024).
Variant:	DeepSeek-R1 DeepSeek-R1-Distill-Llama-70B DeepSeek-R1-Distill-Llama-8B DeepSeek-R1-Distill-Qwen-1.5B DeepSeek-R1-Distill-Qwen-14B DeepSeek-R1-Distill-Qwen-32B DeepSeek-R1-Distill-Qwen-7B DeepSeek-R1-Zero DeepSeek-V3 DeepSeek-V3-Base Janus-Pro-7B deepseek-coder-1.3b-instruct deepseek-coder-33b-instruct deepseek-coder-6.7b-base deepseek-coder-6.7b-instruct deepseek-math-7b-instruct
Bibtex:	@article{guo2025deepseek,4 title={Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning}, author={Guo, Daya and Yang, Dejian and Zhang, Haowei and Song, Junxiao and Zhang, Ruoyu and Xu, Runxin and Zhu, Qihao and Ma, Shirong and Wang, Peiyi and Bi, Xiao and others}, journal={arXiv preprint arXiv:2501.12948}, year={2025} } @article{liu2024deepseek, title={Deepseek-v3 technical report}, author={Liu, Aixin and Feng, Bei and Xue, Bing and Wang, Bingxuan and Wu, Bochao and Lu, Chengda and Zhao, Chenggang and Deng, Chengqi and Zhang, Chenyu and Ruan, Chong and others}, journal={arXiv preprint arXiv:2412.19437}, year={2024} }