R1-1776
Path: | /datasets/ai/perplexity | |
URL: | https://huggingface.co/perplexity-ai/r1-1776-distill-llama-70b | |
Downloaded: | 2025-02-26 | |
Cite: | Guo, Daya, et al. “Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning.” arXiv preprint arXiv:2501.12948 (2025) | |
Variant: |
| |
Bibtex: |
|