DeepSeek-AI, Guo, D., Yang, D., Zhang, H., Song, J., Zhang, R., Xu, R., Zhu,
Q., Ma, S., Wang, P., Bi, X., Zhang, X., Yu, X., Wu, Y., Wu, Z. F., Gou, Z.,
Shao, Z., Li, Z., Gao, Z., … Zhang, Z. (2025). DeepSeek-R1: Incentivizing
Reasoning Capability in LLMs via Reinforcement Learning (No.
arXiv:2501.12948). arXiv.
https://doi.org/10.48550/arXiv.2501.12948