DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 439
VBART Finetuned Models Collection VBART model finetuned to specific cases. • 10 items • Updated May 15, 2024 • 2