Erdi ARI

erdiari

·

https://erdiari.dev

AI & ML interests

NLP - RL

Organizations

None yet

upvoted a paper 11 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 454

upvoted a collection over 2 years ago

VBART Finetuned Models

VBART model finetuned to specific cases. • 10 items • Updated May 15, 2024 • 3