p

shivash

AI & ML interests

None yet

Recent Activity

updated a collection 5 days ago

modles to work with

liked a model 5 days ago

mistralai/Devstral-2-123B-Instruct-2512

Organizations

upvoted a paper 3 months ago

Seed-Coder: Let the Code Model Curate Data for Itself

Paper • 2506.03524 • Published Jun 4, 2025 • 8

upvoted 3 papers 8 months ago

Predicting the Order of Upcoming Tokens Improves Language Modeling

Paper • 2508.19228 • Published Aug 26, 2025 • 23

Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs

Paper • 2509.24107 • Published Sep 28, 2025 • 81

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 99