Vijay Saraswat's picture

Vijay Saraswat

vjsaraswat

·

Saraswat

AI & ML interests

language

Organizations

upvoted an article over 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

upvoted a paper about 2 years ago

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7

upvoted a paper over 2 years ago

Language Models can be Logical Solvers

Paper • 2311.06158 • Published Nov 10, 2023 • 20

upvoted 2 papers almost 3 years ago

Uncovering mesa-optimization algorithms in Transformers

Paper • 2309.05858 • Published Sep 11, 2023 • 14

Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 43