Neer Vana

Neervana

3 2

·

AI & ML interests

None yet

Organizations

upvoted an article 8 months ago

Article

AutoBench Goes Scientific: Rigorous Validation for a Dynamic, Open-Source LLM Benchmark

PeterKruger

•

Oct 29, 2025

• 4

upvoted an article 11 months ago

Article

AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org

PeterKruger

•

Aug 20, 2025

• 6

upvoted a paper about 1 year ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 96