Sunil Kumar Yadav's picture

🏝️ On Vacation

Sunil Kumar Yadav

sukuya

·

https://sukuya.github.io/

sukuya

AI & ML interests

Machine Translation, Large Language Models

Recent Activity

liked a model about 1 month ago

bytedance-research/Lance

upvoted a collection about 2 months ago

liked a model about 2 months ago

talkie-lm/talkie-1930-13b-it

View all activity

Organizations

upvoted a collection about 2 months ago

GPT-1900

Pre-1900 LLMs for physics reasoning. RL models are physics-only; use the SFT model for general chat. Tune temperature (0.6-0.7). • 11 items • Updated Apr 2 • 9

upvoted 2 papers 3 months ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 123

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

upvoted a paper 9 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 517

upvoted a collection 9 months ago

🎯 Liquid Nanos

Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 34 items • Updated about 16 hours ago • 117

upvoted a collection over 1 year ago

Synthetic Dataset Creation

Spaces focused on generating synthetic datasets • 6 items • Updated Mar 2 • 11

upvoted a collection almost 2 years ago

LLM Compiler

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27, 2024 • 157

upvoted a paper about 2 years ago

TextGrad: Automatic "Differentiation" via Text

Paper • 2406.07496 • Published Jun 11, 2024 • 31

upvoted a paper over 2 years ago

RakutenAI-7B: Extending Large Language Models for Japanese

Paper • 2403.15484 • Published Mar 21, 2024 • 15

upvoted a paper about 3 years ago

Simple and Controllable Music Generation

Paper • 2306.05284 • Published Jun 8, 2023 • 167