Balaji Rudrawar

balaji1233

AI & ML interests

None yet

upvoted a collection 5 months ago

📝 Research & Long-Form Blog Posts

In-depth technical articles and research pieces published by Hugging Face • 18 items • Updated May 28 • 35

upvoted an article 6 months ago

Article

Showcase Your Projects in Spaces using Gradio

merve • Oct 5, 2021 • 14

upvoted a collection 7 months ago

NeMo Gym

Collection of RL verifiable data for NeMo Gym • 32 items • Updated 21 days ago • 62

upvoted an article about 1 year ago

Article

How to Build an MCP Server with Gradio

abidlabs, ysharma • Apr 30, 2025 • 202

upvoted a paper about 1 year ago

LLMs Get Lost In Multi-Turn Conversation

Paper • 2505.06120 • Published May 9, 2025 • 7

upvoted an article about 1 year ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 613

upvoted 6 papers about 1 year ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14, 2025 • 79

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24, 2025 • 124

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15, 2025 • 63

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 210

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31, 2025 • 305

upvoted a paper over 1 year ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 146

upvoted 2 articles over 1 year ago

Article

Open R1: Update #3

open-r1 • Mar 11, 2025 • 298

Article

Trace & Evaluate your Agent with Arize Phoenix

schavalii, jgilhuly16, m-ric • Feb 28, 2025 • 41

upvoted 2 papers over 1 year ago

Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

Paper • 2402.14207 • Published Feb 22, 2024 • 10

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 125

upvoted a collection over 1 year ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 567

upvoted an article over 1 year ago

Article

Welcome to Inference Providers on the Hub 🔥

burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c • Jan 28, 2025 • 494

upvoted a collection over 1 year ago

Deepseek Papers

Deepseek papers collection • 32 items • Updated 4 days ago • 355