Laxmi Tiwari
laxuu
AI & ML interests
Agentic AI, RL, MARL
Recent Activity
reacted to theirpost with 👍 about 14 hours ago
Hot take :Wednesday🔥
For years, AI progress has often looked like:
"Need a smarter model?"
➡️ Add more parameters.
➡️ Add more GPUs.
➡️ Hope your budget survives.
RobotxR1 explores a different idea: what if intelligence comes from experience rather than scale?
Instead of relying solely on massive pretraining, it combines LLMs with reinforcement learning, allowing models to learn through interaction, feedback, and mistakes.
As someone interested in Recurrent RL and autonomous systems, this raises an exciting question:
Are we entering the era where experience becomes more valuable than parameters?
The next breakthrough AI might not be the biggest model.
It might be the one that learns continuously.
📄 Paper: https://arxiv.org/pdf/2505.03238
💻 Code: https://github.com/ForzaETH/LLMxRobot/tree/main
#ReinforcementLearning #LLM #Robotics #AI #smallmodelhackathonhuggingface
reacted to theirpost with 🧠 about 14 hours ago
Hot take :Wednesday🔥
For years, AI progress has often looked like:
"Need a smarter model?"
➡️ Add more parameters.
➡️ Add more GPUs.
➡️ Hope your budget survives.
RobotxR1 explores a different idea: what if intelligence comes from experience rather than scale?
Instead of relying solely on massive pretraining, it combines LLMs with reinforcement learning, allowing models to learn through interaction, feedback, and mistakes.
As someone interested in Recurrent RL and autonomous systems, this raises an exciting question:
Are we entering the era where experience becomes more valuable than parameters?
The next breakthrough AI might not be the biggest model.
It might be the one that learns continuously.
📄 Paper: https://arxiv.org/pdf/2505.03238
💻 Code: https://github.com/ForzaETH/LLMxRobot/tree/main
#ReinforcementLearning #LLM #Robotics #AI #smallmodelhackathonhuggingface
reacted to theirpost with 🚀 about 14 hours ago
Hot take :Wednesday🔥
For years, AI progress has often looked like:
"Need a smarter model?"
➡️ Add more parameters.
➡️ Add more GPUs.
➡️ Hope your budget survives.
RobotxR1 explores a different idea: what if intelligence comes from experience rather than scale?
Instead of relying solely on massive pretraining, it combines LLMs with reinforcement learning, allowing models to learn through interaction, feedback, and mistakes.
As someone interested in Recurrent RL and autonomous systems, this raises an exciting question:
Are we entering the era where experience becomes more valuable than parameters?
The next breakthrough AI might not be the biggest model.
It might be the one that learns continuously.
📄 Paper: https://arxiv.org/pdf/2505.03238
💻 Code: https://github.com/ForzaETH/LLMxRobot/tree/main
#ReinforcementLearning #LLM #Robotics #AI #smallmodelhackathonhuggingface