Chmielewski
Eryk-Chmielewski
AI & ML interests
Senior AI Agent Architect
Recent Activity
upvoted a paper 2 days ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language
Models upvoted a paper 2 days ago
How Far Can Unsupervised RLVR Scale LLM Training? commented on
a paper
4 days ago
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning