Ariel Kwiatkowski
RedTachyon
AI & ML interests
RL, MARL, Crowd Simulation
Recent Activity
upvoted
a
paper
about 11 hours ago
Likelihood-Based Reward Designs for General LLM Reasoning
upvoted
a
paper
10 days ago
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
upvoted
a
paper
4 months ago
Soft Tokens, Hard Truths