uj
u01
·
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 hour ago
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards liked a dataset about 1 hour ago
lambda/hermes-agent-reasoning-traces liked a model about 1 hour ago
unsloth/Kimi-K2.6-GGUF