AI & ML interests

None defined yet.

Recent Activity

sergiopaniego 
posted an update 3 days ago
sergiopaniego 
posted an update 10 days ago
sergiopaniego 
posted an update 12 days ago
view post
Post
251
GLM-5.2 is open and comes with competitive performance against opus 4.8

day-0 in transformers + vllm + sglang, mit license 🤗

on the post-training side: critic-based ppo for variable-length agentic rollouts (ppo is back!) + an online anti-reward-hacking module that feeds the agent dummy info when it tries to cheat