Ruiyi Wang
ruiyiwang
AI & ML interests
social agents, LLM reasoning, reinforcement learning
Recent Activity
updated a dataset 28 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits published a dataset 28 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits updated a model about 2 months ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3Organizations
None yet