Katherine Tieu
kthrn22
AI & ML interests
LLMs, Agents, RL, Multimodal Learning, GNNs
Recent Activity
upvoted
a
paper
about 9 hours ago
SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
upvoted
a
paper
3 days ago
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise
Reasoning