Yejin Choi

yejinchoinka

https://homes.cs.washington.edu/~yejin/

YejinChoinka

Recent Activity

authored a paper 20 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 21 days ago • 211

upvoted a paper 4 months ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 143

upvoted a paper 6 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20, 2025 • 85

upvoted a paper 8 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143

authored 3 papers 9 months ago

authored 2 papers 10 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9, 2025 • 77

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7, 2025 • 110

authored a paper 12 months ago

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published Feb 3, 2025 • 21

authored 2 papers about 1 year ago

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

Paper • 2501.08292 • Published Jan 14, 2025 • 17

Negative Token Merging: Image-based Adversarial Feature Guidance

Paper • 2412.01339 • Published Dec 2, 2024 • 22

authored 2 papers over 1 year ago

WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild

Paper • 2409.03753 • Published Sep 5, 2024 • 19

StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements

Paper • 2408.15666 • Published Aug 28, 2024 • 12

upvoted a paper over 1 year ago

StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements

Paper • 2408.15666 • Published Aug 28, 2024 • 12

authored 5 papers over 1 year ago

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

Paper • 2407.16607 • Published Jul 23, 2024 • 23

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Paper • 2406.18510 • Published Jun 26, 2024 • 10

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Paper • 2406.11069 • Published Jun 16, 2024 • 14

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Paper • 2406.04770 • Published Jun 7, 2024 • 28

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2, 2024 • 64
