Yang Ouyang

OriDragon2000

https://oyy2000.github.io/

AI & ML interests

Safety and Efficiency

Recent Activity

liked a dataset 9 days ago

hotpotqa/hotpot_qa

upvoted a paper 2 months ago

The Art of Efficient Reasoning: Data, Reward, and Optimization

liked a model 4 months ago

google/gemma-2-2b-it

Organizations

None yet

upvoted a paper 2 months ago

The Art of Efficient Reasoning: Data, Reward, and Optimization

Paper • 2602.20945 • Published Feb 24 • 7

upvoted a paper 11 months ago

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

Paper • 2501.02629 • Published Jan 5, 2025 • 1

upvoted a paper about 2 years ago

Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models

Paper • 2404.02936 • Published Apr 3, 2024 • 3