weiwei yang's picture

1 2

weiwei yang

weiweiyang

·

weiweiy

AI & ML interests

None yet

Recent Activity

upvoted a collection about 21 hours ago

authored a paper 2 days ago

Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities

authored a paper 2 days ago

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

View all activity

Organizations

authored 4 papers 2 days ago

Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities

Paper • 2410.18469 • Published Oct 24, 2024 • 1

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

Paper • 2601.22636 • Published Jan 30 • 22

SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

Paper • 2602.06854 • Published Feb 6 • 6

Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models

Paper • 2312.09601 • Published Dec 15, 2023