Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
weiwei yang's picture
1 2

weiwei yang

weiweiyang
·
  • weiweiy

AI & ML interests

None yet

Recent Activity

upvoted a collection about 21 hours ago
GridSFM
authored a paper 2 days ago
Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
authored a paper 2 days ago
Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling
View all activity

Organizations

Microsoft's profile picture LLM Efficiency Challenge's profile picture AIHackerCup's profile picture HackerCupAI's profile picture GridSage's profile picture

authored 4 papers 2 days ago

Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities

Paper • 2410.18469 • Published Oct 24, 2024 • 1

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

Paper • 2601.22636 • Published Jan 30 • 22

SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

Paper • 2602.06854 • Published Feb 6 • 6

Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models

Paper • 2312.09601 • Published Dec 15, 2023
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs