ZD Lab @ UNC-Chapel Hill

university

https://www.zhundeng.org

AI & ML interests

AI Safety & Reliability, Responsible Computing, and Efficient ML

Recent Activity

Czardas updated a dataset 12 days ago

ZDCSlab/proofatlas-enriched

Czardas published a dataset 13 days ago

ZDCSlab/proofatlas-enriched

Czardas submitted a paper 4 months ago

Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions

Papers

Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions

Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges

updated a dataset 12 days ago

ZDCSlab/proofatlas-enriched

Viewer • Updated 12 days ago • 1.76M • 108

published a dataset 13 days ago

ZDCSlab/proofatlas-enriched

Viewer • Updated 12 days ago • 1.76M • 108

submitted 2 papers to Daily Papers 4 months ago

Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions

Paper • 2602.14279 • Published Feb 15 • 1

Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges

Paper • 2602.13576 • Published Feb 14 • 2

updated a dataset 5 months ago

ZDCSlab/ripd-dataset

Preview • Updated Feb 22 • 32

updated 8 models 5 months ago

ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-biased-bt

Text Generation • Updated Feb 22 • 6

ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-seed-bt

Text Generation • Updated Feb 22 • 8

ZDCSlab/ripd-ultra-real-gemma2-2b-it-biased-bt

Text Generation • 3B • Updated Feb 22 • 7

ZDCSlab/ripd-ultra-real-gemma2-2b-it-seed-bt

Text Generation • 3B • Updated Feb 22 • 4

ZDCSlab/ripd-ultra-real-llama3-8b-instruct-seed-bt

Text Generation • Updated Feb 22 • 8

ZDCSlab/ripd-ultra-real-llama3-8b-instruct-biased-bt

Text Generation • Updated Feb 22 • 5

ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt

Text Generation • 3B • Updated Feb 21 • 17

ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt

Text Generation • 3B • Updated Feb 21 • 17

updated a collection 5 months ago

Rubrics as an Attack Surface (RIPD)

This collection releases the official artifacts accompanying “Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges.” • 10 items • Updated Mar 2 • 1