Xuandong Zhao's picture

Xuandong Zhao

Xuandong

·

https://xuandongzhao.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

Self-Sovereign Agent

submitted a paper 21 days ago

Self-Sovereign Agent

new activity 2 months ago

sunblaze-ucb/Qwen2.5-7B-Intuitor-MATH-1EPOCH:Improve model card with paper summary and code links

View all activity

Organizations

upvoted a paper 21 days ago

Self-Sovereign Agent

Paper • 2604.08551 • Published Mar 4 • 5

submitted a paper to Daily Papers 21 days ago

Self-Sovereign Agent

Paper • 2604.08551 • Published Mar 4 • 5

New activity in sunblaze-ucb/Qwen2.5-7B-Intuitor-MATH-1EPOCH 2 months ago

Improve model card with paper summary and code links

#1 opened 2 months ago by

New activity in sunblaze-ucb/Llama-3.2-3B-Instruct-GRPO-MATH-1EPOCH 2 months ago

Improve model card: add library_name and links to paper/code

#1 opened 2 months ago by

New activity in sunblaze-ucb/Llama-3.2-3B-Instruct-Intuitor-MATH-1EPOCH 2 months ago

Improve model card: add library_name, GitHub link and method description

#1 opened 2 months ago by

New activity in sunblaze-ucb/OLMo-2-7B-SFT-Intuitor-MATH-1EPOCH-SYSP 2 months ago

Improve model card: add library_name, GitHub link, and paper reference

#1 opened 2 months ago by

New activity in sunblaze-ucb/OLMo-2-7B-SFT-GRPO-MATH-1EPOCH-SYSP 2 months ago

Improve model card metadata and add paper/code links

#1 opened 2 months ago by

New activity in sunblaze-ucb/Qwen2.5-14B-Intuitor-MATH-1EPOCH 2 months ago

Improve model card: add library_name, paper/code links, and correct description

#1 opened 2 months ago by

upvoted a paper 3 months ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published Feb 13 • 60

authored 2 papers 3 months ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published Feb 13 • 60

Clipping-Free Policy Optimization for Large Language Models

Paper • 2601.22801 • Published Jan 30 • 3

submitted a paper to Daily Papers 3 months ago

Clipping-Free Policy Optimization for Large Language Models

Paper • 2601.22801 • Published Jan 30 • 3

authored 8 papers 3 months ago

DE-COP: Detecting Copyrighted Content in Language Models Training Data

Paper • 2402.09910 • Published Feb 15, 2024 • 1

An undetectable watermark for generative image models

Paper • 2410.07369 • Published Oct 9, 2024

A Practical Examination of AI-Generated Text Detectors for Large Language Models

Paper • 2412.05139 • Published Dec 6, 2024

The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1

Paper • 2502.12659 • Published Feb 18, 2025 • 7

DIS-CO: Discovering Copyrighted Content in VLMs Training Data

Paper • 2502.17358 • Published Feb 24, 2025 • 1

Evaluating Durability: Benchmark Insights into Multimodal Watermarking

Paper • 2406.03728 • Published Jun 6, 2024

Improving LLM Safety Alignment with Dual-Objective Optimization

Paper • 2503.03710 • Published Mar 5, 2025 • 1

MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models

Paper • 2503.14827 • Published Mar 19, 2025