Dawei Li's picture

Dawei Li

wjldw

·

https://david-li0406.github.io/

AI & ML interests

LLM, NLP, Data Mining

Organizations

upvoted a paper 2 months ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 82

upvoted 2 papers 4 months ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published Mar 2 • 64

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 356

upvoted 4 papers 5 months ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 49

ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents

Paper • 2601.12294 • Published Jan 18 • 19

RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation

Paper • 2601.08430 • Published Jan 13 • 62

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published Jan 6 • 106

upvoted 2 papers 6 months ago

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Paper • 2601.05808 • Published Jan 9 • 37

Agent-as-a-Judge

Paper • 2601.05111 • Published Jan 8 • 20

upvoted 4 papers 7 months ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 40

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 80

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 85

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 45

upvoted a paper 8 months ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

Paper • 2511.00086 • Published Oct 29, 2025 • 42

upvoted 2 papers 9 months ago

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Paper • 2510.00406 • Published Oct 1, 2025 • 68

Who's Your Judge? On the Detectability of LLM-Generated Judgments

Paper • 2509.25154 • Published Sep 29, 2025 • 30

upvoted 2 papers 10 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 85

MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs

Paper • 2508.18264 • Published Aug 25, 2025 • 25

upvoted 2 papers 11 months ago

Speech-to-LaTeX: New Models and Datasets for Converting Spoken Equations and Sentences

Paper • 2508.03542 • Published Aug 5, 2025 • 7

When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs

Paper • 2508.03365 • Published Aug 5, 2025 • 5