Rummy

yang31210999

6 14

AI & ML interests

None yet

Organizations

upvoted a paper 3 months ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 82

upvoted a collection 3 months ago

The Art of Efficient Reasoning

Collection

Project: https://wutaiqiang.github.io/project/Art • 8 items • Updated Mar 18 • 2

upvoted a paper 4 months ago

CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning

Paper • 2603.00889 • Published Mar 1 • 56

upvoted 2 papers 6 months ago

ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection

Paper • 2601.09195 • Published Jan 14 • 15

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published Jan 14 • 128

upvoted a paper 7 months ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published Dec 9, 2025 • 134

upvoted a paper 8 months ago

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22, 2025 • 30

upvoted a paper 9 months ago

Revisiting Model Interpolation for Efficient Reasoning

Paper • 2510.10977 • Published Oct 13, 2025 • 10

upvoted a paper 12 months ago

AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning

Paper • 2507.12841 • Published Jul 17, 2025 • 43

upvoted a paper about 1 year ago

Shadow-FT: Tuning Instruct via Base

Paper • 2505.12716 • Published May 19, 2025 • 4

upvoted a collection about 1 year ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 738

upvoted 2 papers over 1 year ago

Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published Jan 23, 2025 • 54

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 65

upvoted an article almost 2 years ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf

•

Sep 18, 2024

• 281

Rummy

AI & ML interests

Organizations

yang31210999's activity

Fine-tuning LLMs to 1.58bit: extreme quantization made easy