Yushi Huang's picture

In a Training Loop 🔄

Yushi Huang

Harahan

·

https://harahan.github.io/

Harahan

AI & ML interests

Efficient AIGC, especially for video/image generation and MLLMs

Organizations

None yet

authored 4 papers 3 months ago

Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention

Paper • 2602.04789 • Published Feb 4 • 3

LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit

Paper • 2405.06001 • Published May 9, 2024

Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing

Paper • 2602.02159 • Published Feb 2 • 1

MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping

Paper • 2511.15690 • Published Nov 19, 2025

submitted 2 papers to Daily Papers 3 months ago

Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing

Paper • 2602.02159 • Published Feb 2 • 1

Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention

Paper • 2602.04789 • Published Feb 4 • 3

authored a paper 3 months ago

Temporal Feature Matters: A Framework for Diffusion Model Quantization

Paper • 2407.19547 • Published Jul 28, 2024

authored 2 papers 4 months ago

SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning

Paper • 2508.06447 • Published Aug 8, 2025

LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation

Paper • 2510.08318 • Published Oct 9, 2025

authored a paper 8 months ago

LLMC+: Benchmarking Vision-Language Model Compression with a Plug-and-play Toolkit

Paper • 2508.09981 • Published Aug 13, 2025 • 2

authored a paper 12 months ago

QVGen: Pushing the Limit of Quantized Video Generative Models

Paper • 2505.11497 • Published May 16, 2025 • 4

authored a paper over 1 year ago

HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration

Paper • 2410.01723 • Published Oct 2, 2024 • 4

authored a paper over 2 years ago

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models

Paper • 2311.16503 • Published Nov 27, 2023