5 17 34

Zhijian Liu

zhijianliu

https://zhijianliu.com

AI & ML interests

Efficient AI

Recent Activity

liked a model 19 days ago

z-lab/Qwen3.6-27B-DFlash

liked a model 20 days ago

z-lab/gemma-4-26B-A4B-it-DFlash

liked a model 20 days ago

z-lab/gemma-4-31B-it-DFlash

View all activity

Organizations

upvoted a collection 2 months ago

SparseLoRA

Collection

Accelerating LLM Fine-Tuning with Contextual Sparsity • 4 items • Updated Mar 11 • 3

upvoted a collection 3 months ago

ParoQuant

Collection

Pairwise Rotation Quantization for Efficient Reasoning LLM Inference • 23 items • Updated 10 days ago • 25

upvoted a paper 4 months ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 81

upvoted a collection 5 months ago

DFlash

Collection

Block Diffusion for Flash Speculative Decoding • 21 items • Updated 16 days ago • 120

upvoted 2 papers 6 months ago

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference

Paper • 2512.01031 • Published Nov 30, 2025 • 27

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Paper • 2511.10645 • Published Nov 13, 2025 • 11

upvoted 2 papers 7 months ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27, 2025 • 181

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 92

upvoted a paper 8 months ago

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30, 2025 • 59

upvoted a paper 10 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161

upvoted a paper 11 months ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Paper • 2506.16500 • Published Jun 19, 2025 • 17

upvoted a paper 12 months ago

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28, 2025 • 46

upvoted a paper about 1 year ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6, 2025 • 96

upvoted a paper over 1 year ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 61

upvoted 3 papers over 2 years ago

Zhijian Liu

AI & ML interests

Recent Activity

Organizations

zhijianliu's activity