13 26 17

Huiqiang Jiang

iofu728

https://hqjiang.com/

AI & ML interests

None yet

Recent Activity

authored a paper 14 days ago

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

commentedon a paper 14 days ago

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

upvoted a paper 14 days ago

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

View all activity

Organizations

authored a paper 14 days ago

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Paper • 2606.12370 • Published 15 days ago • 21

commented a paper 14 days ago

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Paper • 2606.12370 • Published 15 days ago • 21 •

upvoted a paper 14 days ago

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Paper • 2606.12370 • Published 15 days ago • 21

authored a paper 7 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 107

upvoted a paper 7 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 107

authored a paper about 1 year ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

upvoted a paper about 1 year ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

authored a paper about 1 year ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 29

upvoted a paper about 1 year ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 29

commented a paper about 1 year ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 29 •

liked a model about 1 year ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated Jul 26, 2025 • 759k • • 1.1k

upvoted a paper about 1 year ago

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22, 2025 • 8

commented a paper about 1 year ago

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22, 2025 • 8 •

liked a model over 1 year ago

moonshotai/Moonlight-16B-A3B

Text Generation • 16B • Updated Jan 30 • 62.1k • 113

upvoted a paper over 1 year ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published Jan 28, 2025 • 37

liked a model over 1 year ago

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • 15B • Updated Jan 29, 2025 • 9.7k • • 341

upvoted a paper over 1 year ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23, 2025 • 48

liked 2 models over 1 year ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24, 2025 • 773k • • 1.57k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 7.17M • • 13.4k

upvoted a paper over 1 year ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 290

Huiqiang Jiang

AI & ML interests

Recent Activity

Organizations

iofu728's activity