Jia-Nan Li

JinaLeejnl

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Improved Large Language Diffusion Models

Paper • 2606.25331 • Published 6 days ago • 41

upvoted a paper 19 days ago

Redesign Mixture-of-Experts Routers with Manifold Power Iteration

Paper • 2606.12397 • Published 20 days ago • 89

upvoted 2 papers 22 days ago

PolarQuant: Leveraging Polar Transformation for Efficient Key Cache Quantization and Decoding Acceleration

Paper • 2502.00527 • Published Feb 1, 2025 • 3

Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

Paper • 2606.07502 • Published 25 days ago • 99

upvoted a paper about 1 month ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published May 20 • 207

upvoted 2 papers 4 months ago

LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model

Paper • 2603.01068 • Published Mar 1 • 22

Spectral Condition for μP under Width-Depth Scaling

Paper • 2603.00541 • Published Feb 28 • 15

upvoted an article 6 months ago

The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs

Article • SII-xrliu • Nov 15, 2025 • 15

upvoted a paper 6 months ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published Dec 29, 2025 • 100

updated a model 6 months ago

GSAI-ML/ReFusion

Text Generation • 8B • Updated Dec 26, 2025 • 233 • 14

published a dataset 6 months ago

GSAI-ML/ReFusion

Viewer • Updated Dec 26, 2025 • 1.83M • 47 • 3

updated a dataset 6 months ago

GSAI-ML/ReFusion

Viewer • Updated Dec 26, 2025 • 1.83M • 47 • 3

updated a model 6 months ago

JinaLeejnl/AlignXplore-7B-Streaming

8B • Updated Dec 26, 2025 • 7

published a model 6 months ago

JinaLeejnl/AlignXplore-7B-Streaming

8B • Updated Dec 26, 2025 • 7

New activity in GSAI-ML/ReFusion 6 months ago

Add `library_name: transformers` to metadata

#2 opened 6 months ago by nielsr

many errors in code

#1 opened 6 months ago by 21world

commented a paper 6 months ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93

authored a paper 7 months ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93

upvoted a paper 7 months ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93

submitted a paper to Daily Papers 7 months ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93
