34 27 1

Yuxian Gu

t1101675

https://t1101675.github.io/

AI & ML interests

Efficient methods for language models

Recent Activity

upvoted a paper 30 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

upvoted a paper about 2 months ago

Online Experiential Learning for Language Models

upvoted a paper 3 months ago

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

View all activity

Organizations

upvoted a paper 30 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published Apr 6 • 112

upvoted a paper about 2 months ago

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published Mar 17 • 59

upvoted 3 papers 3 months ago

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published Jan 20 • 23

Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 87

upvoted an article 4 months ago

Article

Differential Transformer V2

Jan 20

•

upvoted a paper 4 months ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 324

upvoted a paper 5 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

updated a Space 6 months ago

Trackio

🚀

Display tracking information

published a Space 6 months ago

Trackio

🚀

Display tracking information

upvoted 3 papers 7 months ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 189

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

Paper • 2509.25180 • Published Sep 29, 2025 • 10

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

Paper • 2509.25182 • Published Sep 29, 2025 • 39

published 3 datasets 7 months ago

updated a dataset 7 months ago

jet-ai/math_qa

Viewer • Updated Sep 25, 2025 • 37.3k • 410 • 2

updated a dataset 8 months ago

jet-ai/social_i_qa

Viewer • Updated Sep 24, 2025 • 35.4k • 470

New activity in allenai/social_i_qa 8 months ago

Convert dataset to Parquet

#4 opened 9 months ago by

SaylorTwift

updated a model 8 months ago

jet-ai/Jet-Nemotron-2B

Text Generation • Updated Sep 28, 2025 • 1.36k • 17

Yuxian Gu

AI & ML interests

Recent Activity

Organizations

t1101675's activity

Differential Transformer V2

Trackio

Trackio

Convert dataset to Parquet