-
Taming LLMs by Scaling Learning Rates with Gradient Grouping
Paper • 2506.01049 • Published • 38 -
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Paper • 2402.09240 • Published • 5 -
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Paper • 2410.06373 • Published • 36 -
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning
Paper • 2209.04851 • Published • 3
Juanxi Tian
Juanxi
AI & ML interests
Efficient AI & Gen AI
Recent Activity
upvoted
a
paper
about 20 hours ago
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
upvoted
a
paper
about 20 hours ago
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
upvoted
a
paper
about 20 hours ago
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders