TianlaiChen 's Collections papers
updated
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through
Two-Stage Rule-Based RL
Paper
• 2503.07536
• Published
• 88
Seedream 2.0: A Native Chinese-English Bilingual Image Generation
Foundation Model
Paper
• 2503.07703
• Published
• 37
Gemini Embedding: Generalizable Embeddings from Gemini
Paper
• 2503.07891
• Published
• 46
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Paper
• 2503.07572
• Published
• 48
Implicit Reasoning in Transformers is Reasoning through Shortcuts
Paper
• 2503.07604
• Published
• 23
Beyond Decoder-only: Large Language Models Can be Good Encoders for
Machine Translation
Paper
• 2503.06594
• Published
• 6
A Survey of Efficient Reasoning for Large Reasoning Models: Language,
Multimodality, and Beyond
Paper
• 2503.21614
• Published
• 43
Exploring Data Scaling Trends and Effects in Reinforcement Learning from
Human Feedback
Paper
• 2503.22230
• Published
• 45
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion
Transformers
Paper
• 2504.10483
• Published
• 22
Efficient Reasoning Models: A Survey
Paper
• 2504.10903
• Published
• 21
DataDecide: How to Predict Best Pretraining Data with Small Experiments
Paper
• 2504.11393
• Published
• 18
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation
through Pretraining, SFT, and RL
Paper
• 2504.11455
• Published
• 14
InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models
Paper
• 2504.10479
• Published
• 306
Scaling Data-Constrained Language Models
Paper
• 2305.16264
• Published
• 16