Haiyue Song

shyyhs

https://shyyhs.github.io/

AI & ML interests

LLM post-training, deepsearch agent, RL for LLM, structural document machine translation, machine translation, subword

Recent Activity

authored a paper 14 days ago

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

authored a paper 14 days ago

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

authored a paper 14 days ago

When Alignment Hurts: Decoupling Representational Spaces in Multilingual Models

Organizations

None yet

upvoted a paper about 2 months ago

Hallucinations Undermine Trust; Metacognition is a Way Forward

Paper • 2605.01428 • Published May 2 • 24

upvoted a paper 2 months ago

LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards

Paper • 2603.02146 • Published Mar 2 • 1

upvoted a paper 3 months ago

OptiMer: Optimal Distribution Vector Merging Is Better than Data Mixing for Continual Pre-Training

Paper • 2603.28858 • Published Mar 30 • 9

upvoted 2 papers 7 months ago

Structured Document Translation via Format Reinforcement Learning

Paper • 2512.05100 • Published Dec 4, 2025 • 2

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 129

upvoted an article about 1 year ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • ariG23498, merve, pcuenq, reach-vb • Mar 12, 2025 • 497

upvoted a paper about 1 year ago

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published Mar 10, 2025 • 48

upvoted an article almost 2 years ago

Welcome Gemma 2 - Google’s new open LLM

Article • philschmid, osanseviero, pcuenq, lewtun, tomaarsen, reach-vb • Jun 27, 2024 • 132
