rin2401
AI & ML interests
None yet
Recent Activity
updated a collection about 7 hours ago: Safety
liked a model 1 day ago: unicorn-team/Unicorn-VL-R3
updated a model 13 days ago: unicorn-team/Unicorn-R3
Organizations
Safety
Think
Agent
Tokenizer
DPO
Fewshot
Aya
- Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress (Paper • 2408.14960)
- RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs (Paper • 2407.02552)
- The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm (Paper • 2406.18682)
- Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning (Paper • 2410.10801)
Benchmark
PEFT
LLM