Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiarui Yao's picture
2 28 1

Jiarui Yao

FlippyDora
Antigonish's profile picture research4pan's profile picture manh-linh's profile picture
·

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago
CorrectKLinRL/Qwen3-4B-Base-dapo_filter-grpo-noKL
published a model 2 days ago
CorrectKLinRL/Qwen3-4B-Base-dapo_filter-grpo-noKL
updated a model 2 days ago
CorrectKLinRL/Qwen3-1.7B-Base-dapo_filter-grpo-noKL
View all activity

Organizations

University of Illinois at Urbana-Champaign's profile picture RandomSampling's profile picture Embodied Reasoning Agent's profile picture EM-RAFT's profile picture Micro-RM's profile picture era-temporary's profile picture FANS - Formal Answer Selection Using Lean4's profile picture DPO-RM's profile picture CoE - Chain of Experts's profile picture tmp's profile picture PRM-CoT's profile picture UIUC ScaleML Lab's profile picture rb_dev's profile picture CorrectKLinRL's profile picture

FlippyDora 's models 65

FlippyDora/dpo-rm-translate

Updated Nov 17, 2024

FlippyDora/gemma-2b-it_lora_r128_lr5e-4_dpo

Updated Oct 23, 2024

FlippyDora/gemma-2b-it_lora_r32_lr5e-4_dpo

Updated Oct 22, 2024 • 2

FlippyDora/gemma-2b-it_lora_r16_lr5e-4_dpo

Updated Oct 22, 2024 • 1

FlippyDora/gemma-2b-it_lr1e-5_ultrafeedback

3B • Updated Oct 16, 2024 • 1
  • Previous
  • 1
  • 2
  • 3
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs