Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Jesse Zhang's picture

Jesse Zhang

Nagi-ovo

xinjican's profile picture

John6666's profile picture

optimusHimself's profile picture

·

Nagi-ovo

AI & ML interests

Humanoids & RL

Organizations

None yet

Nagi-ovo 's collections 2

Nagi-ovo/DeepSeek-V3.1-Math-RL-G16-LoRA

Updated Jan 31
Nagi-ovo/Qwen3-235B-A22B-Instruct-MATH-RL-LoRA

Updated Jan 31
Nagi-ovo/Qwen2.5-7B-Reasoning-Adapter

Text Generation • Updated Feb 8, 2025 • 6
Nagi-ovo/Llama-3-8B-PPO

Text Generation • 8B • Updated Jan 21, 2025 • 2

Llama-3-8B-RLHF-Pipeline

Nagi-ovo/Llama-3-8B-SFT-RuoZhiBa

Text Generation • 8B • Updated Jan 7, 2025 • 5
Nagi-ovo/Llama-3-8B-DPO

Text Generation • 8B • Updated Jan 6, 2025 • 6
Nagi-ovo/Llama-3-8B-RM

Text Classification • 8B • Updated Jan 6, 2025 • 5 • 2
Nagi-ovo/Llama-3-8B-PPO

Text Generation • 8B • Updated Jan 21, 2025 • 2

Nagi-ovo/DeepSeek-V3.1-Math-RL-G16-LoRA

Updated Jan 31
Nagi-ovo/Qwen3-235B-A22B-Instruct-MATH-RL-LoRA

Updated Jan 31
Nagi-ovo/Qwen2.5-7B-Reasoning-Adapter

Text Generation • Updated Feb 8, 2025 • 6
Nagi-ovo/Llama-3-8B-PPO

Text Generation • 8B • Updated Jan 21, 2025 • 2

Llama-3-8B-RLHF-Pipeline

Nagi-ovo/Llama-3-8B-SFT-RuoZhiBa

Text Generation • 8B • Updated Jan 7, 2025 • 5
Nagi-ovo/Llama-3-8B-DPO

Text Generation • 8B • Updated Jan 6, 2025 • 6
Nagi-ovo/Llama-3-8B-RM

Text Classification • 8B • Updated Jan 6, 2025 • 5 • 2
Nagi-ovo/Llama-3-8B-PPO

Text Generation • 8B • Updated Jan 21, 2025 • 2

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs