Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
Jiahao Zhang
zjhhhh
Follow
MisDrifter's profile picture
1 follower
·
4 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 1 month ago
zjhhhh/rltf-feedback-distillation-toy-50q-4r
published
a dataset
about 1 month ago
zjhhhh/rltf-feedback-distillation-toy-50q-4r
updated
a model
3 months ago
zjhhhh/rltf-dapo-sd-smoke-gs31
View all activity
Organizations
None yet
zjhhhh
's models
608
Sort: Recently updated
zjhhhh/3b_rerun_rlcf_expand_eta_1e4_step_301
Text Generation
•
3B
•
Updated
Dec 5, 2025
•
1
zjhhhh/3b_rerun_rlcf_expand_eta_1e4_step_201
Text Generation
•
3B
•
Updated
Dec 5, 2025
•
1
zjhhhh/3b_rerun_rlcf_expand_eta_1e4_step_101
Text Generation
•
3B
•
Updated
Dec 5, 2025
•
1
zjhhhh/3b_rerun_rlcf_expand_eta_1e4_step_1
Text Generation
•
3B
•
Updated
Dec 5, 2025
•
2
zjhhhh/Llama-3.2-3B-Instruct_multi_armo_2reward_SFT
3B
•
Updated
Dec 5, 2025
•
1
zjhhhh/bon_1e3_reward0_step_521_final
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
3
zjhhhh/bon_1e3_reward1_step_521_final
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward0_step_501
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
1
zjhhhh/bon_1e3_reward1_step_501
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward0_step_401
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward1_step_401
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward0_step_301
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward1_step_301
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
1
zjhhhh/bon_1e3_reward0_step_201
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward1_step_201
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward0_step_101
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward1_step_101
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward0_step_1
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_reward1_step_1
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
1
zjhhhh/bon_1e5_step_521_final
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
3
zjhhhh/bon_1e3_step_521_final
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e4_step_521_final
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
1
zjhhhh/bon_1e5_step_501
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_step_501
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
3
zjhhhh/bon_1e4_step_501
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e5_step_401
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
3
zjhhhh/bon_1e3_step_401
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
1
zjhhhh/bon_1e4_step_401
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
3
zjhhhh/bon_1e5_step_301
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
2
zjhhhh/bon_1e3_step_301
Text Generation
•
3B
•
Updated
Dec 4, 2025
•
3
Previous
1
...
3
4
5
6
7
...
21
Next