Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
s
august66
Follow
mamba413's profile picture
Kyleyee's profile picture
callmespring's profile picture
3 followers
·
2 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
4 days ago
august66/hh_removed_prompts_generations_clip_v2_dpo
published
a dataset
4 days ago
august66/hh_removed_prompts_generations_clip_v2_dpo
updated
a model
5 days ago
august66/hh_qwen1.5_IS_CLIP_small_clip_v2
View all activity
Organizations
august66
's datasets
45
Sort: Recently updated
august66/hh_removed_prompts_generations_clip_v2_dpo
Viewer
•
Updated
4 days ago
•
119
•
11
august66/hh_removed_prompts_generations_Is_ref
Viewer
•
Updated
5 days ago
•
119
•
8
august66/hh_removed_prompts_generations_Is_dpo
Viewer
•
Updated
5 days ago
•
119
•
10
august66/hh_removed_prompts_generations_DM_dpo
Viewer
•
Updated
5 days ago
•
119
•
13
august66/hh_removed_prompts_generations_gated_dpo
Viewer
•
Updated
5 days ago
•
119
•
9
august66/hh_removed_prompts_generations
Viewer
•
Updated
5 days ago
•
119
•
8
august66/ultrafeedback_helpful_base
Viewer
•
Updated
5 days ago
•
62k
•
15
august66/hh_helpfulness_mc_rewards_DPO
Viewer
•
Updated
6 days ago
•
46.1k
•
22
august66/hh_helpfulness_qwen2.5_1.5b_generation_stochastic
Viewer
•
Updated
9 days ago
•
46.1k
•
7
august66/hh_helpfulness_mc_rewards_IS_clip
Viewer
•
Updated
9 days ago
•
46.1k
•
11
august66/hh_helpfulness_qwen2.5_1.5b_generation_dpo
Viewer
•
Updated
10 days ago
•
46.1k
•
21
august66/hh_helpfulness_qwen2.5_1.5b_generation
Viewer
•
Updated
16 days ago
•
46.1k
•
60
august66/hh_helpfulness_mc_rewards
Viewer
•
Updated
16 days ago
•
46.1k
•
17
august66/hh_helpfulness_drpo_from_sft
Viewer
•
Updated
23 days ago
•
46.1k
•
520
august66/hh_helpful_base
Viewer
•
Updated
28 days ago
•
46.1k
•
236
august66/hh_harmless_base
Viewer
•
Updated
30 days ago
•
44.8k
•
17
august66/drpo_hh_qwen2.5_1.5b_with_ref_prob_vllm_conv
Viewer
•
Updated
Feb 10
•
43.8k
•
9
august66/drpo_hh_qwen2.5_1.5b_with_ref_prob_vllm
Viewer
•
Updated
Feb 10
•
43.8k
•
9
august66/drpo_hh_qwen2.5_1.5b_with_ref_prob_sampled
Viewer
•
Updated
Feb 9
•
48.8k
•
7
august66/drpo_hh_qwen2.5_1.5b_with_ref_btpref
Viewer
•
Updated
Oct 8, 2025
•
48.8k
•
4
august66/hh_qwen2.5_1.5b_with_bias_bt_pref
Viewer
•
Updated
Oct 2, 2025
•
18k
•
8
august66/hh_qwen2.5_1.5b_with_bias
Viewer
•
Updated
Sep 27, 2025
•
18k
•
5
august66/drpo_hh_qwen2.5_1.5b
Viewer
•
Updated
Sep 8, 2025
•
43.8k
•
5
august66/dpo_reward_dist_pi_theta_prompt_3
Viewer
•
Updated
Sep 3, 2025
•
5k
•
6
august66/dpo_reward_dist_pi_theta_prompt_2
Viewer
•
Updated
Sep 3, 2025
•
5k
•
6
august66/dpo_reward_dist_pi_theta
Viewer
•
Updated
Aug 23, 2025
•
5k
•
6
august66/reward_distribution_2_tldr_openassist_pi_ref
Viewer
•
Updated
Aug 4, 2025
•
5k
•
5
august66/reward_distribution_2_tldr_openassist_pi_theta
Viewer
•
Updated
Aug 4, 2025
•
5k
•
5
august66/reward_distribution_tldr_openassist_pi_theta
Viewer
•
Updated
Jul 30, 2025
•
5k
•
7
august66/reward_distribution_tldr_openassist_pi_ref
Viewer
•
Updated
Jul 30, 2025
•
5k
•
5
Previous
1
2
Next