Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Huang
jasonhuang3
Follow
0 followers
·
5 following
AI & ML interests
None yet
Recent Activity
updated
a model
3 minutes ago
jasonhuang3/99-our-42-llama3-2-3b-instruct-lora-28k
updated
a model
about 5 hours ago
jasonhuang3/99-caldpo-dataset-our-65-llama3-2-3b-instruct-lora
updated
a model
about 7 hours ago
jasonhuang3/101-caldpo-dataset-caldpo-llama3-2-3b-instruct-lora
View all activity
Organizations
None yet
jasonhuang3
's models
371
Sort: Recently updated
jasonhuang3/Pro6000-dpop_our_13_6-qwen-2-5-7b-math_lora_28k
Updated
Oct 4, 2025
jasonhuang3/Pro6000-dpop_our_16-qwen-2-5-7b-math_lora_28k
Updated
Oct 3, 2025
jasonhuang3/99-caldpo-dataset-dpop-our-13-5-zephyr-7b-sft-full-merged-28k
7B
•
Updated
Oct 3, 2025
jasonhuang3/99-caldpo-dataset-dpop-our-13-5-zephyr-7b-sft-full-lora-28k
Updated
Oct 1, 2025
jasonhuang3/Pro6000-dpop_our_13_5-qwen-2-5-7b-math_lora_28k
Updated
Oct 1, 2025
jasonhuang3/Pro6000-dpop_our_13_5-qwen-2-5-7b-math_merged_28k
8B
•
Updated
Oct 1, 2025
jasonhuang3/99-bpo-qwen-2-5-7b-math-merged-new-prompt-0927
8B
•
Updated
Sep 30, 2025
jasonhuang3/Pro6000-dpop_our_13_4-qwen-2-5-7b-math_merged_28k
8B
•
Updated
Sep 30, 2025
jasonhuang3/Pro6000-dpop_our_13_4-qwen-2-5-7b-math_lora_28k
Updated
Sep 30, 2025
jasonhuang3/Pro6000-caldpo-dataset-dpop-our-13-2-zephyr-7b-sft-full-lora-28k
Updated
Sep 29, 2025
jasonhuang3/Pro6000-dpop_our_13_3-qwen-2-5-7b-math_merged_28k
8B
•
Updated
Sep 29, 2025
jasonhuang3/99-bpo-qwen-2-5-7b-math-lora-new-prompt-0927
Updated
Sep 29, 2025
jasonhuang3/Pro6000-dpop_our_13_3-qwen-2-5-7b-math_lora_28k
Updated
Sep 28, 2025
jasonhuang3/Pro6000-dpop_our_13_2-qwen-2-5-7b-math_merged_28k
8B
•
Updated
Sep 27, 2025
jasonhuang3/Pro6000-dpop_our_13_2-qwen-2-5-7b-math_lora_28k
Updated
Sep 27, 2025
jasonhuang3/Pro6000-llama3-2-1b-instruct-dpo-merged-28k
1B
•
Updated
Sep 27, 2025
jasonhuang3/Pro6000-llama3-2-1b-instruct-dpop-our-13-merged-28k
1B
•
Updated
Sep 27, 2025
jasonhuang3/Pro6000-llama3-2-1b-instruct-dpo-lora-28k
Updated
Sep 26, 2025
jasonhuang3/Pro6000-llama3-2-1b-instruct-dpop-our-13-lora-28k
Updated
Sep 26, 2025
jasonhuang3/Pro6000-bpo-qwen-2-5-7b-math-balance-hinge-alpha-0.3-lora-28k-0926
Updated
Sep 26, 2025
jasonhuang3/Pro6000-llama3-1-8b-instruct-dpop-our-13-merged-28k
8B
•
Updated
Sep 26, 2025
jasonhuang3/Pro6000-llama3-1-8b-instruct-dpo-merged-28k
8B
•
Updated
Sep 26, 2025
jasonhuang3/Pro6000-llama3-1-8b-instruct-dpop-our-13-lora-28k
Updated
Sep 25, 2025
jasonhuang3/Pro6000-llama3-1-8b-instruct-dpo-lora-28k
Updated
Sep 25, 2025
jasonhuang3/Pro6000-llama3-1-8b-instruct-dpo-old-prompt-lora-28k
Updated
Sep 24, 2025
jasonhuang3/Pro6000-qwen-2-5-7b-math-dpo_old_prompt_merged_28k
8B
•
Updated
Sep 24, 2025
jasonhuang3/Pro6000-qwen-2-5-7b-math-dpo_old_prompt_lora_28k
Updated
Sep 24, 2025
jasonhuang3/Pro6000-dpop-old-prompt-qwen-2-5-7b-math_merged_28k
8B
•
Updated
Sep 23, 2025
jasonhuang3/Pro6000-dpop-old-prompt-qwen-2-5-7b-math_lora_28k
Updated
Sep 23, 2025
jasonhuang3/Pro6000-dpop_our_13_old_prompt-qwen-2-5-7b-math_merged_28k
8B
•
Updated
Sep 22, 2025
Previous
1
...
6
7
8
9
10
...
13
Next