Jiahe Jin
zizi-0123
2 followers · 3 following
zizi0123
AI & ML interests
None yet
Recent Activity
Upvoted a paper 29 days ago: Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training
Upvoted a paper about 2 months ago: daVinci-Dev: Agent-native Mid-training for Software Engineering
Updated a model 2 months ago: zizi-0123/mhqa_llama_grpo
Organizations
None yet
zizi-0123's models (35)
Sort: Recently updated
zizi-0123/mhqa_llama_grpo • Updated Jan 8
zizi-0123/web_llama_sft_correct • Text Generation • 3B • Updated Jan 8
zizi-0123/web_llama_sft_correct_grpo • Updated Jan 8
zizi-0123/mhqa_llama_sft_behavior • Text Generation • 3B • Updated Jan 8 • 1
zizi-0123/mhqa_llama_sft_behavior_grpo • Updated Jan 8
zizi-0123/OLMo2-1B-midtrain-run1 • 1B • Updated Dec 15, 2025 • 2
zizi-0123/mhqa_llama_sft_random_grpo • Updated Nov 16, 2025
zizi-0123/mhqa_llama_sft_correct_grpo • Updated Nov 16, 2025
zizi-0123/web_qwen_sft_singlebehavior_grpo • Updated Nov 7, 2025
zizi-0123/web_llama_sft_random_grpo • Updated Nov 7, 2025
zizi-0123/mhqa_qwen_sft_random_grpo • Updated Nov 7, 2025
zizi-0123/mhqa_qwen_sft_correct_grpo • Updated Nov 7, 2025
zizi-0123/mhqa_qwen_sft_random • Text Generation • 2B • Updated Nov 7, 2025
zizi-0123/mhqa_qwen_sft_behavior_grpo • Updated Nov 7, 2025
zizi-0123/mhqa_qwen_sft_correct • Text Generation • 2B • Updated Nov 7, 2025
zizi-0123/mhqa_qwen_sft_behavior • Text Generation • 2B • Updated Nov 7, 2025
zizi-0123/web_llama_sft_random • Text Generation • 3B • Updated Nov 7, 2025 • 1
zizi-0123/web_llama_grpo • Updated Nov 1, 2025
zizi-0123/web_llama_sft_behavior_grpo • Updated Nov 1, 2025
zizi-0123/mhqa_qwen_grpo • Updated Oct 21, 2025
zizi-0123/web_qwen_grpo • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_grpo • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_correct_grpo • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_nobehavior_grpo • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_incorrect_grpo • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_random_grpo • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_correct_grpo • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_2k_grpo • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_random • Text Generation • 2B • Updated Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_correct • Text Generation • 2B • Updated Oct 1, 2025