Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
9
2
Jiahe Jin
zizi-0123
Follow
henryhe0123's profile picture
jzguo's profile picture
2 followers
·
3 following
zizi0123
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
daVinci-Dev: Agent-native Mid-training for Software Engineering
updated
a model
20 days ago
zizi-0123/mhqa_llama_grpo
updated
a model
20 days ago
zizi-0123/web_llama_sft_correct
View all activity
Organizations
None yet
zizi-0123
's models
35
Sort: Recently updated
zizi-0123/mhqa_llama_grpo
Updated
20 days ago
zizi-0123/web_llama_sft_correct
Text Generation
•
3B
•
Updated
20 days ago
•
3
zizi-0123/web_llama_sft_correct_grpo
Updated
20 days ago
zizi-0123/mhqa_llama_sft_behavior
Text Generation
•
3B
•
Updated
20 days ago
•
4
zizi-0123/mhqa_llama_sft_behavior_grpo
Updated
20 days ago
zizi-0123/OLMo2-1B-midtrain-run1
1B
•
Updated
Dec 15, 2025
zizi-0123/mhqa_llama_sft_random_grpo
Updated
Nov 16, 2025
zizi-0123/mhqa_llama_sft_correct_grpo
Updated
Nov 16, 2025
zizi-0123/web_qwen_sft_singlebehavior_grpo
Updated
Nov 7, 2025
zizi-0123/web_llama_sft_random_grpo
Updated
Nov 7, 2025
zizi-0123/mhqa_qwen_sft_random_grpo
Updated
Nov 7, 2025
zizi-0123/mhqa_qwen_sft_correct_grpo
Updated
Nov 7, 2025
zizi-0123/mhqa_qwen_sft_random
Text Generation
•
2B
•
Updated
Nov 7, 2025
•
1
zizi-0123/mhqa_qwen_sft_behavior_grpo
Updated
Nov 7, 2025
zizi-0123/mhqa_qwen_sft_correct
Text Generation
•
2B
•
Updated
Nov 7, 2025
•
1
zizi-0123/mhqa_qwen_sft_behavior
Text Generation
•
2B
•
Updated
Nov 7, 2025
•
2
zizi-0123/web_llama_sft_random
Text Generation
•
3B
•
Updated
Nov 7, 2025
•
2
zizi-0123/web_llama_grpo
Updated
Nov 1, 2025
zizi-0123/web_llama_sft_behavior_grpo
Updated
Nov 1, 2025
zizi-0123/mhqa_qwen_grpo
Updated
Oct 21, 2025
zizi-0123/web_qwen_grpo
Updated
Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_grpo
Updated
Oct 1, 2025
zizi-0123/web_qwen_sft_correct_grpo
Updated
Oct 1, 2025
zizi-0123/web_qwen_sft_nobehavior_grpo
Updated
Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_incorrect_grpo
Updated
Oct 1, 2025
zizi-0123/web_qwen_sft_random_grpo
Updated
Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_correct_grpo
Updated
Oct 1, 2025
zizi-0123/web_qwen_sft_behavior_2k_grpo
Updated
Oct 1, 2025
zizi-0123/web_qwen_sft_random
Text Generation
•
2B
•
Updated
Oct 1, 2025
•
1
zizi-0123/web_qwen_sft_behavior_correct
Text Generation
•
2B
•
Updated
Oct 1, 2025
•
1
Previous
1
2
Next