Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
2
1
ZHIYI LYU
ZHIYII
Follow
dark-pen's profile picture
JuneJX's profile picture
2 followers
·
1 following
AI & ML interests
reinforment learning, LLM
Recent Activity
updated
a dataset
about 2 months ago
ZHIYII/Postgres_Entropy_Action_SFT_swift_trace
updated
a dataset
about 2 months ago
ZHIYII/Notion_Entropy_Action_SFT_swift_trace_original_task
published
a dataset
about 2 months ago
ZHIYII/Notion_Entropy_Action_SFT_swift_trace_original_task
View all activity
Organizations
None yet
ZHIYII
's models
23
Sort: Recently updated
ZHIYII/rejection_sampling_sft_notion_20260428_0347
Updated
Apr 28
ZHIYII/rejection_sampling_sft_notion_20260428_0337
Updated
Apr 28
ZHIYII/baseine_sft_notion_20260428_0327
Updated
Apr 28
ZHIYII/baseine_sft_notion_20260428_0316
Updated
Apr 28
ZHIYII/baseine_sft_postgres_20260427_0254
Updated
Apr 27
ZHIYII/rejection_sampling_sft_notion_20260426_0650
Updated
Apr 26
ZHIYII/rejection_sampling_sft_postgres_20260425_1715
Updated
Apr 26
ZHIYII/Github_Qwen3_32B_20260417_0857
Updated
Apr 17
ZHIYII/Github_Qwen3_32B_20260416_0914
Updated
Apr 16
ZHIYII/Github_Qwen3_32B_20260415_1340
Updated
Apr 15
ZHIYII/Github_Ablation_llm_scoring_credit_swift_20260407_0223
Updated
Apr 7
ZHIYII/Github_Ablation_llm_scoring_credit_swift_20260407_0222
Updated
Apr 7
ZHIYII/Github_Ablation_infopo_credit_swift_20260406_1551
Updated
Apr 6
ZHIYII/Github_Ablation_infopo_credit_swift_20260406_1548
Updated
Apr 6
ZHIYII/Github_Ablation_infopo_credit_swift_20260406_1316
Updated
Apr 6
ZHIYII/Github_Qwen3_32B_20260404_0850
Updated
Apr 4
ZHIYII/Github_Ablation_llm_scoring_credit_SFT
677k
•
Updated
Apr 3
•
2
ZHIYII/llm_filter_sft_github
677k
•
Updated
Mar 28
•
2
ZHIYII/mcp_github_uniform_ablation
33B
•
Updated
Mar 21
•
2
ZHIYII/Github_Qwen3_32B_uniform_sft_20260321_0322
33B
•
Updated
Mar 21
•
2
ZHIYII/19500_RRM
Text Classification
•
7B
•
Updated
Aug 26, 2025
•
3
•
1
ZHIYII/Revision-Reward-Model
Updated
May 13, 2025
ZHIYII/BT_Qwen2.5-7B_Base
Updated
Mar 7, 2025