Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Open to Work
1
Peijia Qin
t2ance
Follow
OliverQinyy's profile picture
1 follower
·
3 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 hours ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-SFT-K8s
published
a model
about 2 hours ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-SFT-K8s
updated
a model
about 3 hours ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT
View all activity
Organizations
None yet
t2ance
's models
40
Sort: Recently updated
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-SFT-K8s
Updated
42 minutes ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT
Updated
about 2 hours ago
t2ance/CodeRM-SFT-Warmup-Selection-8B-Merged
8B
•
Updated
about 3 hours ago
t2ance/CodeRM-SFT-Warmup-Selection-8B
Text Generation
•
Updated
about 4 hours ago
t2ance/CodeRM-SFT-Warmup-Selection-4B
Text Generation
•
Updated
about 5 hours ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain-SmallMeta
Updated
about 16 hours ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain
Updated
1 day ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain
Updated
2 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Heuristic
Updated
3 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Baseline
Updated
4 days ago
t2ance/mle-playbooks
Updated
4 days ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Baseline
Updated
7 days ago
t2ance/CodeRM-OnlineGRPO-Selection-2B-Domain
Updated
12 days ago
t2ance/CodeRM-DPO-Selection-Domain-2-7B-Hard-Betty-Test
Updated
21 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Instance-Net
Updated
Jan 30
t2ance/CodeRM-KTO-Selection-Instance-Table-2-14B-Hard
Updated
Jan 29
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Domain-BCB
Updated
Jan 28
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Instance-Net
Updated
Jan 28
t2ance/SFT-Warmup-1.7B-BCB
2B
•
Updated
Jan 28
•
4
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Instance-Table
Updated
Jan 28
t2ance/CodeRM-DPO-Selection-Instance-Table-2-14B-Hard
Updated
Jan 27
t2ance/CodeRM-ORPO-Selection-Domain-2-14B-Hard
Updated
Jan 27
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Baseline-BCB
Updated
Jan 27
t2ance/BCB-CodeRM-OnlineGRPO-Selection-3B-Domain
Updated
Jan 27
t2ance/SFT-Warmup-1.7B
2B
•
Updated
Jan 26
•
907
t2ance/CodeRM-ORPO-Selection-Instance-Table-2-14B-Hard
Text Generation
•
Updated
Jan 26
•
1
t2ance/CodeRM-DPO-Selection-Heuristic-2-14B-Hard
Updated
Jan 25
t2ance/CodeRM-OnlineGRPO-Selection-7B-Instance-Net
Updated
Jan 25
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Domain
Text Generation
•
Updated
Jan 25
•
8
t2ance/BCB-CodeRM-OfflineGRPO-Selection-Baseline-2-7B-Hard
Updated
Jan 24
Previous
1
2
Next