Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
3
Alex Wa
djdumpling
Follow
0 followers
·
1 following
djdumpling
AI & ML interests
RL, LLMs
Recent Activity
updated
a dataset
3 days ago
djdumpling/helpsteer3-dpo-8k
published
a dataset
3 days ago
djdumpling/helpsteer3-dpo-8k
updated
a dataset
3 days ago
djdumpling/helpsteer3-dpo-under-16k
View all activity
Organizations
models
2
Sort: Recently updated
djdumpling/qwen2.5-14b-socsci210-lora
Updated
Dec 1, 2025
djdumpling/rlhf_transformer
Updated
Jul 23, 2025
•
1
datasets
8
Sort: Recently updated
djdumpling/helpsteer3-dpo-8k
Viewer
•
Updated
3 days ago
•
19.1k
•
47
djdumpling/helpsteer3-dpo-under-16k
Viewer
•
Updated
3 days ago
•
24.3k
•
22
djdumpling/helpsteer3-dpo-style
Viewer
•
Updated
4 days ago
•
25.2k
•
37
djdumpling/megagem_sft
Viewer
•
Updated
15 days ago
•
5.4k
•
14
djdumpling/spatial_reasoning
Viewer
•
Updated
Dec 24, 2025
•
6
•
5
djdumpling/fruit_box_sft_finetuning
Viewer
•
Updated
Nov 19, 2025
•
204k
•
4
djdumpling/fruit-box
Viewer
•
Updated
Oct 24, 2025
•
170k
•
26
djdumpling/fruit-box-minimal-area
Viewer
•
Updated
Oct 23, 2025
•
51.4k
•
7