Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
23
Jerry Pan
JERRYPAN617
Follow
0 followers
·
12 following
https://jerrypan617.github.io/
jerrypan617
AI & ML interests
RLHF, Retrieval-Augmented Multimodal Understanding...
Recent Activity
upvoted
a
paper
19 days ago
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
liked
a dataset
2 months ago
PKU-Alignment/PKU-SafeRLHF-single-dimension
liked
a dataset
2 months ago
PKU-Alignment/PKU-SafeRLHF
View all activity
Organizations
JERRYPAN617
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
5 datasets
2 months ago
PKU-Alignment/PKU-SafeRLHF-single-dimension
Viewer
•
Updated
Jun 14, 2024
•
81.1k
•
108
•
3
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
Oct 18, 2024
•
164k
•
7.24k
•
172
HuggingFaceH4/ultrafeedback_binarized
Viewer
•
Updated
Oct 16, 2024
•
187k
•
7.78k
•
317
LooksJuicy/ruozhiba
Viewer
•
Updated
Apr 9, 2024
•
1.5k
•
140
•
313
Karsh-CAI/btfChinese-DPO-small
Viewer
•
Updated
Apr 7, 2024
•
5k
•
116
•
22
liked
a model
2 months ago
Qwen/Qwen2.5-1.5B-Instruct
Text Generation
•
2B
•
Updated
Sep 25, 2024
•
6.47M
•
•
598
liked
a model
3 months ago
JERRYPAN617/HH-BTRewardModel-roberta
Reinforcement Learning
•
0.1B
•
Updated
Nov 13, 2025
•
1
•
1
liked
7 datasets
3 months ago
ys-zong/VLGuard
Viewer
•
Updated
Jan 19, 2025
•
3k
•
221
•
13
PKU-Alignment/MM-SafetyBench
Viewer
•
Updated
Sep 19, 2024
•
6.72k
•
1.06k
•
3
saferlhf-v/BeaverTails-V
Viewer
•
Updated
Mar 8, 2025
•
30.4k
•
735
•
7
PKU-Alignment/PKU-SafeRLHF-V
Viewer
•
Updated
Mar 25, 2025
•
30.4k
•
118
•
5
Moemu/Muice-Dataset
Viewer
•
Updated
Aug 23, 2025
•
3.7k
•
180
•
49
liuhaotian/LLaVA-Instruct-150K
Preview
•
Updated
Jan 3, 2024
•
2.78k
•
570
MMMU/MMMU
Viewer
•
Updated
Sep 19, 2024
•
11.6k
•
47.5k
•
310
liked
a Space
3 months ago
Sleeping
1
Qwen2.5 Psydoctor Demo
📈
1
基于 Qwen2.5-1.5B-Instruct 模型微调的 LoRA 适配器,专门用于心理医生对话场景。
liked
2 datasets
3 months ago
FreedomIntelligence/medical-o1-reasoning-SFT
Viewer
•
Updated
Apr 22, 2025
•
90.1k
•
5.78k
•
1.06k
nvidia/Nemotron-CC-Math-v1
Viewer
•
Updated
Dec 23, 2025
•
190M
•
7.88k
•
61
liked
a model
3 months ago
JERRYPAN617/qwen2.5-lora-psydoctor
Text Generation
•
Updated
Oct 25, 2025
•
5
•
1
liked
a dataset
3 months ago
hiyouga/geometry3k
Viewer
•
Updated
Apr 14, 2025
•
3k
•
21.9k
•
65
liked
a dataset
4 months ago
Anthropic/hh-rlhf
Viewer
•
Updated
May 26, 2023
•
169k
•
21.6k
•
1.65k
Load more