🔄 In a Training Loop

Jerry Pan

JERRYPAN617

3 24

https://jerrypan617.github.io/

jerrypan617

AI & ML interests

Latent Space Reasoning, VLM, Test-Time Scaling

Recent Activity

liked a dataset 11 days ago

TACPS-liv/Spatial-DISE

upvoted a paper 6 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

liked a dataset 7 months ago

PKU-Alignment/PKU-SafeRLHF-single-dimension

View all activity

Organizations

liked a dataset 11 days ago

TACPS-liv/Spatial-DISE

Viewer • Updated May 21 • 12.4k • 1.13k • 3

liked 5 datasets 7 months ago

liked a model 7 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 11.7M • • 752

liked a model 8 months ago

JERRYPAN617/HH-BTRewardModel-roberta

Reinforcement Learning • 0.1B • Updated Nov 13, 2025 • 2 • 1

liked 7 datasets 8 months ago

ys-zong/VLGuard

Viewer • Updated Jan 19, 2025 • 3k • 376 • 16

PKU-Alignment/MM-SafetyBench

Viewer • Updated Sep 19, 2024 • 6.72k • 1.63k • 8

saferlhf-v/BeaverTails-V

Viewer • Updated Mar 8, 2025 • 30.4k • 158 • 7

PKU-Alignment/PKU-SafeRLHF-V

Viewer • Updated Mar 25, 2025 • 30.4k • 488 • 6

Moemu/Muice-Dataset

Viewer • Updated May 18 • 3.74k • 137 • 58

liuhaotian/LLaVA-Instruct-150K

Preview • Updated Jan 3, 2024 • 3.66k • 616

MMMU/MMMU

Viewer • Updated Apr 21 • 11.6k • 58.9k • 330

liked a Space 8 months ago

Qwen2.5 Psydoctor Demo

📈

基于 Qwen2.5-1.5B-Instruct 模型微调的 LoRA 适配器，专门用于心理医生对话场景。

liked 2 datasets 8 months ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Apr 22, 2025 • 90.1k • 10.1k • 1.13k

nvidia/Nemotron-CC-Math-v1

Viewer • Updated Dec 23, 2025 • 190M • 60.3k • 89

liked a model 8 months ago

JERRYPAN617/qwen2.5-lora-psydoctor

Text Generation • Updated Oct 25, 2025 • 4 • 1

liked a dataset 9 months ago

hiyouga/geometry3k

Viewer • Updated Apr 14, 2025 • 3k • 30.9k • 82

Jerry Pan

AI & ML interests

Recent Activity

Organizations

JERRYPAN617's activity

Qwen2.5 Psydoctor Demo