Yung-Shun Chang

yungshun317

yungshun317's models (14)

yungshun317/llava1.5-7b-rlaif-v-dpo

Updated Oct 31, 2025

yungshun317/qwen2.5-0.5B-prm-mathshepherd

Token Classification • 0.5B • Updated Oct 30, 2025 • 4

yungshun317/sft-qwen2.5-7b-qlora

Text Generation • Updated Oct 29, 2025 • 1

yungshun317/qwen2.5-32b-deberta-ultrafeedback-grpo-lora-ds

Updated Oct 22, 2025

yungshun317/qwen2.5-7b-deberta-ultrafeedback-grpo-lora-ds-composite-reward

Updated Oct 6, 2025

yungshun317/deberta-v3-large-format-guard-preference-distillation

0.4B • Updated Oct 1, 2025 • 2

yungshun317/deberta-v3-large-preference-distillation

0.4B • Updated Oct 1, 2025 • 1

yungshun317/deberta-v3-large-format-guard

0.4B • Updated Sep 30, 2025 • 1

yungshun317/qwen2.5-7b-deberta-ultrafeedback-grpo-lora-ds

Updated Sep 28, 2025

yungshun317/qwen2-0.5B-deberta-ultrafeedback-grpo

Text Generation • 0.5B • Updated Sep 26, 2025 • 6

yungshun317/smollm2-135m-ultrafeedback-dpo

0.1B • Updated Sep 26, 2025 • 1

yungshun317/deberta-v3-large-ultrafeedback-rm

Text Classification • 0.4B • Updated Sep 22, 2025 • 1

yungshun317/naruto-lora

Text-to-Image • Updated Aug 29, 2024 • 4

yungshun317/albert-base-v2-finetuned-squad

Question Answering • Updated Jul 26, 2023