shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-k5-opus-distill-32k-lr5e6-multiturn Text Generation • 1B • Updated 7 days ago • 16
shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-k5-opus-distill-32k-lr5e6-multiturn Text Generation • 1B • Updated 7 days ago • 16
shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-instructions-k10-opus-distill-32k-lr5e6-multiturn Updated 11 days ago
shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-instructions-k10-opus-distill-32k-lr5e6-flattened Text Generation • 1B • Updated 11 days ago • 14
shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-instructions-k10-opus-distill-32k-lr5e6-flattened Text Generation • 1B • Updated 11 days ago • 14
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-multiturn Text Generation • 1B • Updated 14 days ago • 460
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-multiturn Text Generation • 1B • Updated 14 days ago • 460
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-flattened Text Generation • 1B • Updated 15 days ago • 387
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-flattened Text Generation • 1B • Updated 15 days ago • 387
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean_think Text Generation • 1B • Updated Mar 28 • 33
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean_think Text Generation • 1B • Updated Mar 28 • 33
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean Text Generation • 1B • Updated Mar 27 • 78
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean Text Generation • 1B • Updated Mar 27 • 78
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_rejection-sample_think Text Generation • 1B • Updated Mar 27 • 33
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_rejection-sample_think Text Generation • 1B • Updated Mar 27 • 33
Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection Paper • 2505.17558 • Published May 23, 2025 • 15
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models Paper • 2502.14302 • Published Feb 20, 2025 • 9