OALL/details_princeton-nlp__Mistral-7B-Base-SFT-IPO
• 146k • 5
OALL/details_princeton-nlp__Mistral-7B-Base-SFT-KTO
• 146k • 5
OALL/details_princeton-nlp__Mistral-7B-Base-SFT-DPO
• 146k • 5
OALL/details_princeton-nlp__Mistral-7B-Base-SFT-RDPO
• 146k • 5
OALL/details_princeton-nlp__Mistral-7B-Base-SFT-RRHF
• 146k • 5
OALL/details_princeton-nlp__Mistral-7B-Base-SFT-CPO
• 146k • 5
OALL/details_princeton-nlp__Mistral-7B-Base-SFT-SLiC-HF
• 146k • 5
OALL/details_tanliboy__lambda-qwen2.5-32b-dpo-test
• 146k • 8
OALL/details_princeton-nlp__Mistral-7B-Instruct-CPO
• 146k • 5
OALL/details_princeton-nlp__Mistral-7B-Instruct-RRHF
• 146k • 8
OALL/details_princeton-nlp__Mistral-7B-Instruct-SLiC-HF
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Base-8B-SFT-RRHF
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Base-8B-SFT-SLiC-HF
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Instruct-8B-RRHF
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Instruct-8B-RRHF-v0.2
• 146k • 4
OALL/details_princeton-nlp__Llama-3-Instruct-8B-SLiC-HF-v0.2
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Base-8B-SFT
• 146k • 5
OALL/details_v000000__L3.1-Niitorm-8B-DPO-t0.0001
• 146k • 5
OALL/details_Cran-May__T.E-8.1
• 146k • 5
OALL/details_Syed-Hasan-8503__Phi-3-mini-4K-instruct-cpo-simpo
• 146k • 5
OALL/details_UCLA-AGI__Mistral7B-PairRM-SPPO-Iter1
• 146k • 5
OALL/details_princeton-nlp__Mistral-7B-Base-SFT-SimPO
• 146k • 5
OALL/details_MaziyarPanahi__calme-2.1-qwen2.5-72b
• 146k • 5
OALL/details_MaziyarPanahi__calme-2.2-qwen2.5-72b
• 146k • 5
OALL/details_UCLA-AGI__Llama-3-Instruct-8B-SPPO-Iter1
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Base-8B-SFT-DPO
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Base-8B-SFT-ORPO
• 146k • 4
OALL/details_princeton-nlp__Llama-3-Base-8B-SFT-KTO
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Base-8B-SFT-RDPO
• 146k • 5
OALL/details_princeton-nlp__Llama-3-Base-8B-SFT-CPO
• 146k • 5