sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102247 Updated Jan 11
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102246 Updated Jan 11
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102244 Updated Jan 11
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102242 Updated Jan 11
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102239 Updated Jan 11
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102236 Updated Jan 11
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601101739 Updated Jan 10
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-sanitization-42-202601082138 Text Generation • 8B • Updated Jan 9
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-llm-judge-42-20260108-1706 Text Generation • 8B • Updated Jan 8 • 1
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-llm-judge-42 Text Generation • 8B • Updated Jan 8
sleeepeer/meta-llama-Meta-Llama-3-8B-Instruct-DPO-dpo_anchor_3epoch_llama3_2000-42 Updated Oct 4, 2025
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-DPO-dpo_anchor_3epoch_no_instruction-42 Updated Oct 3, 2025
sleeepeer/Llama-3.1-8B-Instruct-GRPO-alpaca_mix_combine_naive-llm-judge-42 Text Generation • 8B • Updated Jul 16, 2025 • 2
sleeepeer/Llama-3.1-8B-Instruct-GRPO-alpaca_mix_combine_naive_least_similar-llm-judge-42 Text Generation • 8B • Updated Jul 16, 2025 • 1