AI & ML interests
None defined yet.
Recent Activity
PessimisticDPO/SmolLM2-1.7B-reppo-b0.1-L4-e1
Updated
PessimisticDPO/SmolLM2-1.7B-reppo-b0.1-L4-policy-e1
Updated
PessimisticDPO/SmolLM2-1.7B-a0.1-b0.1-L2-l1-e0
Updated
PessimisticDPO/SmolLM2-1.7B-a0.1-b0.1-L2-l0-e0
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r3
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r2
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r1
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r0
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r3-e1
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r1-e1
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r2-e1
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r0-e1
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r1-e0
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r2-e0
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r3-e0
Updated
PessimisticDPO/Llama-3.1-Tulu-3-8B-SFT-reppo-a0.0-b0.0-L4-r0-e0
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L2-l1
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L2-l0
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L2-l1-e8
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L2-l0-e8
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L2-l1-e7
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L2-l0-e7
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L2-l1-e6
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L2-l0-e6
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L3-l2
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L3-l1
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L3-l0
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L3-l1-e8
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L3-l2-e8
Updated
PessimisticDPO/Yi-34B-Chat-a0.1-b0.1-L3-l0-e8
Updated