·
AI & ML interests
LLMs
Recent Activity
Organizations
None yet
models
739
ZHLiu627/sokoban-GRPO-from-sft-Llama-3.1-8B-Instruct-window-1-nothink-30step
Updated
ZHLiu627/sokoban-GRPO-from-sft-Llama-3.1-8B-Instruct-window-1-nothink-15step
Updated
ZHLiu627/aug_verl_agent_webshop-GRPO-kl0.01-from-webshop-20step-v2-Llama-3.1-8B-Instruct-info40-150step
Updated
ZHLiu627/aug_verl_agent_webshop-GRPO-kl0.01-from-webshop-20step-v2-Llama-3.1-8B-Instruct-info40-135step
Updated
ZHLiu627/aug_verl_agent_webshop-GRPO-kl0.01-from-webshop-20step-v2-Llama-3.1-8B-Instruct-info40-120step
Updated
ZHLiu627/aug_verl_agent_webshop-GRPO-kl0.01-from-webshop-20step-v2-Llama-3.1-8B-Instruct-info40-105step
Updated
ZHLiu627/aug_verl_agent_webshop-GRPO-kl0.01-from-webshop-20step-v2-Llama-3.1-8B-Instruct-info40-90step
Updated
ZHLiu627/aug_verl_agent_webshop-GRPO-kl0.01-from-webshop-20step-v2-Llama-3.1-8B-Instruct-info40-75step
Updated
ZHLiu627/aug_verl_agent_webshop-GRPO-kl0.01-from-webshop-20step-v2-Llama-3.1-8B-Instruct-info40-60step
Updated
ZHLiu627/aug_verl_agent_webshop-GRPO-kl0.01-from-webshop-20step-v2-Llama-3.1-8B-Instruct-info40-45step
Updated