DCAgent/rl__32GPU_shaped_entropy__mix_v2_h2_language_balanced__GLM-4_7-swesmith-san__20-0 Viewer • Updated Mar 22 • 1.02k • 8
DCAgent/rl__32GPU_shaped_entropy__mix_v2_baseline_uniform__GLM-4_7-swesmith-san__20-0 Viewer • Updated Mar 22 • 979 • 5
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__GLM-4_7-swesmith-san__20-0 Viewer • Updated Mar 22 • 896 • 5
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h1_struggle_zone__GLM-4_7-swesmith-san__20-0 Viewer • Updated Mar 22 • 880 • 5
DCAgent/neulab-agenttuning-mind2web-sandboxes_glm_4.7_traces_jupiter Viewer • Updated Mar 22 • 10k • 7
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h2_language_balanced__GLM-4_7-swesmith-san Viewer • Updated Mar 22 • 852 • 4
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h1_struggle_zone__GLM-4_7-swesmith-san Viewer • Updated Mar 22 • 831 • 7
DCAgent/neulab-agenttuning-alfworld-sandboxes_glm_4.7_traces_jupiter Viewer • Updated Mar 22 • 11.1k • 9
DCAgent/rl__32GPU_shaped_entropy__mix_v2_baseline_uniform__GLM-4_7-swesmith-san Viewer • Updated Mar 22 • 1.03k • 5
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__GLM-4_7-swesmith-san Viewer • Updated Mar 21 • 879 • 6
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h2_language_proportional__GLM-4_7-swesmith-san Viewer • Updated Mar 21 • 859 • 5
DCAgent/rl__24GPU_shaped_entropy__mix_v2_baseline_uniform__qwen3base-GLM-4_7-sw Viewer • Updated Mar 21 • 720 • 5
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__qwen3base-GLM-4_7-sw Viewer • Updated Mar 21 • 850 • 6
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h2_language_proportional__qwen3base-GLM-4_7-sw Viewer • Updated Mar 21 • 762 • 4