AI & ML interests
None yet
Organizations
Pamela153/your-hf-repo-id
Pamela153/PPO-Qwen7B-alfworld-task-all-iter16-sft-300-data
Pamela153/PPO-Qwen7B-alfworld-task-1-iter16-sft-300-data
Pamela153/PPO-Qwen7B-alfworld-task-mixed-iter16-sft-300-data
Pamela153/swe-gym-task-1-Qwen3-8B
Pamela153/PPO-Qwen7B-alfworld-iter16-sft-100-data-param-20
Pamela153/PPO-Qwen7B-alfworld-iter16-sft-100-data-param-19
Pamela153/PPO-Qwen7B-alfworld-iter24-sft-100-data-param-14
Pamela153/PPO-Qwen7B-alfworld-iter24-sft-100-data-param-11
Pamela153/PPO-Qwen1.5B-tw-tiny-w2-o3-q4-iter12-temp0-fused-kernels
Pamela153/PPO-Qwen1.5B-tw-tiny-w2-o3-q4-iter12-temp0
Pamela153/PPO-Qwen7B-tw-small-w4-o6-q8-iter24-param-12
Pamela153/PPO-Qwen7B-tw-small-w4-o6-q8-iter24-param-15
Pamela153/PPO-Qwen7B-tw-small-w4-o6-q8-iter24-param-11
Pamela153/PPO-Qwen7B-tw-small-w4-o6-q8-iter24-param-16
Pamela153/PPO-Qwen7B-tw-small-w4-o6-q8-iter24-param-17
Pamela153/PPO-Qwen7B-tw-small-w4-o6-q8-iter24-param-13
Pamela153/PPO-Qwen7B-tw-small-w4-o6-q8-iter24-param-14
Pamela153/PPO-Qwen1.5B-tw-small-w4-o6-q8-iter24-param-2-seed-2
Pamela153/PPO-Qwen1.5B-tw-small-w4-o6-q8-iter24-param-2
Pamela153/PPO-Qwen1.5B-tw-small-w4-o6-q8-iter24-param-2-seed-1
Pamela153/PPO-Qwen1.5B-tw-small-w4-o6-q8-iter24-param-8
Pamela153/PPO-Qwen1.5B-tw-small-w4-o6-q8-iter24-param-10
Pamela153/PPO-Qwen1.5B-tw-tiny-w2-o3-q4-iter12-param-8
Pamela153/PPO-Qwen1.5B-tw-tiny-w2-o3-q4-iter12-param-10
Pamela153/PPO-Qwen1.5B-tw-tiny-w2-o3-q4-iter12-param-2-seed-2
Pamela153/PPO-Qwen1.5B-tw-tiny-w2-o3-q4-iter12-param-2-seed-1
Pamela153/PPO-Qwen1.5B-tw-tiny-w2-o3-q4-iter12-param-9
Pamela153/PPO-Qwen1.5B-tw-tiny-w2-o3-q4-iter12-param-6