bensondccnqwc/tmp-cjncn1dc91
Safetensors
main
tmp-cjncn1dc91/evalplus_results/mbpp
52.2 MB
1 contributor, History: 2 commits
bensondccnqwc: Add files using upload-large-folder tool (6a444ef, verified, 4 months ago)
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_10--actor--huggingface_vllm_temp_1.0.eval_results.json
1.75 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_10--actor--huggingface_vllm_temp_1.0.jsonl
1.4 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_10--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.79 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_100--actor--huggingface_vllm_temp_1.0.eval_results.json
1.87 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_100--actor--huggingface_vllm_temp_1.0.jsonl
1.53 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_100--actor--huggingface_vllm_temp_1.0.raw.jsonl
2.06 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_20--actor--huggingface_vllm_temp_1.0.eval_results.json
1.8 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_20--actor--huggingface_vllm_temp_1.0.jsonl
1.45 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_20--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.86 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_30--actor--huggingface_vllm_temp_1.0.eval_results.json
1.78 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_30--actor--huggingface_vllm_temp_1.0.jsonl
1.43 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_30--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.85 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_40--actor--huggingface_vllm_temp_1.0.eval_results.json
1.8 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_40--actor--huggingface_vllm_temp_1.0.jsonl
1.46 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_40--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.9 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_50--actor--huggingface_vllm_temp_1.0.eval_results.json
1.81 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_50--actor--huggingface_vllm_temp_1.0.jsonl
1.46 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_50--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.91 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_60--actor--huggingface_vllm_temp_1.0.eval_results.json
1.82 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_60--actor--huggingface_vllm_temp_1.0.jsonl
1.48 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_60--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.96 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_70--actor--huggingface_vllm_temp_1.0.eval_results.json
1.82 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_70--actor--huggingface_vllm_temp_1.0.jsonl
1.48 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_70--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.97 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_80--actor--huggingface_vllm_temp_1.0.eval_results.json
1.83 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_80--actor--huggingface_vllm_temp_1.0.jsonl
1.49 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_80--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.98 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_90--actor--huggingface_vllm_temp_1.0.eval_results.json
1.85 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_90--actor--huggingface_vllm_temp_1.0.jsonl
1.51 MB
home--work--minzijun_rl_output--checkpoints--ppo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm--global_step_90--actor--huggingface_vllm_temp_1.0.raw.jsonl
2.05 MB