bensondccnqwc/tmp-lpqm4
tmp-lpqm4/evalplus_results/mbpp
14 MB, 1 contributor, History: 2 commits
bensondccnqwc
Add files using upload-large-folder tool
a99ebfc
verified
12 days ago
home--work--minzijun_rl_output_2--checkpoints--verl_role_sft_grpo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm_no_shuffle--global_step_40--actor--huggingface_vllm_temp_1.0.jsonl
1.47 MB
Add files using upload-large-folder tool
12 days ago
home--work--minzijun_rl_output_2--checkpoints--verl_role_sft_grpo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm_no_shuffle--global_step_50--actor--huggingface_vllm_temp_1.0.jsonl
1.49 MB
Add files using upload-large-folder tool
12 days ago
home--work--minzijun_rl_output_2--checkpoints--verl_role_sft_grpo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm_no_shuffle--global_step_50--actor--huggingface_vllm_temp_1.0.raw.jsonl
1.96 MB
Add files using upload-large-folder tool
12 days ago
home--work--minzijun_rl_output_2--checkpoints--verl_role_sft_grpo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm_no_shuffle--global_step_60--actor--huggingface_vllm_temp_1.0.jsonl
1.52 MB
Add files using upload-large-folder tool
12 days ago
home--work--minzijun_rl_output_2--checkpoints--verl_role_sft_grpo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm_no_shuffle--global_step_60--actor--huggingface_vllm_temp_1.0.raw.jsonl
2.01 MB
Add files using upload-large-folder tool
12 days ago
home--work--minzijun_rl_output_2--checkpoints--verl_role_sft_grpo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm_no_shuffle--global_step_70--actor--huggingface_vllm_temp_1.0.jsonl
1.53 MB
Add files using upload-large-folder tool
12 days ago
home--work--minzijun_rl_output_2--checkpoints--verl_role_sft_grpo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm_no_shuffle--global_step_70--actor--huggingface_vllm_temp_1.0.raw.jsonl
2.04 MB
Add files using upload-large-folder tool
12 days ago
home--work--minzijun_rl_output_2--checkpoints--verl_role_sft_grpo_deepmath_train_sample_6144_context_4k_Qwen3-1.7B-base_max_response4096_batch1024_rollout8_vllm_no_shuffle--global_step_80--actor--huggingface_vllm_temp_1.0.raw.jsonl
2.02 MB
Add files using upload-large-folder tool
12 days ago