·
AI & ML interests
None yet
Organizations
None yet
saurabh5/lcb_code_generation_release_v3_training_lite
Viewer
• Updated • 322 • 16
saurabh5/lcb_code_generation_release_v3_training
Viewer
• Updated • 328 • 18
saurabh5/open-code-reasoning-rlvr-original-problems
Viewer
• Updated • 13.8k • 4
saurabh5/lcb_code_generation_v4_v6
Viewer
• Updated • 443 • 9
saurabh5/lcb_code_generation_release_v3
Viewer
• Updated • 612 • 11
saurabh5/lcb_code_generation
Viewer
• Updated • 328 • 8
saurabh5/llama-nemotron-rlvr-code-stdio-sft
Viewer
• Updated • 98.3k • 4
Viewer
• Updated • 328 • 4
saurabh5/rlvr_acecoder_ot_diff_filtered
Viewer
• Updated • 29.2k • 4
saurabh5/open-code-reasoning-rlvr-stdio-og-input
Viewer
• Updated • 25.4k • 4
saurabh5/saurabh5-rlvr_acecoder_filtered-offline-results-full-chunk-50000
Viewer
• Updated • 10k • 5
saurabh5/saurabh5-rlvr_acecoder_filtered-offline-results-full-chunk-10000
Viewer
• Updated • 10k • 7
saurabh5/saurabh5-rlvr_acecoder_filtered-offline-results-full-chunk-40000
Viewer
• Updated • 10k • 4
saurabh5/saurabh5-rlvr_acecoder_filtered-offline-results-full-chunk-30000
Viewer
• Updated • 10k • 4
saurabh5/saurabh5-rlvr_acecoder_filtered-offline-results-full-chunk-20000
Viewer
• Updated • 10k • 4
saurabh5/saurabh5-rlvr_acecoder_filtered-offline-results-full-chunk-0
Viewer
• Updated • 10k • 4
saurabh5/saurabh5-rlvr_acecoder_filtered-offline-results-4k
Viewer
• Updated • 4k • 5
saurabh5/saurabh5-rlvr_acecoder_filtered-offline-results-full-chunk-60000
Viewer
• Updated • 3.03k • 5
saurabh5/rlvr-acecoder-filtered-offline-results
Viewer
• Updated • 63k • 6
saurabh5/rlvr-code-data-Java-sft
Viewer
• Updated • 133k • 4
• 1
saurabh5/rlvr-code-data-JavaScript-sft
Viewer
• Updated • 133k • 31
• 2
saurabh5/rlvr-code-data-python-sft
Viewer
• Updated • 133k • 5
• 1
saurabh5/rlvr-code-data-python
Viewer
• Updated • 133k • 10
saurabh5/llama-nemotron-rlvr-code-stdio
Viewer
• Updated • 98.3k • 25
saurabh5/llama-nemotron-rlvr-stdio
Viewer
• Updated • 71k • 5
saurabh5/open-code-reasoning-rlvr-sft-stdio
Viewer
• Updated • 23.1k • 4
saurabh5/open-code-reasoning-rlvr-stdio
Viewer
• Updated • 25.4k • 39
saurabh5/llama-nemotron-rlvr
Viewer
• Updated • 29.1k • 13
saurabh5/the-algorithm-python
Viewer
• Updated • 608 • 11
saurabh5/rlvr_acecoder_filtered
Viewer
• Updated • 63k • 111