GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning Paper • 2504.00891 • Published Apr 1, 2025 • 14
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following Paper • 2506.09942 • Published Jun 11, 2025 • 5
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models Paper • 2505.15801 • Published May 21, 2025 • 17
Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning Paper • 2510.04081 • Published Oct 5, 2025 • 23
RustMap: Towards Project-Scale C-to-Rust Migration via Program Analysis and LLM Paper • 2503.17741 • Published Mar 22, 2025
CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation Paper • 2504.15254 • Published Apr 21, 2025 • 5
EVOC2RUST: A Skeleton-guided Framework for Project-Level C-to-Rust Translation Paper • 2508.04295 • Published Aug 6, 2025 • 7