RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models
Paper • 2602.17053 • Published 22 days ago