bunny127 commited on
Commit
5992f5b
·
verified ·
1 Parent(s): 32baa51

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +22 -3
README.md CHANGED
@@ -1,3 +1,22 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Official Repo of Reagent.
2
+ Paper:
3
+
4
+ ## Abstract:
5
+ Agentic Reinforcement Learning (Agentic RL) has achieved notable success in enabling agents to perform complex reasoning and tool use.
6
+ However, most methods still relies on sparse outcome-based reward for training.
7
+ Such feedback fails to differentiate intermediate reasoning quality, leading to suboptimal training results.
8
+ In this paper, we introduce \textbf{Agent Reasoning Reward Model (Agent-RRM)}, a multi-faceted reward model that produces structured feedback for agentic trajectories, including (1) an explicit reasoning trace , (2) a focused critique that provides refinement guidance by highlighting reasoning flaws, and (3) an overall score that evaluates process performance.
9
+ Leveraging these signals, we systematically investigate three integration strategies: \textbf{Reagent-C} (text-augmented refinement), \textbf{Reagent-R} (reward-augmented guidance), and \textbf{Reagent-U} (unified feedback integration).
10
+ Extensive evaluations across 12 diverse benchmarks demonstrate that Reagent-U yields substantial performance leaps, achieving 43.7\% on GAIA and 46.2\% on WebWalkerQA, validating the effectiveness of our reasoning reward model and training schemes.
11
+
12
+ ## GitHub Repository
13
+ The official codebase, including training and evaluation scripts for Reagent, can be found on the project's GitHub repository: https://github.com/kxfan2002/Reagent
14
+
15
+ ## Citation
16
+
17
+ ```bash
18
+
19
+ ```
20
+ ---
21
+ license: apache-2.0
22
+ ---