xingyaoww commited on
Commit
af6377d
·
verified ·
1 Parent(s): c98f6c0

Add README with model documentation

Browse files
Files changed (1) hide show
  1. README.md +69 -0
README.md ADDED
@@ -0,0 +1,69 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # OpenHands Critic 4B v1.0
2
+
3
+ A 4B parameter critic model for evaluating AI agent trajectories, trained to predict task success from behavioral rubrics.
4
+
5
+ ## Model Details
6
+
7
+ - **Base Model**: Qwen3-4B
8
+ - **Training**: Full parameter fine-tuning with BCE loss
9
+ - **Context Length**: Trained on 64K, supports up to 256K tokens
10
+ - **Task**: Multi-label classification (26 labels: 25 rubric features + 1 success prediction)
11
+
12
+ ## Paper
13
+
14
+ This model is described in the paper: **"Rubric-Supervised Critics for Sparse Agent Feedback"**
15
+
16
+ ### Key Results (Mixed-Outcome Subset)
17
+ - **+15.9 points** Best@8 improvement over random selection (73.8% vs 57.9%)
18
+ - **0.83 MRR** - correct trajectory typically ranked first
19
+ - **83% compute reduction** via adaptive rollout (1.36 attempts vs 8)
20
+
21
+ ## Usage
22
+
23
+ This model is designed for use with vLLM's classification API:
24
+
25
+ ```python
26
+ from openai import OpenAI
27
+
28
+ client = OpenAI(
29
+ base_url="YOUR_VLLM_SERVER_URL/v1",
30
+ api_key="YOUR_API_KEY"
31
+ )
32
+
33
+ # Format your trajectory as a conversation
34
+ messages = [
35
+ {"role": "system", "content": "You are evaluating an AI agent's task attempt..."},
36
+ {"role": "user", "content": "Task: ..."},
37
+ {"role": "assistant", "content": "Agent actions..."}
38
+ ]
39
+
40
+ # Get classification scores
41
+ response = client.classifications.create(
42
+ model="openhands-critic-4b-v1.0",
43
+ messages=messages
44
+ )
45
+
46
+ # The model outputs probabilities for 26 labels:
47
+ # - Labels 0-24: Rubric features (behavioral indicators)
48
+ # - Label 25: Success prediction (primary output for ranking)
49
+ ```
50
+
51
+ ## Training Data
52
+
53
+ Trained on 154K segments from:
54
+ - Production agent conversations (150K segments)
55
+ - SWE-Gym benchmark trajectories (4K segments)
56
+
57
+ ## License
58
+
59
+ Please refer to the Qwen3 license for base model terms.
60
+
61
+ ## Citation
62
+
63
+ ```bibtex
64
+ @article{openhands2025critic,
65
+ title={Rubric-Supervised Critics for Sparse Agent Feedback},
66
+ author={OpenHands Team},
67
+ year={2025}
68
+ }
69
+ ```