lliutianc committed on
Commit 14ae2f0 · verified · 1 Parent(s): bfb7a71

Update README.md

  1. README.md +15 -5
README.md CHANGED
@@ -1,6 +1,6 @@
  # OpenRubrics/RubricRM-4B-Judge
 
- Finetuned checkpoint for rubric-based reward modeling / judging.
 
  ## Usage
  ```python
@@ -10,7 +10,17 @@ tok = AutoTokenizer.from_pretrained(model_id, use_fast=True)
  model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
  ```
 
- ## Notes
- - Format: Transformers-compatible (config/tokenizer/weights).
- - Base: Qwen3 4B (Rubric-Aware Judge).
- - Files tracked with Git LFS.
 
  # OpenRubrics/RubricRM-4B-Judge
 
+ This is a 4B RubricRM-4B-Judge model, finetuned from [Qwen3/Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B).
 
  ## Usage
  ```python
 
  model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
  ```
 
+
+ If you find our work helpful, please consider citing our paper:
+
+ ```
+ @misc{liu2025openrubricsscalablesyntheticrubric,
+ title={OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment},
+ author={Tianci Liu and Ran Xu and Tony Yu and Ilgee Hong and Carl Yang and Tuo Zhao and Haoyu Wang},
+ year={2025},
+ eprint={2510.07743},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL},
+ url={https://arxiv.org/abs/2510.07743},
+ }
+ ```