OpenRubrics
/

RubricRM-4B-Judge

Model card Files Files and versions

lliutianc commited on Oct 11, 2025

Commit

14ae2f0

·

verified ·

1 Parent(s): bfb7a71

Update README.md

Files changed (1) hide show

README.md +15 -5

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # OpenRubrics/RubricRM-4B-Judge
-Finetuned checkpoint for rubric-based reward modeling / judging.
 ## Usage
 ```python
@@ -10,7 +10,17 @@ tok = AutoTokenizer.from_pretrained(model_id, use_fast=True)
 model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
 ```
-## Notes
-- Format: Transformers-compatible (config/tokenizer/weights).
-- Base: Qwen3 4B (Rubric-Aware Judge).
-- Files tracked with Git LFS.

 # OpenRubrics/RubricRM-4B-Judge
+This is a 4B RubricRM-4B-Judge model, finetuned from [Qwen3/Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B).
 ## Usage
 ```python
 model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
 ```
+If you find our work helpful, please consider citing our paper:
+```
+@misc{liu2025openrubricsscalablesyntheticrubric,
+      title={OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment},
+      author={Tianci Liu and Ran Xu and Tony Yu and Ilgee Hong and Carl Yang and Tuo Zhao and Haoyu Wang},
+      year={2025},
+      eprint={2510.07743},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2510.07743},
+}
+```