Spaces:

IP-GRM
/

README

Configuration error

ShadeCloak commited on Feb 13

Commit

d4c4733

verified ·

1 Parent(s): 08448cb

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ IP-GRM (Independent Principle Generative Reward Model) is a decoupled reward-mod
 | [IP-GRM](https://huggingface.co/IP-GRM/IP-GRM) | 16B generative reward model with decoupled principle-judgment pipeline |
 | [CreativeWriting-8B](https://huggingface.co/IP-GRM/CreativeWriting-8B) | 8B creative writing model trained via GRPO with IP-GRM rewards |
 | [IP-rewarding-8K](https://huggingface.co/datasets/IP-GRM/IP-rewarding-8K) | 8K decoupled reward SFT dataset (principle + judgment pairs) |
-| [Paper](https://arxiv.org/abs/2602.11111111) | arXiv preprint |
 | [Code](https://github.com/ShadeCloak/IP-GRM) | Training scripts and IP-GRM process functions |
 ## Key Idea

 | [IP-GRM](https://huggingface.co/IP-GRM/IP-GRM) | 16B generative reward model with decoupled principle-judgment pipeline |
 | [CreativeWriting-8B](https://huggingface.co/IP-GRM/CreativeWriting-8B) | 8B creative writing model trained via GRPO with IP-GRM rewards |
 | [IP-rewarding-8K](https://huggingface.co/datasets/IP-GRM/IP-rewarding-8K) | 8K decoupled reward SFT dataset (principle + judgment pairs) |
+| [Paper](https://arxiv.org/abs/) | arXiv preprint |
 | [Code](https://github.com/ShadeCloak/IP-GRM) | Training scripts and IP-GRM process functions |
 ## Key Idea