Improve model card: Add paper and code links
This PR enhances the model card by adding prominent links to the paper ([PIPer: On-Device Environment Setup via Online Reinforcement Learning](https://huggingface.co/papers/2509.25455)) and its GitHub repository ([https://github.com/JetBrains-Research/PIPer](https://github.com/JetBrains-Research/PIPer)).
These additions improve the discoverability and accessibility of key resources for the model. The existing metadata is correct and remains unchanged. No sample usage section has been added as no direct inference code was found in the provided GitHub README.
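Because the diff appears to reorder the YAML front-matter keys rather than change their values, a reviewer may want to confirm that two orderings parse to the same metadata. The sketch below is one hedged way to do that with the standard library only. `parse_front_matter` is a hypothetical helper written for this note, not part of the PIPer repository or the Hub tooling, and it handles only the flat `key: value` and `key:` plus `- item` shapes seen in this card. `REORDERED` is an illustrative ordering, not the actual pre-change block, whose removed lines are only partly visible on this page.

```python
def parse_front_matter(text):
    """Parse a flat YAML front-matter block into a dict.

    Assumption: only `key: value` scalars and `key:` followed by
    `- item` list entries occur, as in this model card.
    """
    lines = text.strip().splitlines()
    if len(lines) < 2 or lines[0].strip() != "---" or lines[-1].strip() != "---":
        raise ValueError("expected a block delimited by '---' lines")

    meta = {}
    current_key = None  # key whose list is currently being filled
    for raw in lines[1:-1]:
        line = raw.strip()
        if not line:
            continue
        if line.startswith("- "):
            if current_key is None:
                raise ValueError("list item without a preceding key")
            meta[current_key].append(line[2:].strip())
        else:
            key, _, value = line.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                meta[key] = value
                current_key = None
            else:
                meta[key] = []
                current_key = key
    return meta


# Front matter as it appears on the added side of this PR.
NEW_FRONT_MATTER = """---
base_model:
- JetBrains-Research/Qwen3-8B-am
datasets:
- JetBrains-Research/PIPer-envbench-zeroshot-rl
- JetBrains-Research/PIPer-SFT-2500-sharegpt
library_name: transformers
license: mit
pipeline_tag: text-generation
---"""

# Hypothetical alternative ordering of the same keys, for illustration only.
REORDERED = """---
library_name: transformers
datasets:
- JetBrains-Research/PIPer-envbench-zeroshot-rl
- JetBrains-Research/PIPer-SFT-2500-sharegpt
base_model:
- JetBrains-Research/Qwen3-8B-am
pipeline_tag: text-generation
license: mit
---"""

new_meta = parse_front_matter(NEW_FRONT_MATTER)
reordered_meta = parse_front_matter(REORDERED)

# Dict equality ignores key order, so this holds when only the order differs.
same_metadata = new_meta == reordered_meta
```

Comparing parsed dictionaries rather than raw text keeps the check insensitive to key order while still catching a changed or dropped value.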
README.md CHANGED

@@ -1,19 +1,22 @@
 ---
-
+base_model:
+- JetBrains-Research/Qwen3-8B-am
 datasets:
 - JetBrains-Research/PIPer-envbench-zeroshot-rl
 - JetBrains-Research/PIPer-SFT-2500-sharegpt
-
-- JetBrains-Research/Qwen3-8B-am
-pipeline_tag: text-generation
+library_name: transformers
 license: mit
+pipeline_tag: text-generation
 ---
+
 <img src="https://github.com/JetBrains-Research/PIPer/blob/main/misc/piper-logo.png?raw=true" alt="PIPer Mascot" style="height: 6em">
 <h1>
 PIPer: On-Device Environment Setup via Online Reinforcement Learning
 
 </h1>
 
+[Paper](https://huggingface.co/papers/2509.25455) | [Code](https://github.com/JetBrains-Research/PIPer)
+
 <div align="center">
 
 [](https://jb.gg/PIPer)
@@ -97,4 +100,4 @@ uv run piper/hparams_entrypoint.py +experiment=llm-reward --info config
 
 ## 📄 License
 
-This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
+This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.