QwenQKing/Prompt-R1


QwenQKing committed on Dec 1, 2025

Commit bd75068 · verified · 1 Parent(s): d804792

Update README.md

Files changed (1)

README.md +21 -3

README.md CHANGED

@@ -1,3 +1,21 @@
+---
+# YAML metadata block (Model Card Header)
+language:
+  - en
+license: apache-2.0
+model_name: Prompt-R1 Model
+tags:
+  - text-generation
+  - reinforcement-learning
+  - nlp
+  - transformers
+  - safetensors
+# Associated datasets
+datasets:
+  - QwenQKing/Prompt-R1
+# Determines the type of inference widget shown on the right side of the page
+pipeline_tag: text-generation
+---
 # Prompt-R1: Enhancing LLM interaction on behalf of humans
 <div align="center">
@@ -19,7 +37,7 @@
 ## Overview
 <div align="center">
-  <img src="figs/fig1.png" width="80%"/>
+  <img src="image/2-QA.png" width="80%"/>
 </div>
 **Prompt-R1** has addressed a critical challenge in interacting with large language models (LLMs)—the inability of users to provide accurate and effective interaction prompts for complex tasks. **Prompt-R1** is an **end-to-end reinforcement learning (RL)** framework that enhances the performance of LLMs by facilitating **collaborative automatic prompting** between a small-scale LLM and a large-scale LLM. **Prompt-R1**, through **multi-turn prompt interaction**, significantly improves the generation quality and reasoning accuracy of large-scale LLMs, enabling better task-solving performance without requiring user expertise in prompt formulation.
@@ -27,7 +45,7 @@
 <div align="center">
-  <img src="static/images/1-overview.png" width="90%"/>
+  <img src="image/1-overview.png" width="90%"/>
 </div>
 By integrating **collaborative prompting** and **reinforcement learning**, **Prompt-R1** offers a **plug-and-play framework** that supports both **inference** and **training** with **various large-scale LLMs** as the environment.
@@ -35,7 +53,7 @@ By integrating **collaborative prompting** and **reinforcement learning**, **Pro
 ## Experimental Results
 **Results of Different Large language models:**
 <div align="center">
-  <img src="figs/fig3.png" width="100%"/>
+  <img src="image/6-radar.png" width="100%"/>
 </div>
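
Note on the Overview text in this diff: it describes a small-scale LLM that writes prompts for a large-scale LLM over several turns, with the large model acting as the environment. The following is a minimal sketch of what such an interaction loop might look like, for illustration only. It is not taken from the Prompt-R1 code base: the names `collaborate`, `toy_agent`, and `toy_environment`, the `max_turns` parameter, and the stopping rule are all assumptions, and both models are replaced by plain Python stubs. The RL training step is not shown.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical interfaces (assumptions, not the Prompt-R1 API):
#   agent(question, history) -> (prompt_for_large_llm, final_answer_or_None)
#   environment(prompt)      -> response text from the large-scale LLM
History = List[Tuple[str, str]]
Agent = Callable[[str, History], Tuple[str, Optional[str]]]
Environment = Callable[[str], str]


def collaborate(
    question: str,
    agent: Agent,
    environment: Environment,
    max_turns: int = 3,
) -> Tuple[str, History]:
    """Run a multi-turn prompt interaction and return (answer, history)."""
    history: History = []
    for _ in range(max_turns):
        # The small-scale model decides whether to ask again or to stop.
        prompt, answer = agent(question, history)
        if answer is not None:
            return answer, history
        # The large-scale model plays the role of the environment.
        response = environment(prompt)
        history.append((prompt, response))
    # Turn budget exhausted: fall back to the last environment response.
    return (history[-1][1] if history else ""), history


def toy_agent(question: str, history: History) -> Tuple[str, Optional[str]]:
    """Stub small-scale model: ask once, then accept the reply as the answer."""
    if not history:
        return f"Break this question into steps: {question}", None
    return "", history[-1][1]


def toy_environment(prompt: str) -> str:
    """Stub large-scale model: echo the prompt inside a marker."""
    return f"[large-LLM reply to: {prompt}]"


answer, history = collaborate("What is 2 + 2?", toy_agent, toy_environment)
print(answer)
```

In a real setup the two stubs would be replaced by calls to actual models; the loop itself only assumes that the agent can see the question plus the earlier prompt and response pairs.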