Update README.md
README.md
CHANGED
@@ -63,11 +63,11 @@ For inference code, prompt templates, and setup instructions, please refer to ou
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
 The model underwent a two-stage training process:
 1. **Stage 1 (General Adaptation):** Fine-tuned on the complete CowCorpus dataset.
-2. **Stage 2 (User Personalization):** Further fine-tuned on the **User Cluster 0 subset** of CowCorpus, consists of 101 trajectories and 793 steps.
+2. **Stage 2 (User Personalization):** Further fine-tuned on the **User Cluster 0 subset** of CowCorpus, consists of 101 trajectories and 793 steps. (P4-6, P9, P11-12, P14, P16-17, P19-20)
 
 **User Cluster 0 Characteristics:**
 * **Data Source:** A subset of the collaborative trajectories specific to User Group 0.
-* **Behavioral Profile:** Collaborative user, interact with rare, modest interventions, usually later in the task, with a strong tendency to hand control back to the agent.
+* **Behavioral Profile:** Collaborative user, interact with rare, modest interventions, usually later in the task, with a strong tendency to hand control back to the agent.
 
 [More Information Needed]
 