Upload folder using huggingface_hub
Browse files
README.md
CHANGED
|
@@ -13,12 +13,11 @@ base_model:
|
|
| 13 |
|
| 14 |
Our User Simulator with Implicit Profiles (USP) replicates human-like conversational behavior in interactions with large language models (LLMs). By emulating diverse user dynamics based on predefined profiles, it reconstructs realistic user-LLM dialogues, leveraging the LLaMA-3-base-8B architecture with **Conditional Supervised Fine-Tuning (SFT)** and **Reinforcement Learning with Cycle Consistency (RLCC)**. For a detailed methodology and insights, refer to [Our Paper](#).
|
| 15 |
|
| 16 |
-
|
| 17 |
-
|
| 18 |
-
1. **Maximum Context Length**: Supports up to **4,096 tokens**. Exceeding this may degrade performance; keep inputs within this limit for best results.
|
| 19 |
-
|
| 20 |
-
1. **Language Limitation**: Optimized for English. Non-English performance may vary due to limited training data.
|
| 21 |
-
|
| 22 |
|
| 23 |
## Sample Implementation
|
| 24 |
|
|
|
|
| 13 |
|
| 14 |
Our User Simulator with Implicit Profiles (USP) replicates human-like conversational behavior in interactions with large language models (LLMs). By emulating diverse user dynamics based on predefined profiles, it reconstructs realistic user-LLM dialogues, leveraging the LLaMA-3-base-8B architecture with **Conditional Supervised Fine-Tuning (SFT)** and **Reinforcement Learning with Cycle Consistency (RLCC)**. For a detailed methodology and insights, refer to [Our Paper](#).
|
| 15 |
|
| 16 |
+
> *Note*: Our model is subject to the following constraints:
|
| 17 |
+
>
|
| 18 |
+
> 1. **Maximum Context Length**: Supports up to **4,096 tokens**. Exceeding this may degrade performance; keep inputs within this limit for best results.
|
| 19 |
+
>
|
| 20 |
+
> 1. **Language Limitation**: Optimized for English. Non-English performance may vary due to limited training data.
|
|
|
|
| 21 |
|
| 22 |
## Sample Implementation
|
| 23 |
|