Update README.md
Browse files
README.md
CHANGED
|
@@ -14,17 +14,17 @@ WebGen-Agent combines state-of-the-art language models with specialized training
|
|
| 14 |
|
| 15 |
Links to the data and model parameters are as follows:
|
| 16 |
|
| 17 |
-
| Data | HF Link |
|
| 18 |
|----------|------|
|
| 19 |
-
| webgen-agent_train_sft | π€ [luzimu/webgen-agent_train_sft](https://huggingface.co/datasets/luzimu/webgen-agent_train_sft) |
|
| 20 |
-
| webgen-agent_train_step-grpo | π€ [luzimu/webgen-agent_train_step-grpo](https://huggingface.co/datasets/luzimu/webgen-agent_train_step-grpo) |
|
| 21 |
|
| 22 |
-
| Model | HF Link |
|
| 23 |
|----------|------|
|
| 24 |
-
| WebGenAgent-LM-7B-SFT | π€ [luzimu/WebGenAgent-LM-7B-SFT](https://huggingface.co/luzimu/WebGenAgent-LM-7B-SFT) |
|
| 25 |
-
| WebGenAgent-LM-7B-Step-GRPO | π€ [luzimu/WebGenAgent-LM-7B-Step-GRPO](https://huggingface.co/luzimu/WebGenAgent-LM-7B-Step-GRPO) |
|
| 26 |
-
| WebGenAgent-LM-8B-SFT | π€ [luzimu/WebGenAgent-LM-8B-SFT](https://huggingface.co/luzimu/WebGenAgent-LM-8B-SFT) |
|
| 27 |
-
| WebGenAgent-LM-8B-Step-GRPO | π€ [luzimu/WebGenAgent-LM-8B-Step-GRPO](https://huggingface.co/luzimu/WebGenAgent-LM-8B-Step-GRPO) |
|
| 28 |
|
| 29 |
## How WebGen-Agent Works
|
| 30 |
|
|
|
|
| 14 |
|
| 15 |
Links to the data and model parameters are as follows:
|
| 16 |
|
| 17 |
+
| **Data** | **HF Link** |
|
| 18 |
|----------|------|
|
| 19 |
+
| **webgen-agent_train_sft** | π€ [luzimu/webgen-agent_train_sft](https://huggingface.co/datasets/luzimu/webgen-agent_train_sft) |
|
| 20 |
+
| **webgen-agent_train_step-grpo** | π€ [luzimu/webgen-agent_train_step-grpo](https://huggingface.co/datasets/luzimu/webgen-agent_train_step-grpo) |
|
| 21 |
|
| 22 |
+
| **Model** | **HF Link** |
|
| 23 |
|----------|------|
|
| 24 |
+
| **WebGenAgent-LM-7B-SFT** | π€ [luzimu/WebGenAgent-LM-7B-SFT](https://huggingface.co/luzimu/WebGenAgent-LM-7B-SFT) |
|
| 25 |
+
| **WebGenAgent-LM-7B-Step-GRPO** | π€ [luzimu/WebGenAgent-LM-7B-Step-GRPO](https://huggingface.co/luzimu/WebGenAgent-LM-7B-Step-GRPO) |
|
| 26 |
+
| **WebGenAgent-LM-8B-SFT** | π€ [luzimu/WebGenAgent-LM-8B-SFT](https://huggingface.co/luzimu/WebGenAgent-LM-8B-SFT) |
|
| 27 |
+
| **WebGenAgent-LM-8B-Step-GRPO** | π€ [luzimu/WebGenAgent-LM-8B-Step-GRPO](https://huggingface.co/luzimu/WebGenAgent-LM-8B-Step-GRPO) |
|
| 28 |
|
| 29 |
## How WebGen-Agent Works
|
| 30 |
|