Fix SFT source paths in README
README.md CHANGED
@@ -4,12 +4,12 @@ Uploaded MatchTIR merged/HF checkpoints for BFCL follow-up evaluation.
 
 | Folder | Source directory |
 |---|---|
-| `1.7B-SFT` | `/storage/
+| `1.7B-SFT` | `/storage/home/hcoda1/3/zdu89/scratch/MatchTIR/checkpoints/MatchTIR/MatchTIR-1.7B-extra-SFT/global_step_24` |
 | `1.7B-GRPO` | `/storage/scratch1/3/zdu89/MatchTIR/checkpoints/MatchTIR/MatchTIR-1.7B-answer-em-GRPO-step24-hf` |
 | `1.7B-FTRL` | `/storage/scratch1/3/zdu89/MatchTIR/checkpoints/MatchTIR/MatchTIR-1.7B-ftrl-GRPO-step24-hf` |
 | `1.7B-ToolRL` | `/storage/scratch1/3/zdu89/MatchTIR/checkpoints/MatchTIR/MatchTIR-1.7B-toolrl-GRPO-step24-hf` |
 | `1.7B-CHORD` | `/storage/scratch1/3/zdu89/MatchTIR/checkpoints/MatchTIR/MatchTIR-chord-phi-1p7B-resume12-step24-hf` |
-| `4B-SFT` | `/storage/
+| `4B-SFT` | `/storage/home/hcoda1/3/zdu89/scratch/MatchTIR/checkpoints/MatchTIR/MatchTIR-4B-extra-SFT/global_step_24` |
 | `4B-GRPO` | `/storage/scratch1/3/zdu89/MatchTIR/checkpoints/MatchTIR/MatchTIR-4B-answer-em-GRPO-step24-hf` |
 | `4B-FTRL` | `/storage/scratch1/3/zdu89/MatchTIR/checkpoints/MatchTIR/MatchTIR-4B-ftrl-GRPO-step24-hf` |
 | `4B-ToolRL` | `/storage/scratch1/3/zdu89/MatchTIR/checkpoints/MatchTIR/MatchTIR-4B-toolrl-GRPO-step24-hf` |