Update model card
Browse files
README.md
CHANGED
|
@@ -90,21 +90,6 @@ The fine-tuned model generates **36% fewer tokens** while maintaining higher acc
|
|
| 90 |
| [**Toucan-1.5M**](https://huggingface.co/datasets/Agent-Ark/Toucan-1.5M) | 40,000 | **Negative** | Irrelevant queries (Server Shuffle method) |
|
| 91 |
| **Synthetic Negatives** | 6,000 | **Negative** | Domain mismatch, partial fulfillment, permission errors |
|
| 92 |
|
| 93 |
-
### Dataset Statistics
|
| 94 |
-
|
| 95 |
-
```
|
| 96 |
-
Total Training Samples: ~117,000
|
| 97 |
-
βββ Positive Samples: 71,248 (61%)
|
| 98 |
-
β βββ Single tool call: ~55K
|
| 99 |
-
β βββ Multi tool call: ~14K
|
| 100 |
-
βββ Negative Samples: 46,000 (39%)
|
| 101 |
-
βββ Toucan Irrelevant: 40,000 (86.9%)
|
| 102 |
-
βββ Domain Mismatch: 3,000 (6.5%)
|
| 103 |
-
βββ Action Mismatch: 1,500 (3.3%)
|
| 104 |
-
βββ Partial Fulfillment: 1,000 (2.2%)
|
| 105 |
-
βββ Permission/Auth: 300 (0.7%)
|
| 106 |
-
βββ Format Mismatch: 200 (0.4%)
|
| 107 |
-
```
|
| 108 |
|
| 109 |
### Negative Sample Types
|
| 110 |
|
|
|
|
| 90 |
| [**Toucan-1.5M**](https://huggingface.co/datasets/Agent-Ark/Toucan-1.5M) | 40,000 | **Negative** | Irrelevant queries (Server Shuffle method) |
|
| 91 |
| **Synthetic Negatives** | 6,000 | **Negative** | Domain mismatch, partial fulfillment, permission errors |
|
| 92 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 93 |
|
| 94 |
### Negative Sample Types
|
| 95 |
|