Update README.md
README.md
CHANGED
@@ -113,7 +113,7 @@ For a runnable end-to-end example, see [`examples/test_qwen3.py`](examples/test_
 
 ## Released Models
 
-| Base Model | TRIM-KV Checkpoints | Training Datasets |
+| Base Model | TRIM-KV Checkpoints | Training Datasets | Training Context Len | Training $M$ |
 |------------------------------|-----------------------------------------------|--------------------------|-------------------------|--------------|
 | Qwen3-1.7B | [TRIM-KV-Qwen3-1.7B-Math](https://huggingface.co/ngocbh/TrimKV-Qwen3-1.7B-Math) | OpenR1-Math-220k | 16K | 512 |
 | Qwen3-4B | [TRIM-KV-Qwen3-4B-Math](https://huggingface.co/ngocbh/TrimKV-Qwen3-4B-Math) | OpenR1-Math-220k | 16K | 512 |