Update README.md
Browse files
README.md
CHANGED
|
@@ -101,4 +101,11 @@ This is `BEE-spoke-data/smol_llama-220M-GQA` fine-tuned for code generation on:
|
|
| 101 |
|
| 102 |
- filtered version of stack-smol-XL
|
| 103 |
- deduped version of 'algebraic stack' from proof-pile-2
|
| 104 |
-
- cleaned and deduped pypi (last dataset)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 101 |
|
| 102 |
- filtered version of stack-smol-XL
|
| 103 |
- deduped version of 'algebraic stack' from proof-pile-2
|
| 104 |
+
- cleaned and deduped pypi (last dataset)
|
| 105 |
+
|
| 106 |
+
This model (and the base model) were both trained using ctx length 2048.
|
| 107 |
+
|
| 108 |
+
Example script for inference testing: [here](https://gist.github.com/pszemraj/c7738f664a64b935a558974d23a7aa8c)
|
| 109 |
+
|
| 110 |
+
|
| 111 |
+
---
|