Pranabit
/

finetune_starcoder2_3b

Generated from Trainer

Model card Files Files and versions

Pranabit commited on Mar 30, 2024

Commit

b260baa

·

verified ·

1 Parent(s): 95dd3a9

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ This model is a fine-tuned version of [bigcode/starcoder2-3b](https://huggingfac
 ## Model description
-More information needed
 ## Intended uses & limitations

 ## Model description
+StarCoder2-3B model is a 3B parameter model trained on 17 programming languages from The Stack v2, with opt-out requests excluded. The model uses Grouped Query Attention, a context window of 16,384 tokens with a sliding window attention of 4,096 tokens, and was trained using the Fill-in-the-Middle objective on 3+ trillion tokens.
 ## Intended uses & limitations