StarCoderBase->StarCoder, more details?

#52

by Bilibili - opened Jun 12, 2023

Jun 12, 2023

StarCoder is kind of like "LLaMA" for code LLMs! Thanks for the brilliant job!

Since there is a paper about pretraining, can we have more details on the fine-tuning side?

For example, the paper says: "StarCoder is the fine-tuned version of StarCoderBase, trained on another 35B Python tokens (roughly 2 epochs)." I have these questions:

Does 'another' mean the tokens are not from The Stack? Is the dataset open?
Fine-tuning code and settings: can we find them somewhere?

Glad to have this lovely community!

loubnabnl

BigCode org Jun 15, 2023

It's the same Python dataset that we pre-trained StarCoder on; we just did two more epochs. You can find the training parameters in this Slurm script here.
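For a rough sense of scale, the figures quoted from the paper can be turned into a per-epoch token count. This is only a back-of-the-envelope sketch: it assumes "roughly 2 epochs" means exactly 2 and that "35B" is exact, neither of which is confirmed in this thread, and the variable names are purely illustrative.

```python
# Back-of-the-envelope check using only the figures quoted from the paper.
# Assumption: "roughly 2 epochs" is treated as exactly 2 epochs.
total_finetune_tokens = 35e9  # "another 35B Python tokens"
epochs = 2                    # "roughly 2 epochs"

# Implied size of one pass over the Python dataset, in tokens.
tokens_per_epoch = total_finetune_tokens / epochs

print(f"~{tokens_per_epoch / 1e9:.1f}B Python tokens per epoch")
```

Under those assumptions, one pass over the Python data comes to about half of the 35B fine-tuning budget; the actual figure depends on the tokenizer and the exact dataset version used.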

Bilibili

Jun 19, 2023

Thanks for the response, love u~

Bilibili changed discussion status to closed Jun 19, 2023
