open-r1
/

OlympicCoder-7B

Text Generation

text-generation-inference

Model card Files Files and versions

edbeeching HF Staff commited on Mar 11, 2025

Commit

f928d4f

·

verified ·

1 Parent(s): 98dc221

Update README.md

Files changed (1) hide show

README.md +22 -3

README.md CHANGED Viewed

@@ -46,13 +46,32 @@ pipe = pipeline("text-generation", model="open-r1/NormolLM-coder-7b-v02.12", tor
 # We use the tokenizer's chat template to format each message - see https://huggingface.co/docs/transformers/main/en/chat_templating
 messages = [
-    {"role": "user", "content": "Write a python program to calulate the 10th fibonaci number"},
 ]
 prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
 outputs = pipe(prompt, max_new_tokens=8000, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
 print(outputs[0]["generated_text"])
 #<|im_start|>user
-#Write a python program to calulate the 10th fibonaci number<|im_end|>
 #<|im_start|>assistant
 #<think>Okay, I need to write a Python program that calculates the 10th Fibonacci number. Hmm, the Fibonacci sequence starts with 0 and 1. Each subsequent number is the sum of the two preceding ones. So the sequence goes: 0, 1, 1, 2, 3, 5, 8, 13, 21, 34, and so on. ...
-```

 # We use the tokenizer's chat template to format each message - see https://huggingface.co/docs/transformers/main/en/chat_templating
 messages = [
+    {"role": "user", "content": "Write a python program to calculate the 10th Fibonacci number"},
 ]
 prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
 outputs = pipe(prompt, max_new_tokens=8000, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
 print(outputs[0]["generated_text"])
 #<|im_start|>user
+#Write a python program to calculate the 10th fibonacci number<|im_end|>
 #<|im_start|>assistant
 #<think>Okay, I need to write a Python program that calculates the 10th Fibonacci number. Hmm, the Fibonacci sequence starts with 0 and 1. Each subsequent number is the sum of the two preceding ones. So the sequence goes: 0, 1, 1, 2, 3, 5, 8, 13, 21, 34, and so on. ...
+```
+## Training procedure
+### Training hyper-parameters
+The following hyperparameters were used during training:
+learning_rate: 4.0e-5
+train_batch_size: 2
+seed: 42
+packing: false
+distributed_type: deepspeed-zero-3
+num_devices: 8
+gradient_accumulation_steps: 8
+total_train_batch_size: 16
+optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+lr_scheduler_type: cosine_with_min_lr
+min_lr_rate: 0.1
+lr_scheduler_warmup_ratio: 0.03
+num_epochs: 10.0