CannaeAI commited on
Commit
0965cd4
·
verified ·
1 Parent(s): 9adc08e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -12
README.md CHANGED
@@ -1,21 +1,24 @@
1
  ---
2
- base_model: unsloth/llama-3.2-1b-instruct-bnb-4bit
 
3
  tags:
4
  - text-generation-inference
5
  - transformers
6
- - unsloth
 
 
 
7
  - llama
 
8
  license: apache-2.0
9
  language:
10
  - en
 
 
11
  ---
12
-
13
- # Uploaded finetuned model
14
-
15
- - **Developed by:** CannaeAI
16
- - **License:** apache-2.0
17
- - **Finetuned from model :** unsloth/llama-3.2-1b-instruct-bnb-4bit
18
-
19
- This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
20
-
21
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
1
  ---
2
+ base_model:
3
+ - meta-llama/Llama-3.2-1B-Instruct
4
  tags:
5
  - text-generation-inference
6
  - transformers
7
+ - reasoning
8
+ - math
9
+ - thinking
10
+ - conversational
11
  - llama
12
+ - meta
13
  license: apache-2.0
14
  language:
15
  - en
16
+ datasets:
17
+ - unsloth/OpenMathReasoning-mini
18
  ---
19
+ # ReasoningLlama-Math-1B-IT
20
+ ## Model Description
21
+ This is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the [unsloth/OpenMathReasoning-mini](https://huggingface.co/datasets/unsloth/OpenMathReasoning-mini)which is a small version of the [nvidia/OpenMathReasoning](https://huggingface.co/datasets/nvidia/OpenMathReasoning) dataset which was used to win the [AIMO](https://www.kaggle.com/competitions/ai-mathematical-olympiad-progress-prize-2/leaderboard) (AI Mathematical Olympiad) challenge!
22
+ - **recommended settings for inference:** min_p = 0.1 and temperature = 1.5 , Read this [Tweet](https://x.com/menhguin/status/1826132708508213629) to understand why.
23
+ - **License :** apache-2.0
24
+ - **Finetuned from model :** meta-llama/Llama-3.2-1B-Instruct