TitleOS commited on
Commit
cd2c24c
·
verified ·
1 Parent(s): 951ec3c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -3
README.md CHANGED
@@ -1,3 +1,39 @@
1
- ---
2
- license: mpl-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - glaiveai/reasoning-v1-20m
4
+ language:
5
+ - en
6
+ base_model:
7
+ - facebook/galactica-1.3b
8
+ tags:
9
+ - reasoning
10
+ - text-generation-inference
11
+ - medical
12
+ - science
13
+ - chemistry
14
+ - biology
15
+ ---
16
+ ## What is Galactic Reasoning?
17
+
18
+ The Galactic Reasoning adapters are a collection of LoRA adapters, trained for the various sizes of the Facebook/Galactica models. These LoRAs enable the OPT architecture based Galactica models to use reasoning, inspired by more modern models like DeepSeek and OpenAI's O3.
19
+ To achieve this, the [glaiveai/reasoning-v1-20m](https://huggingface.co/datasets/glaiveai/reasoning-v1-20m) dataset was used for both training and evalulation of points.
20
+
21
+ | Size | Parameters | Galactic Reasoning Adapter |
22
+ |:-----------:|:-----------:|:--------------------------:|
23
+ | `mini` | 125 M | Coming Soon™ |
24
+ | `base` | 1.3 B | In Progress |
25
+ | `standard` | 6.7 B | Coming Soon™ |
26
+ | `large` | 30 B | Coming Soon™ |
27
+ | `huge` | 120 B | Short of a GPU grant, unlikely to happen. |
28
+
29
+ ## How were these adapters developed?
30
+ These adapters make use of the RS-LoRA finetuning method, trained locally on a Tesla P40. A rank of 64 and an alpha of 32 were utilized.
31
+ In addition, the following modules were targeted: ""q_proj", "v_proj", "k_proj", "out_proj", "fc1", "fc2"", including FC1 and FC2 due to the OPT architecture.
32
+
33
+ ## License
34
+ Mozilla Public License 2.0 Modified with CC. See license.md.
35
+
36
+ ## Credits
37
+ * Credit to Meta/Facebook for the Galactica OPT Based models.
38
+ * Credit to GlaiveAi for the reasoning-v1-20m dataset.
39
+ * Finally, credit to my highly overworked Tesla P40 who ran for days straight to produce this.