Update README.md
README.md
@@ -1,3 +1,39 @@
---
datasets:
- glaiveai/reasoning-v1-20m
language:
- en
base_model:
- facebook/galactica-1.3b
tags:
- reasoning
- text-generation-inference
- medical
- science
- chemistry
- biology
---
## What is Galactic Reasoning?

The Galactic Reasoning adapters are a collection of LoRA adapters trained for the various sizes of the facebook/galactica models. These adapters enable the OPT-based Galactica models to reason before answering, inspired by more recent models such as DeepSeek and OpenAI's o3.
To achieve this, the [glaiveai/reasoning-v1-20m](https://huggingface.co/datasets/glaiveai/reasoning-v1-20m) dataset was used for both training and evaluation.

| Size | Parameters | Galactic Reasoning Adapter |
|:-----------:|:-----------:|:--------------------------:|
| `mini` | 125 M | Coming Soon™ |
| `base` | 1.3 B | In Progress |
| `standard` | 6.7 B | Coming Soon™ |
| `large` | 30 B | Coming Soon™ |
| `huge` | 120 B | Short of a GPU grant, unlikely to happen. |

## How were these adapters developed?
These adapters were trained locally on a Tesla P40 using the rsLoRA (rank-stabilized LoRA) fine-tuning method, with a rank of 64 and an alpha of 32.
The following modules were targeted: `q_proj`, `v_proj`, `k_proj`, `out_proj`, `fc1`, and `fc2`. `fc1` and `fc2` are included because they are the feed-forward layers in the OPT architecture.
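
As a rough illustration of the settings above, the sketch below collects the stated rank, alpha, and target modules into keyword arguments shaped for `peft.LoraConfig`, and compares the rsLoRA scaling factor with the classic LoRA one. The use of the PEFT library and the `task_type` value are assumptions, not details confirmed by this README; the helper `lora_scaling` is hypothetical and exists only for this example.

```python
import math

# Values stated in this README.
RANK = 64
ALPHA = 32
TARGET_MODULES = ["q_proj", "v_proj", "k_proj", "out_proj", "fc1", "fc2"]


def lora_scaling(alpha: float, rank: int, rank_stabilized: bool) -> float:
    """Return the factor applied to the low-rank update B @ A."""
    if rank_stabilized:
        return alpha / math.sqrt(rank)  # rsLoRA: alpha / sqrt(r)
    return alpha / rank  # classic LoRA: alpha / r


# Keyword arguments in the shape accepted by peft.LoraConfig.
# Assumption: PEFT was the training library; task_type is a guess.
lora_kwargs = {
    "r": RANK,
    "lora_alpha": ALPHA,
    "target_modules": TARGET_MODULES,
    "use_rslora": True,
    "task_type": "CAUSAL_LM",
}

if __name__ == "__main__":
    print(lora_scaling(ALPHA, RANK, rank_stabilized=True))   # 4.0
    print(lora_scaling(ALPHA, RANK, rank_stabilized=False))  # 0.5
```

With PEFT installed, `LoraConfig(**lora_kwargs)` would build the corresponding config object. The comparison shows why the method matters at this rank: with alpha 32 and rank 64, rsLoRA scales the update by 4.0, whereas classic LoRA would scale it by 0.5.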

## License
Mozilla Public License 2.0, modified with CC. See license.md.

## Credits
* Credit to Meta/Facebook for the OPT-based Galactica models.
* Credit to Glaive AI for the reasoning-v1-20m dataset.
* Finally, credit to my highly overworked Tesla P40, who ran for days straight to produce this.