lr note
README.md
CHANGED
@@ -9,6 +9,13 @@ Special thanks to https://huggingface.co/fahadh4ilyas
 convert_v2.py
 ```
 
+Training Notes:
+```
+# dbrx trains like a much smaller model (~7B)
+# start with this as reference point and move up or down based on eval/train loss
+learning_rate = 1.5e-5
+```
+
 Known Issues:
 
 1. [QUANT GPTQ] PENDING: You can help test quant and/or follow progress at https://github.com/AutoGPTQ/AutoGPTQ/pull/625
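
The training note added in this change (start at `learning_rate = 1.5e-5`, then move up or down based on eval/train loss) can be read as a small sweep around the reference value. The following is a minimal sketch of that idea, not code from the repository: the helper names `lr_candidates` and `pick_best_lr` and the scaling factors `0.5`, `1.0`, `2.0` are assumptions chosen for illustration, and the eval losses must come from your own short trial runs.

```python
# Hypothetical helpers, not part of this repository.
# The only value taken from the README is the reference learning rate.
REFERENCE_LR = 1.5e-5


def lr_candidates(reference=REFERENCE_LR, factors=(0.5, 1.0, 2.0)):
    """Return learning rates to try, scaled around the reference value.

    The factors are an illustrative assumption; widen or narrow them
    depending on how the eval/train loss behaves.
    """
    return [reference * f for f in factors]


def pick_best_lr(eval_loss_by_lr):
    """Return the learning rate whose trial run had the lowest eval loss.

    eval_loss_by_lr maps each tried learning rate to the eval loss the
    caller measured for it; this function does no training itself.
    """
    if not eval_loss_by_lr:
        raise ValueError("no trial results supplied")
    return min(eval_loss_by_lr, key=eval_loss_by_lr.get)
```

A run would train briefly at each value from `lr_candidates()`, record the eval loss per value in a dict, and pass that dict to `pick_best_lr`. If the best value sits at the edge of the sweep, recentering the sweep on it and repeating matches the "move up or down" advice in the note.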