Commit
·
d36cc58
1
Parent(s):
a7b39ef
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,4 +1,9 @@
|
|
| 1 |
-
# MPT-7B LoRA Patch
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
|
| 3 |
This is the Python model code for MPT-7B patched so that it can be used with a LoRA. Note that while I tested that it works and I get reasonable results out, it is very possible that the model isn't being trained correctly. The model code specifically says that left padding is not supported, but I forcibly did so and got decent results.
|
| 4 |
|
|
|
|
| 1 |
+
# MPT-7B LoRA Patch - multi GPU
|
| 2 |
+
|
| 3 |
+
|
| 4 |
+
Multi-GPU bugfix for MPT-30B
|
| 5 |
+
|
| 6 |
+
|
| 7 |
|
| 8 |
This is the Python model code for MPT-7B patched so that it can be used with a LoRA. Note that while I tested that it works and I get reasonable results out, it is very possible that the model isn't being trained correctly. The model code specifically says that left padding is not supported, but I forcibly did so and got decent results.
|
| 9 |
|