Update README.md
Browse files
README.md
CHANGED
|
@@ -129,3 +129,6 @@ This model is based on the LLaMA (Large Language Model Meta AI) architecture and
|
|
| 129 |
|
| 130 |
The training was performed on Tesla T4 gpu with 4-bit quantization and gradient checkpointing to optimize memory usage.
|
| 131 |
|
|
|
|
|
|
|
|
|
|
|
|
| 129 |
|
| 130 |
The training was performed on Tesla T4 gpu with 4-bit quantization and gradient checkpointing to optimize memory usage.
|
| 131 |
|
| 132 |
+
|
| 133 |
+
### Feel free to provide any missing details or correct the assumptions made, and I'll update the model card accordingly.
|
| 134 |
+
|