Update README.md
README.md CHANGED
@@ -15,8 +15,10 @@ Paper abstract:
 - **Repository:** https://github.com/abao1999/panda
 
 <!-- This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1). -->
+This checkpoint was trained for (only) 100k iterations, with per-device batch size 1024, across 4 AMD MI100X GPUs.
 
 NOTE: we are currently in the process of scaling up our model and training, so stay tuned!
+Update: We have released a bigger model: [panda-72M](https://huggingface.co/GilpinLab/panda-72M)
 
 ## Citation
 