Spaces:

SupraLabs
/

README

Running

LH-Tech-AI commited on 5 days ago

Commit

459dc49

verified ·

1 Parent(s): 4d733ae

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -28,6 +28,7 @@ We are **not** making bad (or we try not to!) models and we try to fully open so
 - Supra Mini **v3** 0.5M: the third version of the Supra Mini series.
 - Supra Mini **v4** 2M: the fourth version of the Supra Mini series. Improved. More powerful. With context understanding.
 - Supra Mini **v5** 8M: the fifth version of the Supra Mini series. A huge token-eater monster compared to its siblings.
 - MicroSupra 1k: Trained on GTX 750 Ti 4GB, a scaling laws experiment.
 - StorySupra-10M: Trained on RTX 5060 Ti 16GB for 10 minutes, coherent.
 - DistillSupra-0.2M: Trained on GTX 750 Ti 4GB for 30 minutes, still incoherent, but the first step for distillation research.

 - Supra Mini **v3** 0.5M: the third version of the Supra Mini series.
 - Supra Mini **v4** 2M: the fourth version of the Supra Mini series. Improved. More powerful. With context understanding.
 - Supra Mini **v5** 8M: the fifth version of the Supra Mini series. A huge token-eater monster compared to its siblings.
+- Supra Mini **v6** 1M: the sixth version of the Supra Mini series. Again a smaller one. Beating v2, v3 and v4 of the Supra Mini series
 - MicroSupra 1k: Trained on GTX 750 Ti 4GB, a scaling laws experiment.
 - StorySupra-10M: Trained on RTX 5060 Ti 16GB for 10 minutes, coherent.
 - DistillSupra-0.2M: Trained on GTX 750 Ti 4GB for 30 minutes, still incoherent, but the first step for distillation research.