Spaces:
Configuration error
Configuration error
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,10 +1,36 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
-
|
| 3 |
-
|
| 4 |
-
|
| 5 |
-
|
| 6 |
-
|
| 7 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
---
|
| 9 |
|
| 10 |
-
|
|
|
|
| 1 |
+
# CoNDeNse
|
| 2 |
+
|
| 3 |
+
> **Compress the knowledge. Keep the capability.**
|
| 4 |
+
|
| 5 |
+
CoNDeNse is a research org built around one idea: small models don't have to be dumb. We take compact, efficient model architectures and train them on the reasoning traces and outputs of models many times their size — distilling capability downward without bloating parameter counts upward.
|
| 6 |
+
|
| 7 |
+
The name says it all: **Con**dense. Take what's big. Make it small. Lose as little as possible.
|
| 8 |
+
|
| 9 |
---
|
| 10 |
+
|
| 11 |
+
## Philosophy
|
| 12 |
+
|
| 13 |
+
- **No fluff.** We don't chase benchmarks with tricks. We train honestly and report honestly.
|
| 14 |
+
- **Smol is serious.** A 0.6B model that reasons is more useful than a 70B model you can't run.
|
| 15 |
+
- **Quality data > more data.** Every dataset we use is curated, filtered, and purposefully scoped.
|
| 16 |
+
- **Reproducibility first.** If you can't replicate it, it didn't happen.
|
| 17 |
+
|
| 18 |
+
---
|
| 19 |
+
|
| 20 |
+
## Support CoNDeNse
|
| 21 |
+
|
| 22 |
+
CoNDeNse is a solo research effort. There's no lab, no grant, no GPU cluster behind this — just genuine curiosity and a conviction that small models deserve better training.
|
| 23 |
+
|
| 24 |
+
The best way to support the work right now is simple: **download and use the models.** Every download signals that this direction matters. If a model works well for you, star the repo, share it, or drop a comment on the model card.
|
| 25 |
+
|
| 26 |
+
If you want to go further — contributions, dataset suggestions, or collaboration ideas — open an issue or reach out directly.
|
| 27 |
+
|
| 28 |
+
---
|
| 29 |
+
|
| 30 |
+
## License
|
| 31 |
+
|
| 32 |
+
All released models inherit the license of their respective base models. Dataset usage follows the terms of the original dataset authors. Training code is MIT.
|
| 33 |
+
|
| 34 |
---
|
| 35 |
|
| 36 |
+
*CoNDeNse — because the best model is the one that actually runs.*
|