Fix GitHub links: buildwithbooks -> DavinciDreams
README.md
CHANGED
@@ -40,7 +40,7 @@ model-index:
 
 A 4.98M parameter decoder-only Monarch Mixer model trained on classical philosophy texts, implemented entirely in Julia using Lux.jl. To our knowledge, this is the **first Monarch Mixer implementation in Julia**.
 
-Part of the [Julia SLM](https://github.com/
+Part of the [Julia SLM](https://github.com/DavinciDreams/julia-slm) family of models exploring alternative sequence mixing architectures.
 
 ## Model Family
 
@@ -236,8 +236,8 @@ text = generate(model, ps, st, tok, "the nature of ";
 ## Provenance
 
 - **Author**: LisaMegaWatts
-- **Training code**: [
-- **Data pipeline**: [
+- **Training code**: [DavinciDreams/julia-slm](https://github.com/DavinciDreams/julia-slm)
+- **Data pipeline**: [DavinciDreams/text-pipeline](https://github.com/DavinciDreams/text-pipeline)
 - **Training date**: February 2026
 - **Architecture reference**: Monarch Mixer (Dao et al., 2023), adapted for Julia/Lux.jl
 - **First Julia implementation** of Monarch Mixer sequence mixing
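
For anyone applying the same owner rename to other files, the change in this commit could be scripted roughly as follows. This is only a sketch, not the method used for the commit: the removed lines are truncated in the diff, so the old link form (`buildwithbooks/<repo>` in both the link label and the URL) is an assumption inferred from the commit title, and `retarget_github_links` is a hypothetical helper name.

```python
import re

# Assumption: old links looked like
#   [buildwithbooks/<repo>](https://github.com/buildwithbooks/<repo>)
# The diff truncates the removed lines, so this form is inferred from the
# commit title only.
OLD_OWNER = "buildwithbooks"
NEW_OWNER = "DavinciDreams"

# Match the old owner only where it is the owner segment of a GitHub URL
# or the start of a Markdown link label, and only when followed by "/".
_OWNER = re.compile(r"(github\.com/|\[)" + re.escape(OLD_OWNER) + r"(?=/)")


def retarget_github_links(text: str) -> tuple[str, int]:
    """Return the rewritten text and the number of replacements made."""
    return _OWNER.subn(r"\g<1>" + NEW_OWNER, text)


if __name__ == "__main__":
    line = (
        "- **Training code**: [buildwithbooks/julia-slm]"
        "(https://github.com/buildwithbooks/julia-slm)"
    )
    new_line, count = retarget_github_links(line)
    print(count, new_line)
```

Anchoring the pattern to `github.com/` or an opening `[` keeps unrelated mentions of the old name in prose untouched, and returning the replacement count makes it easy to confirm that every expected link was rewritten.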