@juiceb0xc0de on Hugging Face: "Gemma-4-E2B SAE Atlas — Work in Progress JumpReLU Sparse Autoencoders trained…"

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

posted an update May 22

Post

240

Gemma-4-E2B SAE Atlas — Work in Progress

JumpReLU Sparse Autoencoders trained on every layer of Gemma-4-E2B-it using an adaptive Lagrangian controller. Training in progress. I'm publishing layers live as they come hot off the press for anyone interested in following along. I will be making further adjustments for finer resolution but the early data should be helpful I think? I'm just a bartender don't trust everything I say. 🤗 The Lagrangian math is pretty cool. It auto-steers the trainer taking the guess work out of hyperparameter adjustments.

Full paper and methodology when ever I get around to writing it up. There's a lot of work to be done. For now though, enjoy! 🤗

https://huggingface.co/juiceb0xc0de/gemma-4-e2b-saes

juiceb0xc0de

May 22

Update: I've completed the first 9 layers and will be taking a step back for a quick mo to adjust and update the auto trainer for finer resolution and other shit I have swimming around in my brain.

Naphula

May 23

•

edited May 23

Looks pretty interesting. Any way you could maybe combine some ideas from this post into your brain atlas? Maybe make it some kind of informative 'brain inspector' type tool for all models, with 3D support?

juiceb0xc0de

May 24

That is exactly what I'm planning on doing!

In this post