Spaces:

mindchain
/

react-blog

Running

App Files Files Community

mindchain commited on Dec 30, 2025

Commit

e1eac63

1 Parent(s): a15826a

Add Gemma Scope 2 + Neuronpedia Discovery/Steering/Freezing Skills

Browse files

Files changed (1) hide show

index.html +35 -4

index.html CHANGED Viewed

@@ -158,13 +158,44 @@ Plus im Gateway: GitHub, Sentry, Z-Image, Web-Search, Browser Automation
 <strong>Die Kombination:</strong> Ralph liefert die Schleife, Beads das Gedächtnis, HF Skills das Lernen.
-Plus: Gemma Scope 2 + Neuronpedia für mechanistic interpretability - sieh WAS der Agent lernt!
-Links:
 <a href="https://github.com/anthropics/claude-code/tree/main/plugins/ralph-wiggum" class="link">Ralph Wiggum GitHub</a>
 <a href="https://github.com/steveyegge/beads" class="link">Beads GitHub</a>
 <a href="https://github.com/huggingface/skills" class="link">HF Skills GitHub</a>
-<a href="https://huggingface.co/blog/hf-skills-training" class="link">HF Skills Blog</a></div>
         </div>
         <div class="post">

 <strong>Die Kombination:</strong> Ralph liefert die Schleife, Beads das Gedächtnis, HF Skills das Lernen.
+<strong>5. Gemma Scope 2 + Neuronpedia (Interpretability + Steering)</strong>
+Das Agent-Training wird transparent und steuerbar.
+<span style="color: #667eea;">Discovery Skills</span> - WAS lernt der Agent?
+• SAE Features finden die das Verhalten bestimmen
+• Circuits identifizieren (Kausal-Ketten im Netzwerk)
+• Neuronpedia: 4TB+ activations, explanations, metadata
+• <a href="https://www.neuronpedia.org/gemma-scope-2" class="link">neuronpedia.org/gemma-scope-2</a>
+<span style="color: #667eea;">Steering Skills</span> - Verhalten beeinflussen
+• Feature-Stärke erhöhen/verringern (↑/↓)
+• API: POST /api/steer mit strength_multiplier
+• "Golden Gate Claude" aber für jeden Feature
+• <a href="https://docs.neuronpedia.org/steering" class="link">Neuronpedia Steering Docs</a>
+<span style="color: #667eea;">Freezing Skills</span> - Gelerntes fixieren
+• Wichtige Circuits identifizieren und speichern
+• Feature-Vektoren exportieren und wiederverwenden
+• Agent-Verhalten konsistent halten
+• <a href="https://github.com/hijohnnylin/neuronpedia-python" class="link">neuronpedia-python GitHub</a>
+<strong>Der erweiterte Loop:</strong>
+1. Ralph startet → Agent führt Task aus
+2. Beads tracked → Graph speichert Fortschritt
+3. Gemma Scope 2 → Activations werden analysiert
+4. Neuronpedia → Discovery: Wichtige Features finden
+5. Steering → Agent-Verhalten aktiv korrigieren
+6. HF Skills → Gelerntes in Model trainieren
+7. Freezing → Erfolgreiche Patterns fixieren
+8. Loop wiederholt → Verbesserter Agent
+<strong>Links:</strong>
 <a href="https://github.com/anthropics/claude-code/tree/main/plugins/ralph-wiggum" class="link">Ralph Wiggum GitHub</a>
 <a href="https://github.com/steveyegge/beads" class="link">Beads GitHub</a>
 <a href="https://github.com/huggingface/skills" class="link">HF Skills GitHub</a>
+<a href="https://huggingface.co/blog/hf-skills-training" class="link">HF Skills Blog</a>
+<a href="https://www.neuronpedia.org/api-doc" class="link">Neuronpedia API</a>
+<a href="https://deepmind.google/blog/gemma-scope-2" class="link">Gemma Scope 2 DeepMind</a></div>
         </div>
         <div class="post">