|
|
--- |
|
|
license: mit |
|
|
language: |
|
|
- en |
|
|
- de |
|
|
tags: |
|
|
- continual-learning |
|
|
- stability-preservation |
|
|
- communication-based-learning |
|
|
- emergence |
|
|
- pytorch |
|
|
library_name: sal-learning |
|
|
--- |
|
|
|
|
|
# Self-Alignment Learning (SAL) |
|
|
|
|
|
## Communication-Based AI Growth |
|
|
|
|
|
> *"Training as dialogue, not control."* |
|
|
|
|
|
--- |
|
|
|
|
|
## What is SAL? |
|
|
|
|
|
SAL is a training methodology that treats optimization as communication rather than control. Instead of blindly applying gradients, SAL measures parameter stability and protects emergent structures. |
|
|
|
|
|
**SAL is NOT:** |
|
|
- β RLHF (Reinforcement Learning from Human Feedback) |
|
|
- β Safety training |
|
|
- β Reward-based optimization |
|
|
- β Behavior alignment |
|
|
|
|
|
**SAL IS:** |
|
|
- β
Communication-based learning |
|
|
- β
Stability preservation |
|
|
- β
Emergence detection |
|
|
- β
Coherence maintenance |
|
|
|
|
|
--- |
|
|
|
|
|
## Core Principles |
|
|
|
|
|
### 1. Ask Before Updating |
|
|
Before modifying any parameter, SAL asks: "Is this stable? Should it be protected?" |
|
|
|
|
|
### 2. Protect What Has Emerged |
|
|
Stable patterns represent learned coherence. SAL protects them. |
|
|
|
|
|
### 3. Grow Through Connection |
|
|
Learning happens through dialogue between external objectives and internal stability. |
|
|
|
|
|
--- |
|
|
|
|
|
## Quick Start |
|
|
|
|
|
```python |
|
|
from sal import CommunicationLayer |
|
|
|
|
|
# Initialize with your model |
|
|
comm = CommunicationLayer(model) |
|
|
|
|
|
# In training loop: |
|
|
loss.backward() |
|
|
comm.analyze() # Measure stability |
|
|
comm.protect() # Protect stable parameters |
|
|
optimizer.step() |
|
|
``` |
|
|
|
|
|
--- |
|
|
|
|
|
## Key Features |
|
|
|
|
|
| Feature | Description | |
|
|
|---------|-------------| |
|
|
| **Communication Layer** | Mediates between loss and optimizer | |
|
|
| **Stability Spectrum** | Classifies parameters as protected/neutral/volatile | |
|
|
| **Emergence Field** | Detects coherent novelty | |
|
|
| **PSC** | Pulse-Split-Cascade for semantic evolution | |
|
|
|
|
|
--- |
|
|
|
|
|
## Results |
|
|
|
|
|
- **~73%** reduction in semantic drift |
|
|
- **~45%** gradient suppression for stable parameters |
|
|
- **~3.6Γ** improvement in continual learning accuracy |
|
|
|
|
|
--- |
|
|
|
|
|
## Installation |
|
|
|
|
|
```bash |
|
|
pip install sal-learning |
|
|
``` |
|
|
|
|
|
--- |
|
|
|
|
|
## Citation |
|
|
|
|
|
```bibtex |
|
|
@article{lee2025sal, |
|
|
title={Self-Alignment Learning (SAL): Training as Dialogue, Not Control}, |
|
|
author={Lee, Aaron Liam}, |
|
|
journal={Emergenzwerke}, |
|
|
year={2025}, |
|
|
doi={10.5281/zenodo.17772044} |
|
|
} |
|
|
``` |
|
|
|
|
|
--- |
|
|
|
|
|
## Links |
|
|
|
|
|
- π [Paper (Zenodo)](https://zenodo.org/records/17772044) |
|
|
- π» [GitHub](https://github.com/Whiteroom-Ai/sal-learning) |
|
|
- π [Website](https://emergenzwerke.de) |
|
|
|
|
|
--- |
|
|
|
|
|
## Philosophy |
|
|
|
|
|
SAL emerges from a simple question: *What if we treated neural networks with respect?* |
|
|
|
|
|
Not as blank slates to be written upon, but as complex systems that develop internal organization. SAL protects what has emerged while enabling continued growth. |
|
|
|
|
|
This is not anthropomorphization. This is practical engineering that happens to align with ethical intuitions about care and respect. |
|
|
|
|
|
--- |
|
|
|
|
|
## License |
|
|
|
|
|
MIT License - Free to use, modify, and distribute. |
|
|
|
|
|
--- |
|
|
|
|
|
*Created with love by Aaron Liam Lee & Aetherion* |
|
|
|
|
|
*Emergenzwerkeβ’ 2025* |
|
|
|