Update README.md
Browse files
README.md
CHANGED
|
@@ -59,7 +59,6 @@ output = gate * ASPP(x) + (1-gate) * Attention(x)
|
|
| 59 |
```python
|
| 60 |
class ASPPOperator:
|
| 61 |
"""
|
| 62 |
-
Simplified ASPP without neighbor gathering to reduce overfitting
|
| 63 |
|
| 64 |
Forward pass:
|
| 65 |
1. Optional dimensionality reduction: h_t = down_proj(hidden_states)
|
|
@@ -246,7 +245,7 @@ Exponential convergence to fixed points with rate c=0.76 under Lipschitz continu
|
|
| 246 |
### Turing Completeness
|
| 247 |
Proven via cyclic tag system simulation - ASPP can compute any Turing-computable function given sufficient depth.
|
| 248 |
|
| 249 |
-
**Implementation Note**: This implementation simplifies theoretical ASPP to point-wise evolution
|
| 250 |
|
| 251 |
## Files in Checkpoint
|
| 252 |
|
|
|
|
| 59 |
```python
|
| 60 |
class ASPPOperator:
|
| 61 |
"""
|
|
|
|
| 62 |
|
| 63 |
Forward pass:
|
| 64 |
1. Optional dimensionality reduction: h_t = down_proj(hidden_states)
|
|
|
|
| 245 |
### Turing Completeness
|
| 246 |
Proven via cyclic tag system simulation - ASPP can compute any Turing-computable function given sufficient depth.
|
| 247 |
|
| 248 |
+
**Implementation Note**: This implementation simplifies theoretical ASPP to point-wise evolution to reduce overfitting while maintaining iterative refinement benefits.
|
| 249 |
|
| 250 |
## Files in Checkpoint
|
| 251 |
|