Context Window Configuration
Stack 2.9 uses full 128K context window (131072 tokens) to provide complete repository awareness.
Settings
- max_model_len: 131072
- max_seq_length: 131072
- block_size: 16 or 32 (adjust for memory/performance tradeoff)
Memory Requirements
| Context | A100 80GB (4-bit) | H100 80GB (4-bit) |
|---|---|---|
| 32K | ~20GB | ~18GB |
| 64K | ~35GB | ~32GB |
| 128K | ~60GB | ~55GB |
Throughput decreases slightly at longer contexts (~30% slower at 128K vs 32K) but provides full repository context.