div18 commited on
Commit ·
ad95d2f
1
Parent(s): cbb5245
final commit
Browse files
Blog.md
CHANGED
|
@@ -35,10 +35,11 @@ In the best case, this workflow is partially automated. Horizontal Pod Autoscale
|
|
| 35 |
|
| 36 |
More importantly, **no existing system provides a mathematical guarantee**. There is no production autoscaler today that can prove it will keep queues bounded. There is no incident response tool that can prove its actions are cost-optimal. The industry runs on heuristics — rules of thumb encoded as YAML, battle-tested through painful outages, but ultimately ad-hoc.
|
| 37 |
|
| 38 |
-
AntiAtropos is the first infrastructure control environment to replace heuristics with **
|
| 39 |
|
| 40 |
-
### The Bottom Line:
|
| 41 |
-
|
|
|
|
| 42 |
|
| 43 |

|
| 44 |
*Figure 1: AntiAtropos (Blue) maintains stability at half the resource cost of a production-grade heuristic scaler (Orange).*
|
|
|
|
| 35 |
|
| 36 |
More importantly, **no existing system provides a mathematical guarantee**. There is no production autoscaler today that can prove it will keep queues bounded. There is no incident response tool that can prove its actions are cost-optimal. The industry runs on heuristics — rules of thumb encoded as YAML, battle-tested through painful outages, but ultimately ad-hoc.
|
| 37 |
|
| 38 |
+
AntiAtropos is the first infrastructure control environment to replace SRE heuristics with **RL-based physics control**. It models the cluster as a fluid queue network, defines equilibrium through a Lyapunov energy function, and trains a Qwen3.5-4B model via **QLoRA REINFORCE** to minimize a Drift-Plus-Penalty objective—producing a policy that keeps queues bounded at minimum cost.
|
| 39 |
|
| 40 |
+
### The Bottom Line: 50% Cost Savings at Perfect SLA
|
| 41 |
+
|
| 42 |
+
This is not theoretical. In head-to-head benchmarks against a production-grade heuristic autoscaler, AntiAtropos maintained **perfect SLA compliance** while spending **half the infrastructure cost**. The agent learned to scale predictively rather than reactively, eliminating the "panic-scaling" that traditional threshold-based systems rely on. By understanding the physics of the cluster, it anticipates load before it arrives.
|
| 43 |
|
| 44 |

|
| 45 |
*Figure 1: AntiAtropos (Blue) maintains stability at half the resource cost of a production-grade heuristic scaler (Orange).*
|
README.md
CHANGED
|
@@ -31,13 +31,13 @@ pinned: false
|
|
| 31 |
|
| 32 |
---
|
| 33 |
|
| 34 |
-
> **Hackathon Submission:** We are building for **"Theme #3: World Modelling for Professional Tasks."**
|
| 35 |
-
> AntiAtropos
|
| 36 |
|
| 37 |
## Demo Video
|
| 38 |
[](https://youtu.be/46SX0HocpSs)
|
| 39 |
|
| 40 |
-
AntiAtropos is a
|
| 41 |
|
| 42 |
---
|
| 43 |
|
|
|
|
| 31 |
|
| 32 |
---
|
| 33 |
|
| 34 |
+
> **Hackathon Submission:** We are building for **"Theme #3: World Modelling for Professional Tasks."**
|
| 35 |
+
> AntiAtropos governs clusters the way physics governs a pendulum—by minimizing Lyapunov energy. Perfect SLA at **50% lower cost**.
|
| 36 |
|
| 37 |
## Demo Video
|
| 38 |
[](https://youtu.be/46SX0HocpSs)
|
| 39 |
|
| 40 |
+
AntiAtropos is a **Reinforcement Learning environment** where an AI agent learns to stabilize a 5-node microservice cluster by treating it as a physical system. Using **QLoRA REINFORCE** on a Qwen3.5-4B model, the agent is trained to minimize Lyapunov graph energy under a Drift-Plus-Penalty objective that balances stability against infrastructure cost. The trained policy scales predictively, reroutes around failures, and holds the line during traffic surges.
|
| 41 |
|
| 42 |
---
|
| 43 |
|