Instructions to use capabl-machines/gridops-strategy-selector-v7 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use capabl-machines/gridops-strategy-selector-v7 with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-1.5B-Instruct") model = PeftModel.from_pretrained(base_model, "capabl-machines/gridops-strategy-selector-v7") - Notebooks
- Google Colab
- Kaggle
Clarify Capabl Machines harness positioning
Browse files
README.md
CHANGED
|
@@ -31,6 +31,10 @@ microgrids: a small learned strategy selector, a deterministic optimizer, and a
|
|
| 31 |
simulated OpenEnv world where every decision is scored by cost, reliability,
|
| 32 |
and diesel use.
|
| 33 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 34 |
GridOps Strategy Selector v7 is the learned part of that system. It does not
|
| 35 |
pretend to be the whole grid engineer. It reads a GridOps/OpenEnv observation
|
| 36 |
and emits strict strategy JSON. A causal optimizer then converts that strategy
|
|
@@ -48,6 +52,29 @@ microgrid state
|
|
| 48 |
That split keeps the language model focused on contextual judgment while
|
| 49 |
leaving constrained numerical dispatch to an optimizer.
|
| 50 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 51 |
## Why This Matters
|
| 52 |
|
| 53 |
Distributed solar is scaling quickly across Indian apartments, societies,
|
|
@@ -75,6 +102,10 @@ environment for truth
|
|
| 75 |
metrics for accountability
|
| 76 |
```
|
| 77 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 78 |
## What We Built
|
| 79 |
|
| 80 |
The environment is a 72-hour community microgrid simulation with three regimes:
|
|
@@ -127,6 +158,12 @@ central engineering result. The strategy abstraction and optimizer harness do
|
|
| 127 |
most of the heavy lifting. The adapter is the packaged, reproducible, audited
|
| 128 |
selector from the training pipeline, but the architecture is the real unlock.
|
| 129 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 130 |
| Release highlight | Value |
|
| 131 |
|---|---:|
|
| 132 |
| v7 deterministic controller average score | 0.7907 |
|
|
|
|
| 31 |
simulated OpenEnv world where every decision is scored by cost, reliability,
|
| 32 |
and diesel use.
|
| 33 |
|
| 34 |
+
GridOps is also a case study in the Capabl Machines thesis: climate-heavy
|
| 35 |
+
problems need more than a checkpoint. They need a harness where models,
|
| 36 |
+
optimizers, simulators, validators, rewards, and evaluation loops work together.
|
| 37 |
+
|
| 38 |
GridOps Strategy Selector v7 is the learned part of that system. It does not
|
| 39 |
pretend to be the whole grid engineer. It reads a GridOps/OpenEnv observation
|
| 40 |
and emits strict strategy JSON. A causal optimizer then converts that strategy
|
|
|
|
| 52 |
That split keeps the language model focused on contextual judgment while
|
| 53 |
leaving constrained numerical dispatch to an optimizer.
|
| 54 |
|
| 55 |
+
## What Capabl Machines Provides
|
| 56 |
+
|
| 57 |
+
This release should not be read as "a LoRA beat every baseline." It should be
|
| 58 |
+
read as evidence for a stronger company pattern:
|
| 59 |
+
|
| 60 |
+
```text
|
| 61 |
+
Capabl Machines builds climate AI operating harnesses.
|
| 62 |
+
|
| 63 |
+
The harness defines:
|
| 64 |
+
- the environment where decisions are tested;
|
| 65 |
+
- the schema for valid actions;
|
| 66 |
+
- the tools that handle physics and constraints;
|
| 67 |
+
- the critics and rewards that judge outcomes;
|
| 68 |
+
- the datasets that teach useful behavior;
|
| 69 |
+
- the model layer that selects intent or strategy;
|
| 70 |
+
- the evals that decide whether the system is actually better.
|
| 71 |
+
```
|
| 72 |
+
|
| 73 |
+
For some domains, the trained model will be the main breakthrough. For others,
|
| 74 |
+
as GridOps shows, the harness and model interface may be the breakthrough. The
|
| 75 |
+
customer still gets the thing that matters: an accountable AI operating system
|
| 76 |
+
for a real climate workflow.
|
| 77 |
+
|
| 78 |
## Why This Matters
|
| 79 |
|
| 80 |
Distributed solar is scaling quickly across Indian apartments, societies,
|
|
|
|
| 102 |
metrics for accountability
|
| 103 |
```
|
| 104 |
|
| 105 |
+
That is the product direction: not model magic, but tested climate-AI systems
|
| 106 |
+
that can be adapted to energy, water, agriculture, logistics, robotics, and
|
| 107 |
+
resilient infrastructure.
|
| 108 |
+
|
| 109 |
## What We Built
|
| 110 |
|
| 111 |
The environment is a 72-hour community microgrid simulation with three regimes:
|
|
|
|
| 158 |
most of the heavy lifting. The adapter is the packaged, reproducible, audited
|
| 159 |
selector from the training pipeline, but the architecture is the real unlock.
|
| 160 |
|
| 161 |
+
This is exactly why Capabl Machines focuses on the harness and model together.
|
| 162 |
+
If a base model is already strong once the interface is correct, we should use
|
| 163 |
+
that. If a domain needs post-training, we should fine-tune. The job is not to
|
| 164 |
+
force model training into every problem; the job is to deliver the most reliable
|
| 165 |
+
AI operating loop for the climate system in front of us.
|
| 166 |
+
|
| 167 |
| Release highlight | Value |
|
| 168 |
|---|---:|
|
| 169 |
| v7 deterministic controller average score | 0.7907 |
|