77ethers commited on
Commit
af1c449
·
verified ·
1 Parent(s): 388a348

Clarify Capabl Machines harness positioning

Browse files
Files changed (1) hide show
  1. README.md +37 -0
README.md CHANGED
@@ -31,6 +31,10 @@ microgrids: a small learned strategy selector, a deterministic optimizer, and a
31
  simulated OpenEnv world where every decision is scored by cost, reliability,
32
  and diesel use.
33
 
 
 
 
 
34
  GridOps Strategy Selector v7 is the learned part of that system. It does not
35
  pretend to be the whole grid engineer. It reads a GridOps/OpenEnv observation
36
  and emits strict strategy JSON. A causal optimizer then converts that strategy
@@ -48,6 +52,29 @@ microgrid state
48
  That split keeps the language model focused on contextual judgment while
49
  leaving constrained numerical dispatch to an optimizer.
50
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
51
  ## Why This Matters
52
 
53
  Distributed solar is scaling quickly across Indian apartments, societies,
@@ -75,6 +102,10 @@ environment for truth
75
  metrics for accountability
76
  ```
77
 
 
 
 
 
78
  ## What We Built
79
 
80
  The environment is a 72-hour community microgrid simulation with three regimes:
@@ -127,6 +158,12 @@ central engineering result. The strategy abstraction and optimizer harness do
127
  most of the heavy lifting. The adapter is the packaged, reproducible, audited
128
  selector from the training pipeline, but the architecture is the real unlock.
129
 
 
 
 
 
 
 
130
  | Release highlight | Value |
131
  |---|---:|
132
  | v7 deterministic controller average score | 0.7907 |
 
31
  simulated OpenEnv world where every decision is scored by cost, reliability,
32
  and diesel use.
33
 
34
+ GridOps is also a case study in the Capabl Machines thesis: climate-heavy
35
+ problems need more than a checkpoint. They need a harness where models,
36
+ optimizers, simulators, validators, rewards, and evaluation loops work together.
37
+
38
  GridOps Strategy Selector v7 is the learned part of that system. It does not
39
  pretend to be the whole grid engineer. It reads a GridOps/OpenEnv observation
40
  and emits strict strategy JSON. A causal optimizer then converts that strategy
 
52
  That split keeps the language model focused on contextual judgment while
53
  leaving constrained numerical dispatch to an optimizer.
54
 
55
+ ## What Capabl Machines Provides
56
+
57
+ This release should not be read as "a LoRA beat every baseline." It should be
58
+ read as evidence for a stronger company pattern:
59
+
60
+ ```text
61
+ Capabl Machines builds climate AI operating harnesses.
62
+
63
+ The harness defines:
64
+ - the environment where decisions are tested;
65
+ - the schema for valid actions;
66
+ - the tools that handle physics and constraints;
67
+ - the critics and rewards that judge outcomes;
68
+ - the datasets that teach useful behavior;
69
+ - the model layer that selects intent or strategy;
70
+ - the evals that decide whether the system is actually better.
71
+ ```
72
+
73
+ For some domains, the trained model will be the main breakthrough. For others,
74
+ as GridOps shows, the harness and model interface may be the breakthrough. The
75
+ customer still gets the thing that matters: an accountable AI operating system
76
+ for a real climate workflow.
77
+
78
  ## Why This Matters
79
 
80
  Distributed solar is scaling quickly across Indian apartments, societies,
 
102
  metrics for accountability
103
  ```
104
 
105
+ That is the product direction: not model magic, but tested climate-AI systems
106
+ that can be adapted to energy, water, agriculture, logistics, robotics, and
107
+ resilient infrastructure.
108
+
109
  ## What We Built
110
 
111
  The environment is a 72-hour community microgrid simulation with three regimes:
 
158
  most of the heavy lifting. The adapter is the packaged, reproducible, audited
159
  selector from the training pipeline, but the architecture is the real unlock.
160
 
161
+ This is exactly why Capabl Machines focuses on the harness and model together.
162
+ If a base model is already strong once the interface is correct, we should use
163
+ that. If a domain needs post-training, we should fine-tune. The job is not to
164
+ force model training into every problem; the job is to deliver the most reliable
165
+ AI operating loop for the climate system in front of us.
166
+
167
  | Release highlight | Value |
168
  |---|---:|
169
  | v7 deterministic controller average score | 0.7907 |