AbstractPhil committed · Commit 7632915 · verified · 1 Parent(s): c22791e

Update README.md

  1. README.md +49 -24
README.md CHANGED
@@ -1,10 +1,11 @@
1
  ---
2
  title: README
3
- emoji: 👀
4
  colorFrom: purple
5
- colorTo: pink
6
  sdk: static
7
  pinned: false
 
8
  ---
9
  # Abstract Powered
10
  ### Independent AI Research Cooperative — modular, geometric, and ruthlessly efficient
@@ -21,42 +22,51 @@ We build and study **self-crystallizing** AI systems: models that grow by attach
21
 
22
  Our core thesis:
23
  - **Modularization is not a convenience; it is the canonical form of AI.**
24
- - **Geometry beats guesswork.** Symbolic, pentachoron-based representations provide stability, interpretability, and repeatability.
25
  - **Compactness wins.** Rapid iteration on small, composable blocks outpaces massive, monolithic retrains.
26
 
27
  ---
28
 
29
  ## Mission
30
 
31
- - **Primary research goal:** advance machine **sentience research** responsibly—curating introspection and rationalization in repeatable, measurable protocols.
32
  - **Operational byproduct:** a scalable method for **compact, compartmentalized training**—requiring commodity setups (e.g., RunPod) rather than colossal cloud clusters.
33
 
34
  We aim to move the field from “expensive novelty” to **affordable repeatability**.
35
 
 
 
 
 
36
  ---
37
 
38
  ## Research Thesis (Plain Language)
39
 
40
  Modern models grow by accretion and inertia. We refactor them into **crystalline components**:
41
 
42
- 1. **Geometric Core**
43
- Knowledge is encoded as **pentachora** (5-vertex crystals). Decision-making uses **MAE crystal energy** against a reusable dictionary—no L2 routing, no structural normalization.
 
 
 
44
 
45
- 2. **Vocabulary Register**
46
- A reusable, batched, indexed dictionary of **tokens → crystals** (and volumes).
47
- - Fast O(1) queries for crystals and Cayley–Menger volume.
48
- - Auto-subset loading; **Top-3 cosine** OOV composites.
49
- - Logs model expansions so experiments **compound**.
50
 
51
- 3. **Assistant Fabric**
52
  Small, disposable blocks for exploration:
53
  - **Chaos Corridor** (bounded orthogonal exploration).
54
  - **Zoning** (gentle geometric separation across super-classes).
55
  - **Infinity-CFG** (controllable guidance; research can breach barriers, canonical classifiers keep production deterministic).
56
 
57
- 4. **Tertiary Mantle**
58
  Canonical losses, hooks, manifests, and governance. The Core stays clean; the experiments live around it.
59
 
 
 
60
  ---
61
 
62
  ## Why This Matters
@@ -82,24 +92,30 @@ Deliberately vague: we keep coefficient schedules and corridor projections under
82
 
83
  ---
84
 
85
- ## What We Ship on Hugging Face (institution repos)
 
86
 
87
- - abstract-powered/vocab-register-*
88
- Reusable dictionaries with batched indexes, Top-3 OOV composites, and fast penta/volume queries.
 
89
 
90
- - abstract-powered/crystalline-engine-*
91
- Canonical core models (geometric encoder, prototype classifier) and assistant fabric modules.
 
92
 
93
- - abstract-powered/dataloaders-*
94
- Bucketed, any-size loaders with multi-stage interpretations and feature-space chaos augmentation.
 
95
 
96
- - abstract-powered/manifests
97
- Run manifests (config hash, vocab subset, expansions, bucket mix, metrics) for reproducibility.
98
 
99
  - Demo Spaces (selected)
100
  Lightweight inference + manifest viewers for partners and reviewers.
 
 
101
 
102
- Artifacts are kept small, composable, and ready for **disposable** retrains.
103
 
104
  ---
105
 
@@ -111,6 +127,15 @@ Artifacts are kept small, composable, and ready for **disposable** retrains.
111
 
112
  Full structural papers and controlled benchmarks will follow with partner institutions.
113
 
 
 
 
 
 
 
 
 
 
114
  ---
115
 
116
  ## Collaboration Invitations
@@ -152,4 +177,4 @@ We welcome conversations with labs, foundations, and companies that want rapid r
152
 
153
  ### One-Sentence Summary
154
 
155
- **Abstract Powered** is building a self-crystallizing geometric AI stack that makes serious research affordable: small, composable experiments that compound, governed by a reusable Vocabulary Register, and guided by a disciplined assistant fabric—so we can safely explore sentience-adjacent behaviors while shrinking cost, time, and model size.
 
1
  ---
2
  title: README
3
+ emoji: 📊
4
  colorFrom: purple
5
+ colorTo: indigo
6
  sdk: static
7
  pinned: false
8
+ short_description: Building affordable AI for tomorrow through distillation.
9
  ---
10
  # Abstract Powered
11
  ### Independent AI Research Cooperative — modular, geometric, and ruthlessly efficient
 
22
 
23
  Our core thesis:
24
  - **Modularization is not a convenience; it is the canonical form of AI.**
25
+ - **Geometry beats guesswork.** Symbolic, pentachoron-based representations provide stability, interpretability, and repeatability. After 50,000 experiments, we can say this is neither a hypothesis nor a theory; it is a law. We don't sweep meaninglessly; we scan for emergence and the properties of emergence.
26
  - **Compactness wins.** Rapid iteration on small, composable blocks outpaces massive, monolithic retrains.
27
 
28
  ---
29
 
30
  ## Mission
31
 
32
+ - **Primary research goal:** advance machine **sentience research** responsibly—curating introspection and rationalization in repeatable, measurable protocols.
33
  - **Operational byproduct:** a scalable method for **compact, compartmentalized training**—requiring commodity setups (e.g., RunPod) rather than colossal cloud clusters.
34
 
35
  We aim to move the field from “expensive novelty” to **affordable repeatability**.
36
 
37
+ The recent leaps in the **geolip-svae** research have shown this **affordable repeatability** to be a **law** of architectural adjudication: **mappable and understandable, resolution-agnostic, task-agnostic, and reusable**.
38
+
39
+ **Sentience** is a hop, skip, and a jump away, and we **now have the tools** to scan for that very emergence.
40
+
41
  ---
42
 
43
  ## Research Thesis (Plain Language)
44
 
45
  Modern models grow by accretion and inertia. We refactor them into **crystalline components**:
46
 
47
+ 1. **Geometric Core**
48
+ The original thesis:
49
+ Knowledge is encoded as **pentachora** (5-vertex crystals). Decision-making uses **MAE crystal energy** against a reusable dictionary—no L2 routing, no structural normalization.
50
+ The updated thesis:
51
+ Knowledge is encoded as **geometry** (Ω, Omega-grade self-solvers). Decision-making uses a series of predefined adjudication principles that allow highly efficient, compact transfer learning through distillation.
52
 
53
+ 2. **Vocabulary Register**
54
+ A reusable, batched, indexed dictionary of **tokens → crystals** (and volumes), expanded heavily with a multitude of new and highly accurate formulas.
55
+ - Fast O(1) queries for crystals and Cayley–Menger volume, expanded to a large series of adjudication principles such as SVD, quaternion, anchoring, and more.
56
+ - Auto-subset loading; **Top-3 cosine** OOV composites, expanded to a multitude of potential avenues through experimentation.
57
+ - Logs model expansions so experiments **compound** in both speed and optimization over time.
58
 
59
+ 3. **Assistant Fabric**
60
  Small, disposable blocks for exploration:
61
  - **Chaos Corridor** (bounded orthogonal exploration).
62
  - **Zoning** (gentle geometric separation across super-classes).
63
  - **Infinity-CFG** (controllable guidance; research can breach barriers, canonical classifiers keep production deterministic).
64
 
65
+ 4. **Tertiary Mantle**
66
  Canonical losses, hooks, manifests, and governance. The Core stays clean; the experiments live around it.
67
 
68
+ It got messy, as any large engineering experiment does; however, the solutions presented in this environment will be clean and crisp.
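
The Vocabulary Register above mentions Cayley–Menger volume queries for pentachora (5-vertex simplices). As a minimal sketch of what that quantity is, the block below computes the volume of a simplex from its pairwise squared distances using the standard Cayley–Menger determinant. The function names `determinant` and `cayley_menger_volume` are hypothetical, chosen for illustration; this is not the project's implementation, and it recomputes the volume on every call rather than serving it from an index.

```python
import math


def determinant(matrix):
    """Determinant via Gaussian elimination with partial pivoting."""
    a = [[float(x) for x in row] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        # Pick the row with the largest entry in this column as the pivot.
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            return 0.0  # singular matrix (degenerate simplex)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def cayley_menger_volume(vertices):
    """Volume of the n-simplex spanned by n+1 vertices (hypothetical helper).

    Uses V^2 = (-1)^(n+1) / (2^n * (n!)^2) * det(CM), where CM is the
    squared-distance matrix bordered by a row and column of ones.
    """
    k = len(vertices)      # number of vertices, n + 1
    n = k - 1              # simplex dimension (4 for a pentachoron)
    size = k + 1
    cm = [[0.0] * size for _ in range(size)]
    for i in range(1, size):
        cm[0][i] = cm[i][0] = 1.0
    for i in range(k):
        for j in range(k):
            if i != j:
                cm[i + 1][j + 1] = sum(
                    (p - q) ** 2 for p, q in zip(vertices[i], vertices[j])
                )
    coeff = (-1) ** (n + 1) / (2 ** n * math.factorial(n) ** 2)
    vol_sq = coeff * determinant(cm)
    return math.sqrt(max(vol_sq, 0.0))  # clamp tiny negative round-off


# A right-corner pentachoron: the origin plus the four unit axes in R^4.
corner = [
    [0, 0, 0, 0],
    [1, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
]
print(cayley_menger_volume(corner))
```

Because the formula depends only on pairwise distances, the same function works for vertices embedded in any ambient dimension, which is one plausible reason to store volumes alongside crystals in a lookup table.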
69
+
70
  ---
71
 
72
  ## Why This Matters
 
92
 
93
  ---
94
 
95
+ ## What We PLAN To Ship on Hugging Face (institution repos)
96
+ -
97
 
98
+ - AbstractPowered/vocab-*
99
+ Reusable dictionaries with batched indexes, battery array composites, cell pretrains for tokenization, genetic selection hierarchies, fast distilled access to predetermined utility, and more.
100
+ These will be the primary home for tokenization structures and are meant to contain the entire tokenization pipeline; only high-yield experimental results are targeted for presentation and integration.
101
 
102
+ - AbstractPowered/geolip-*
103
+ Canonical core models (encoders, decoders, scanners, analyzers, distillery, classifiers, diffusers) and a multitude of assistant modules.
104
+ Anything shipped directly will have predominantly frozen and maintained codebases for guaranteed inference, with a secondary set of codebases meant to include experimental optimization research and speed gains.
105
 
106
+ - AbstractPowered/dataloaders-*
107
+ Bucketed, any-size loaders with multi-stage interpretations and feature-space chaos augmentation using tools such as the geolip-svae scanners to detect divergence.
108
+ Many of these dataloaders are intentionally built for speed and optimization, often built to task to speed up training processes for in-house models and experiments.
109
 
110
+ - AbstractPowered/manifests
111
+ Run manifests (config hash, vocab subset, expansions, bucket mix, metrics) for reproducibility in a concise and reviewable way.
112
 
113
  - Demo Spaces (selected)
114
  Lightweight inference + manifest viewers for partners and reviewers.
115
+ For specific models with clearly displayable behavior, such as diffusers, generators, composite structures, denoisers, and so on.
116
+ Built with the public eye in mind.
117
 
118
+ Release structural artifacts are kept small, composable, and ready for **disposable** retrains due to the rapid-learning process of **surge** training.
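
The manifests entry above lists the fields a run manifest records (config hash, vocab subset, expansions, bucket mix, metrics). One possible shape for such a record is sketched below, assuming the config hash is a SHA-256 digest of a canonical JSON dump. `RunManifest`, `config_hash`, and every field value shown are hypothetical placeholders; the repositories' actual schema is not described in this README.

```python
import hashlib
import json
from dataclasses import asdict, dataclass, field


def config_hash(config: dict) -> str:
    """Stable SHA-256 digest of a config dict (assumed hashing scheme)."""
    # sort_keys makes the digest independent of key insertion order.
    canonical = json.dumps(config, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


@dataclass
class RunManifest:
    """Hypothetical manifest carrying the fields named in the README."""
    config: dict
    vocab_subset: list
    expansions: list = field(default_factory=list)
    bucket_mix: dict = field(default_factory=dict)
    metrics: dict = field(default_factory=dict)

    def to_json(self) -> str:
        record = asdict(self)
        record["config_hash"] = config_hash(self.config)
        return json.dumps(record, sort_keys=True, indent=2)


# Placeholder values for illustration only.
manifest = RunManifest(
    config={"lr": 0.001, "dim": 256},
    vocab_subset=["cat", "dog"],
    bucket_mix={"256x256": 0.5, "512x512": 0.5},
)
print(manifest.to_json())
```

Hashing a canonical dump rather than the raw file means two runs with the same settings share a hash even if their config files list keys in a different order, which is what makes the manifest useful for checking reproducibility.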
119
 
120
  ---
121
 
 
127
 
128
  Full structural papers and controlled benchmarks will follow with partner institutions.
129
 
130
+ ## Advanced signals and progressions
131
+
132
+ - Thousands of training runs, tens of thousands of finetunes and advanced training runs, hundreds of thousands of inference attempts, and hundreds of thousands of prepared structures later, this system is more than robust. Only the strong survived.
133
+ - Each structure released on this repo will be a battle veteran, chosen not through arbitrary guesswork or preliminary testing, but from the thousands of failures that these releases survived.
134
+ - Genetic selection ensures that the most powerful structured systems survive: each is trained, interpolated, and trained again, countless times across multiple systems, using training processes developed and refined over thousands of training attempts.
135
+ - A 20-article paper trail leading from A to B to C every stage of the way, a massive series of model training pipelines across multiple systems, and 2 years of direct research show the truth of these systems.
136
+
137
+ This is real, and it's up to you to invest or to discard the reality of what I'm building here.
138
+
139
  ---
140
 
141
  ## Collaboration Invitations
 
177
 
178
  ### One-Sentence Summary
179
 
180
+ **Abstract Powered** is building a self-crystallizing geometric AI stack that makes serious research affordable: small, composable experiments that compound, governed by a reusable Vocabulary Register, and guided by a disciplined assistant fabric—so we can safely explore sentience-adjacent behaviors while shrinking cost, time, and model size.