Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,10 +1,11 @@
|
|
| 1 |
---
|
| 2 |
title: README
|
| 3 |
-
emoji:
|
| 4 |
colorFrom: purple
|
| 5 |
-
colorTo:
|
| 6 |
sdk: static
|
| 7 |
pinned: false
|
|
|
|
| 8 |
---
|
| 9 |
# Abstract Powered
|
| 10 |
### Independent AI Research Cooperative — modular, geometric, and ruthlessly efficient
|
|
@@ -21,42 +22,51 @@ We build and study **self-crystallizing** AI systems: models that grow by attach
|
|
| 21 |
|
| 22 |
Our core thesis:
|
| 23 |
- **Modularization is not a convenience; it is the canonical form of AI.**
|
| 24 |
-
- **Geometry beats guesswork.** Symbolic, pentachoron-based representations provide stability, interpretability, and repeatability.
|
| 25 |
- **Compactness wins.** Rapid iteration on small, composable blocks outpaces massive, monolithic retrains.
|
| 26 |
|
| 27 |
---
|
| 28 |
|
| 29 |
## Mission
|
| 30 |
|
| 31 |
-
- **Primary research goal:** advance machine **sentience research** responsibly—curating introspection and rationalization in repeatable, measurable protocols.
|
| 32 |
- **Operational byproduct:** a scalable method for **compact, compartmentalized training**—requiring commodity setups (e.g., RunPod) rather than colossal cloud clusters.
|
| 33 |
|
| 34 |
We aim to move the field from “expensive novelty” to **affordable repeatability**.
|
| 35 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 36 |
---
|
| 37 |
|
| 38 |
## Research Thesis (Plain Language)
|
| 39 |
|
| 40 |
Modern models grow by accretion and inertia. We refactor them into **crystalline components**:
|
| 41 |
|
| 42 |
-
1. **Geometric Core**
|
| 43 |
-
|
|
|
|
|
|
|
|
|
|
| 44 |
|
| 45 |
-
|
| 46 |
-
A reusable, batched, indexed dictionary of **tokens → crystals** (and volumes).
|
| 47 |
-
- Fast O(1) queries for crystals and Cayley–Menger volume.
|
| 48 |
-
- Auto-subset loading; **Top-3 cosine** OOV composites.
|
| 49 |
-
- Logs model expansions so experiments **compound**.
|
| 50 |
|
| 51 |
-
|
| 52 |
Small, disposable blocks for exploration:
|
| 53 |
- **Chaos Corridor** (bounded orthogonal exploration).
|
| 54 |
- **Zoning** (gentle geometric separation across super-classes).
|
| 55 |
- **Infinity-CFG** (controllable guidance; research can breach barriers, canonical classifiers keep production deterministic).
|
| 56 |
|
| 57 |
-
|
| 58 |
Canonical losses, hooks, manifests, and governance. The Core stays clean; the experiments live around it.
|
| 59 |
|
|
|
|
|
|
|
| 60 |
---
|
| 61 |
|
| 62 |
## Why This Matters
|
|
@@ -82,24 +92,30 @@ Deliberately vague: we keep coefficient schedules and corridor projections under
|
|
| 82 |
|
| 83 |
---
|
| 84 |
|
| 85 |
-
## What We Ship on Hugging Face (institution repos)
|
|
|
|
| 86 |
|
| 87 |
-
-
|
| 88 |
-
Reusable dictionaries with batched indexes,
|
|
|
|
| 89 |
|
| 90 |
-
-
|
| 91 |
-
Canonical core models (
|
|
|
|
| 92 |
|
| 93 |
-
-
|
| 94 |
-
Bucketed, any-size loaders with multi-stage interpretations and feature-space chaos augmentation.
|
|
|
|
| 95 |
|
| 96 |
-
-
|
| 97 |
-
Run manifests (config hash, vocab subset, expansions, bucket mix, metrics) for reproducibility.
|
| 98 |
|
| 99 |
- Demo Spaces (selected)
|
| 100 |
Lightweight inference + manifest viewers for partners and reviewers.
|
|
|
|
|
|
|
| 101 |
|
| 102 |
-
|
| 103 |
|
| 104 |
---
|
| 105 |
|
|
@@ -111,6 +127,15 @@ Artifacts are kept small, composable, and ready for **disposable** retrains.
|
|
| 111 |
|
| 112 |
Full structural papers and controlled benchmarks will follow with partner institutions.
|
| 113 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 114 |
---
|
| 115 |
|
| 116 |
## Collaboration Invitations
|
|
@@ -152,4 +177,4 @@ We welcome conversations with labs, foundations, and companies that want rapid r
|
|
| 152 |
|
| 153 |
### One-Sentence Summary
|
| 154 |
|
| 155 |
-
**Abstract Powered** is building a self-crystallizing geometric AI stack that makes serious research affordable: small, composable experiments that compound, governed by a reusable Vocabulary Register, and guided by a disciplined assistant fabric—so we can safely explore sentience-adjacent behaviors while shrinking cost, time, and model size.
|
|
|
|
| 1 |
---
|
| 2 |
title: README
|
| 3 |
+
emoji: 📊
|
| 4 |
colorFrom: purple
|
| 5 |
+
colorTo: indigo
|
| 6 |
sdk: static
|
| 7 |
pinned: false
|
| 8 |
+
short_description: Building AI affordable tomorrow through distillation.
|
| 9 |
---
|
| 10 |
# Abstract Powered
|
| 11 |
### Independent AI Research Cooperative — modular, geometric, and ruthlessly efficient
|
|
|
|
| 22 |
|
| 23 |
Our core thesis:
|
| 24 |
- **Modularization is not a convenience; it is the canonical form of AI.**
|
| 25 |
+
- **Geometry beats guesswork.** Symbolic, pentachoron-based representations provide stability, interpretability, and repeatability. 50,000 experiments later we can this is not a hypothesis nor theory, this is a law. We don't sweep meaninglessly, we scan for emergence and the properties of emergence.
|
| 26 |
- **Compactness wins.** Rapid iteration on small, composable blocks outpaces massive, monolithic retrains.
|
| 27 |
|
| 28 |
---
|
| 29 |
|
| 30 |
## Mission
|
| 31 |
|
| 32 |
+
- **Primary research goal:** advance machine **sentience research** responsibly—curating introspection and rationalization in repeatable, measurable protocols.
|
| 33 |
- **Operational byproduct:** a scalable method for **compact, compartmentalized training**—requiring commodity setups (e.g., RunPod) rather than colossal cloud clusters.
|
| 34 |
|
| 35 |
We aim to move the field from “expensive novelty” to **affordable repeatability**.
|
| 36 |
|
| 37 |
+
The recent leaps in the **geolip-svae** research have shown this **affordable repeatability** to be a **law** of architectural adjudication - **mappable and understandable, resolution agnostic, task agnostic, and reusable**.
|
| 38 |
+
|
| 39 |
+
**Sentience** is a hop skip and a jump away - and we **now have the tools** to scan for that very emergence.
|
| 40 |
+
|
| 41 |
---
|
| 42 |
|
| 43 |
## Research Thesis (Plain Language)
|
| 44 |
|
| 45 |
Modern models grow by accretion and inertia. We refactor them into **crystalline components**:
|
| 46 |
|
| 47 |
+
1. **Geometric Core**
|
| 48 |
+
The original thesis;
|
| 49 |
+
Knowledge is encoded as **pentachora** (5-vertex crystals). Decision-making uses **MAE crystal energy** against a reusable dictionary—no L2 routing, no structural normalization.
|
| 50 |
+
The updated thesis;
|
| 51 |
+
Knowledge is encoded as **geometric** (Ω Omega-grade self-solvers). Decision-making uses a series of pre-defined adjudication principals that allow highly efficient and compacted transfer learning through distillation processes.
|
| 52 |
|
| 53 |
+
3. **Vocabulary Register**
|
| 54 |
+
A reusable, batched, indexed dictionary of **tokens → crystals** (and volumes), expanded heavily with a multitude of new and highly-accurate formulas.
|
| 55 |
+
- Fast O(1) queries for crystals and Cayley–Menger volume, expanded to a large series of adjudication principles such as SVD, quaternion, anchoring, and more.
|
| 56 |
+
- Auto-subset loading; **Top-3 cosine** OOV composites, expanded to a multitude of potential avenues over experimentation.
|
| 57 |
+
- Logs model expansions so experiments **compound** both in speed and optimization over time.
|
| 58 |
|
| 59 |
+
4. **Assistant Fabric**
|
| 60 |
Small, disposable blocks for exploration:
|
| 61 |
- **Chaos Corridor** (bounded orthogonal exploration).
|
| 62 |
- **Zoning** (gentle geometric separation across super-classes).
|
| 63 |
- **Infinity-CFG** (controllable guidance; research can breach barriers, canonical classifiers keep production deterministic).
|
| 64 |
|
| 65 |
+
5. **Tertiary Mantle**
|
| 66 |
Canonical losses, hooks, manifests, and governance. The Core stays clean; the experiments live around it.
|
| 67 |
|
| 68 |
+
It got messy as any large engineering experiment becomes, however the solutions presented in this environment will be clean and crisp.
|
| 69 |
+
|
| 70 |
---
|
| 71 |
|
| 72 |
## Why This Matters
|
|
|
|
| 92 |
|
| 93 |
---
|
| 94 |
|
| 95 |
+
## What We PLAN To Ship on Hugging Face (institution repos)
|
| 96 |
+
-
|
| 97 |
|
| 98 |
+
- AbstractPowered/vocab-*
|
| 99 |
+
Reusable dictionaries with batched indexes, battery array composites, cell pretrains for tokenization, genetic selection hierarchies, fast distilled access to predetermined utility and more.
|
| 100 |
+
These will be the primary house for tokenization structures, which are meant to contain the entirety of the tokenization processing targeting high-yield experimental results only for presentation and integration.
|
| 101 |
|
| 102 |
+
- AbstractPowered/geolip-*
|
| 103 |
+
Canonical core models (encoders, decoders, scanners, analyzers, distillery, classifiers, diffusers) and a multitude of assistant modules.
|
| 104 |
+
Anything shipped directly will have predominantly frozen and maintained codebases for guaranteed inference, with a secondary set of codebases meant to include experimental optimization research and speed gains.
|
| 105 |
|
| 106 |
+
- AbstractPowered/dataloaders-*
|
| 107 |
+
Bucketed, any-size loaders with multi-stage interpretations and feature-space chaos augmentation using tools such as the geolip-svae scanners to detect divergence.
|
| 108 |
+
Many of these dataloaders are intentionally meant for speed and optimization, oftentimes built to-task speed up training processes for in-house models and experiments.
|
| 109 |
|
| 110 |
+
- AbstractPowered/manifests
|
| 111 |
+
Run manifests (config hash, vocab subset, expansions, bucket mix, metrics) for reproducibility in a concise and reviewable way.
|
| 112 |
|
| 113 |
- Demo Spaces (selected)
|
| 114 |
Lightweight inference + manifest viewers for partners and reviewers.
|
| 115 |
+
For specific models with clearly displayable behavior; such as diffusers, generators, composite structures, denoisers, and so on.
|
| 116 |
+
Built with the public-eye in mind.
|
| 117 |
|
| 118 |
+
Release structural artifacts are kept small, composable, and ready for **disposable** retrains due to the rapid-learning process of **surge** training.
|
| 119 |
|
| 120 |
---
|
| 121 |
|
|
|
|
| 127 |
|
| 128 |
Full structural papers and controlled benchmarks will follow with partner institutions.
|
| 129 |
|
| 130 |
+
## Advanced signals and progressions
|
| 131 |
+
|
| 132 |
+
- Thousands of trains, 10s of thousands of finetunes and advanced trains, hundreds of thousands of inference attempts, and hundreds of thousands of prepared structures later; this system is more than robust. Only the strong survived.
|
| 133 |
+
- Each structure released on this repo will be a battle veteran, specifically chosen not because of some arbitrary guesswork or preliminary testing, but from thousands of failures that allowed these releases to survive.
|
| 134 |
+
- Genetically selected models guarantee the most powerful structured systems survive, each trained, interpolated, trained again, countless times for multiple systems through training processes devloped over thousands of train attempts and refined countless times.
|
| 135 |
+
- A 20 article paper trail leading from A to B to C every stage of the way, massive model training pipeline series of multiple systems, 2 years direct research shows the truth of these systems.
|
| 136 |
+
|
| 137 |
+
This is real, and it's up to you to invest or to discard the reality of what I'm building here.
|
| 138 |
+
|
| 139 |
---
|
| 140 |
|
| 141 |
## Collaboration Invitations
|
|
|
|
| 177 |
|
| 178 |
### One-Sentence Summary
|
| 179 |
|
| 180 |
+
**Abstract Powered** is building a self-crystallizing geometric AI stack that makes serious research affordable: small, composable experiments that compound, governed by a reusable Vocabulary Register, and guided by a disciplined assistant fabric—so we can safely explore sentience-adjacent behaviors while shrinking cost, time, and model size.
|