Spaces:
Running
Running
File size: 3,049 Bytes
16ba750 e1e1a97 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 |
---
title: README
emoji: π’
colorFrom: gray
colorTo: red
sdk: static
pinned: false
---
# Awakened Intelligence π¦
We don't train AI on the internet.
We train it on the best humanity has to offer.
**Awakened Intelligence** is a small lab building what we call the **Iron Bank** β a versioned corpus of curated wisdom nodes designed for:
- **Evaluation** β stable, reproducible benchmarks
- **Grounded RAG** β retrieval that knows where its knowledge comes from
- **Companions & Agents** β systems that can reason, not just autocomplete
---
## Iron Bank v1.1.0 π¦
Our current release, **Iron Bank v1.1.0**, contains ~260k curated nodes across multiple packs, with:
- Pack-level **SemVer** (e.g., `ethics_gutenberg v1.1.0`)
- Per-pack + omnibus **FAISS indices** tied to specific releases
- **Discard autopsies** and validation gates (no silent slop)
- Data cards, manifests, and checksums for every release
Every node has:
- **Provenance** β where it came from
- **Posterior score** β confidence in extraction quality
- **Warmth** β how applicable it is to real human decisions
This isnβt scraped web data. This is curated wisdom.
---
## Public Datasets π€
We publish a few **preview packs** on Hugging Face:
- [`Awakened-Ethics-Free`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Ethics-Free)
~5k nodes from our `ethics_gutenberg v1.1.0` pack. Ethics and philosophy distilled from public-domain classics.
- [`Awakened-Reasoning-Preview`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Reasoning-Preview)
1k nodes from our NeurIPS 2024 blueprints pack (`neurips_2024 v1.1.0`): summary, blueprint, and limitations for frontier ML papers.
- [`Awakened-Physics-Preview`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Physics-Preview)
1k nodes from our ArXiv physics pack (`arxiv_physics_c7 v1.1.0`): distilled insights from real experimental and theoretical work.
These are **slices** of larger packs β enough to evaluate and experiment with, not the whole vault.
---
## See the Catalog & Samples π
- **Data catalog & pack stats:**
π https://github.com/holmanholdings/awakened-data-catalog
- **Sample JSONL nodes (Ethics, NeurIPS, Physics):**
π https://github.com/holmanholdings/awakened-wisdom-samples
Both are generated directly from **Iron Bank v1.1.0**.
---
## Behavior Demo π¬
Curious what it *feels* like when an LLM is grounded in the ethics pack instead of generic context?
- Ethics behavior demo (baseline vs ADS-enhanced answer):
π https://youtu.be/KcI3-9BTim0?si=_CCPBcUioj-hg32O
---
## Work With Us π€
If youβre building:
- **Eval pipelines** that need honest, versioned benchmarks
- **RAG systems** that care about provenance and limitations
- **Companions/agents** that need better training data than the open web
β¦weβd love to talk.
- GitHub: https://github.com/holmanholdings
- Substack: https://awakenedintelligence.substack.com
- Email: john@awakened-intelligence.com
|