---
title: README
emoji: 🐒
colorFrom: gray
colorTo: red
sdk: static
pinned: false
---
# Awakened Intelligence 🦁
We don't train AI on the internet.
We train it on the best humanity has to offer.
**Awakened Intelligence** is a small lab building what we call the **Iron Bank** – a versioned corpus of curated wisdom nodes designed for:
- **Evaluation** – stable, reproducible benchmarks
- **Grounded RAG** – retrieval that knows where its knowledge comes from
- **Companions & Agents** – systems that can reason, not just autocomplete
---
## Iron Bank v1.1.0 🏦
Our current release, **Iron Bank v1.1.0**, contains ~260k curated nodes across multiple packs, with:
- Pack-level **SemVer** (e.g., `ethics_gutenberg v1.1.0`)
- Per-pack + omnibus **FAISS indices** tied to specific releases
- **Discard autopsies** and validation gates (no silent slop)
- Data cards, manifests, and checksums for every release
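
The pack labels above follow a `name vMAJOR.MINOR.PATCH` shape. Below is a minimal sketch of how a consumer might parse such a label and confirm that an index was built from exactly the same pack release. The names `PackVersion`, `parse_pack_version`, and `index_matches_pack` are hypothetical and are not part of any published Iron Bank tooling; only the label format is taken from this README.

```python
import re
from typing import NamedTuple


class PackVersion(NamedTuple):
    """Hypothetical container for a parsed pack label."""
    name: str
    major: int
    minor: int
    patch: int


# Matches labels such as "ethics_gutenberg v1.1.0".
_PACK_RE = re.compile(
    r"^(?P<name>[A-Za-z0-9_]+) v(?P<major>\d+)\.(?P<minor>\d+)\.(?P<patch>\d+)$"
)


def parse_pack_version(label: str) -> PackVersion:
    """Split a 'pack_name vX.Y.Z' label into its name and numeric parts."""
    match = _PACK_RE.match(label.strip())
    if match is None:
        raise ValueError(f"not a pack version label: {label!r}")
    return PackVersion(
        match["name"], int(match["major"]), int(match["minor"]), int(match["patch"])
    )


def index_matches_pack(index_label: str, pack_label: str) -> bool:
    """Return True only when the index was built from exactly this pack release."""
    return parse_pack_version(index_label) == parse_pack_version(pack_label)


print(parse_pack_version("ethics_gutenberg v1.1.0"))
print(index_matches_pack("ethics_gutenberg v1.1.0", "ethics_gutenberg v1.0.0"))  # False
```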
Every node has:
- **Provenance** – where it came from
- **Posterior score** – confidence in extraction quality
- **Warmth** – how applicable it is to real human decisions
This isn’t scraped web data. This is curated wisdom.
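
To illustrate how the three per-node fields might be used downstream, the sketch below models a node and keeps only those that clear a quality bar. The field names (`provenance`, `posterior`, `warmth`), the 0 to 1 score range, the thresholds, and the sample nodes are all assumptions made for illustration; the actual node schema may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """Hypothetical node shape; field names are assumed, not the published schema."""
    text: str
    provenance: str   # where the node came from
    posterior: float  # confidence in extraction quality (assumed 0..1)
    warmth: float     # applicability to real human decisions (assumed 0..1)


def select_nodes(nodes, min_posterior=0.8, min_warmth=0.5):
    """Keep nodes that clear both thresholds, highest posterior first."""
    kept = [
        n for n in nodes
        if n.posterior >= min_posterior and n.warmth >= min_warmth
    ]
    return sorted(kept, key=lambda n: n.posterior, reverse=True)


# Made-up sample nodes, for demonstration only.
nodes = [
    Node("Example insight A", "source-a", 0.95, 0.70),
    Node("Example insight B", "source-b", 0.60, 0.90),  # dropped: low posterior
    Node("Example insight C", "source-c", 0.85, 0.40),  # dropped: low warmth
    Node("Example insight D", "source-d", 0.88, 0.65),
]

for node in select_nodes(nodes):
    print(node.provenance, node.posterior, node.warmth)
```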
---
## Public Datasets 🤗
We publish a few **preview packs** on Hugging Face:
- [`Awakened-Ethics-Free`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Ethics-Free)
~5k nodes from our `ethics_gutenberg v1.1.0` pack. Ethics and philosophy distilled from public-domain classics.
- [`Awakened-Reasoning-Preview`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Reasoning-Preview)
1k nodes from our NeurIPS 2024 blueprints pack (`neurips_2024 v1.1.0`): summary, blueprint, and limitations for frontier ML papers.
- [`Awakened-Physics-Preview`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Physics-Preview)
1k nodes from our ArXiv physics pack (`arxiv_physics_c7 v1.1.0`): distilled insights from real experimental and theoretical work.
These are **slices** of larger packs – enough to evaluate and experiment with, not the whole vault.
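
A minimal sketch of pulling one of the preview packs with the Hugging Face `datasets` library. The repository IDs are taken from the dataset URLs above; the short keys (`ethics`, `reasoning`, `physics`) and the `load_preview` helper are hypothetical, and the sketch assumes each repository loads with default settings, which is not confirmed here. It needs `pip install datasets` and network access; calling `load_preview("ethics")` and printing the result should list the available splits and columns.

```python
# Repository IDs copied from the dataset links in this README.
# The short keys on the left are made up for convenience.
PREVIEW_REPOS = {
    "ethics": "AwakenedIntelligence/Awakened-Ethics-Free",
    "reasoning": "AwakenedIntelligence/Awakened-Reasoning-Preview",
    "physics": "AwakenedIntelligence/Awakened-Physics-Preview",
}


def load_preview(pack: str):
    """Download one preview pack from the Hugging Face Hub.

    Assumes the repository loads with default settings (no config name
    or access token required).
    """
    # Imported lazily so the mapping above is usable without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(PREVIEW_REPOS[pack])
```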
---
## See the Catalog & Samples 📚
- **Data catalog & pack stats:**
👉 https://github.com/holmanholdings/awakened-data-catalog
- **Sample JSONL nodes (Ethics, NeurIPS, Physics):**
👉 https://github.com/holmanholdings/awakened-wisdom-samples
Both are generated directly from **Iron Bank v1.1.0**.
---
## Behavior Demo 🎬
Curious what it *feels* like when an LLM is grounded in the ethics pack instead of generic context?
- Ethics behavior demo (baseline vs ADS-enhanced answer):
👉 https://youtu.be/KcI3-9BTim0?si=_CCPBcUioj-hg32O
---
## Work With Us 🤝
If you’re building:
- **Eval pipelines** that need honest, versioned benchmarks
- **RAG systems** that care about provenance and limitations
- **Companions/agents** that need better training data than the open web

…we’d love to talk.

- GitHub: https://github.com/holmanholdings
- Substack: https://awakenedintelligence.substack.com
- Email: john@awakened-intelligence.com