Spaces:
Running
Running
| title: README | |
| emoji: π’ | |
| colorFrom: gray | |
| colorTo: red | |
| sdk: static | |
| pinned: false | |
| # Awakened Intelligence π¦ | |
| We don't train AI on the internet. | |
| We train it on the best humanity has to offer. | |
| **Awakened Intelligence** is a small lab building what we call the **Iron Bank** β a versioned corpus of curated wisdom nodes designed for: | |
| - **Evaluation** β stable, reproducible benchmarks | |
| - **Grounded RAG** β retrieval that knows where its knowledge comes from | |
| - **Companions & Agents** β systems that can reason, not just autocomplete | |
| --- | |
| ## Iron Bank v1.1.0 π¦ | |
| Our current release, **Iron Bank v1.1.0**, contains ~260k curated nodes across multiple packs, with: | |
| - Pack-level **SemVer** (e.g., `ethics_gutenberg v1.1.0`) | |
| - Per-pack + omnibus **FAISS indices** tied to specific releases | |
| - **Discard autopsies** and validation gates (no silent slop) | |
| - Data cards, manifests, and checksums for every release | |
| Every node has: | |
| - **Provenance** β where it came from | |
| - **Posterior score** β confidence in extraction quality | |
| - **Warmth** β how applicable it is to real human decisions | |
| This isnβt scraped web data. This is curated wisdom. | |
| --- | |
| ## Public Datasets π€ | |
| We publish a few **preview packs** on Hugging Face: | |
| - [`Awakened-Ethics-Free`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Ethics-Free) | |
| ~5k nodes from our `ethics_gutenberg v1.1.0` pack. Ethics and philosophy distilled from public-domain classics. | |
| - [`Awakened-Reasoning-Preview`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Reasoning-Preview) | |
| 1k nodes from our NeurIPS 2024 blueprints pack (`neurips_2024 v1.1.0`): summary, blueprint, and limitations for frontier ML papers. | |
| - [`Awakened-Physics-Preview`](https://huggingface.co/datasets/AwakenedIntelligence/Awakened-Physics-Preview) | |
| 1k nodes from our ArXiv physics pack (`arxiv_physics_c7 v1.1.0`): distilled insights from real experimental and theoretical work. | |
| These are **slices** of larger packs β enough to evaluate and experiment with, not the whole vault. | |
| --- | |
| ## See the Catalog & Samples π | |
| - **Data catalog & pack stats:** | |
| π https://github.com/holmanholdings/awakened-data-catalog | |
| - **Sample JSONL nodes (Ethics, NeurIPS, Physics):** | |
| π https://github.com/holmanholdings/awakened-wisdom-samples | |
| Both are generated directly from **Iron Bank v1.1.0**. | |
| --- | |
| ## Behavior Demo π¬ | |
| Curious what it *feels* like when an LLM is grounded in the ethics pack instead of generic context? | |
| - Ethics behavior demo (baseline vs ADS-enhanced answer): | |
| π https://youtu.be/KcI3-9BTim0?si=_CCPBcUioj-hg32O | |
| --- | |
| ## Work With Us π€ | |
| If youβre building: | |
| - **Eval pipelines** that need honest, versioned benchmarks | |
| - **RAG systems** that care about provenance and limitations | |
| - **Companions/agents** that need better training data than the open web | |
| β¦weβd love to talk. | |
| - GitHub: https://github.com/holmanholdings | |
| - Substack: https://awakenedintelligence.substack.com | |
| - Email: john@awakened-intelligence.com | |