README / README.md
alexanderkhundoev's picture
Update README.md
c327039 verified
# EVAM Lab πŸͺ·
**Ancient logic. Modern AI. Built to know what it doesn't know.**
We implement Buddhist formal logic as ML infrastructure β€” rigorous epistemological frameworks developed over 1,500 years, now running on GPUs.
---
## Phase A: Apoha β€” Available Now
> *"A concept is not defined by what it includes, but by what it excludes."*
> β€” Dignāga, c. 480–540 CE
**Apoha** is a few-shot classifier that defines concepts by their **exclusion sets** β€” examples of what they are NOT. Inputs that don't clearly belong anywhere receive `UNCERTAIN` instead of a forced label. OOD rejection is built into the architecture, not bolted on after.
| Benchmark | Apoha OOD Rejection | Best Baseline | Improvement |
|-----------|-------------------|---------------|-------------|
| CLINC150 (30 intents) | 84.6% | 15.1% (InfoNCE) | **5.6Γ—** |
| Banking77 (50 intents) | 29.1% | 0.0% (InfoNCE) | **∞** |
| HWU64 (40 intents) | 24.9% | 0.2% (InfoNCE) | **166Γ—** |
| Cybersecurity (15 TTPs) | 45.2% | 1.2% (InfoNCE) | **38Γ—** |
πŸ‘‰**[apoha-bge-small-en-v1.5](https://huggingface.co/EVAMLab/apoha-bge-small-en-v1.5)** β€” model weights + inference code
πŸ‘‰ **[apoha-demo](https://huggingface.co/spaces/EVAMLab/apoha-demo)** β€” live interactive demo
πŸ“„ **Preprint:** ArXiv *(coming soon)*
---
## Roadmap
| Phase | Module | What it does |
|-------|--------|-------------|
| **A βœ…** | Apoha | Rejects OOD *inputs* β€” classifier knows what it doesn't know |
| **B πŸ”¨** | Hetuchakra | Verifies LLM *outputs* β€” formal argument gate |