--- title: VORTEXRAG emoji: "🌀" colorFrom: purple colorTo: blue sdk: gradio sdk_version: "5.29.0" app_file: app.py pinned: true license: mit short_description: "7-Layer RAG: +13.6 EM, 0.94 Faithfulness" tags: - retrieval-augmented-generation - RAG - NLP - question-answering - causal-reasoning - hallucination-reduction - LLM - machine-learning --- # VORTEXRAG **Vector Orthogonal Resonance-Tuned EXtraction Retrieval-Augmented Generation** > A 7-layer RAG framework that simultaneously eliminates **Semantic Drift** and **Context Window Poisoning**. [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.20579702.svg)](https://doi.org/10.5281/zenodo.20579702) [![GitHub](https://img.shields.io/badge/GitHub-vignesh2027%2FVORTEXRAG-blue)](https://github.com/vignesh2027/VORTEXRAG) [![Tests](https://img.shields.io/badge/Tests-229%20passing-brightgreen)](https://github.com/vignesh2027/VORTEXRAG) [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://github.com/vignesh2027/VORTEXRAG/blob/main/LICENSE) --- ## The Problem Standard RAG Cannot Solve Ask "Why did Lehman Brothers collapse?" Standard RAG retrieves both Dodd-Frank provisions (cosine 0.87, topically related but WRONG) and the CDS mispricing mechanism (cosine 0.91, causally correct). The LLM sees both and hallucinates a policy-response narrative. **This is Semantic Drift.** Even with the right chunk retrieved, 7 surrounding irrelevant chunks dilute the LLM attention. **This is Context Window Poisoning.** VORTEXRAG solves both with a principled 7-layer pipeline. --- ## The 7 Layers | Layer | Name | Formula | |-------|------|---------| | 1 | TVE - Tri-Vector Encoding | score = alpha * cos_sem + beta * cos_syn + gamma * cos_cau | | 2 | VRC - Vortex Retrieval Cone | spiral = TVE * exp(-lambda*r) * cos(n*theta) | | 3 | SDC - Semantic Drift Corrector | SDS = 1 - tanh(norm(D)/tau) >= 0.72 | | 4 | CPG - Context Poison Guard | ESR = sum(S*w)/(P+eps) >= 3.5 (provably optimal) | | 5 | RFG - Rank Fusion Gate | Phi = TVE^alpha * SDS^beta * ESR^gamma | | 6 | CCB - Causal Context Builder | pos = rank(Phi) * causal_depth | | 7 | FV - Faithfulness Verifier | Delta_R = 1 - ROUGE-L * NLI <= 0.15 | --- ## Benchmark Results (v3.0) | System | EM | F1 | Faithfulness | Latency | |--------|----|----|-------------|---------| | **VORTEXRAG** | **74.8** | **82.6** | **0.94** | **185ms** | | Self-RAG | 68.4 | 77.1 | 0.81 | 410ms | | CRAG | 66.9 | 75.8 | 0.79 | 320ms | | Naive RAG | 61.2 | 69.4 | 0.71 | 95ms | **+13.6 EM** over Naive RAG - Semantic Drift **-61%** - Context Poisoning **-71%** - **2.2x faster** than Self-RAG --- ## Ablation ``` Baseline: 61.2 EM / 0.68 Faithfulness +TVE: 65.3 EM (+4.1) +VRC: 67.8 EM (+2.5) +SDC: 70.4 EM (+2.6) +CPG: 72.1 EM (+1.7) +RFG: 73.4 EM (+1.3) +CCB: 73.9 EM (+0.5) VORTEXRAG (full): 74.8 EM (+0.9) ``` --- ## 11 Domain Presets scientific (tau=0.30) - medical (tau=0.35) - legal (tau=0.40) - cybersecurity (tau=0.45) - financial (tau=0.50) - code (tau=0.60) - educational (tau=0.65) - general (tau=0.80) - historical (tau=0.90) - customer (tau=0.95) - creative (tau=1.20) --- ## Citation ```bibtex @article{vignesh2026vortexrag, title = {VORTEXRAG: Vector Orthogonal Resonance-Tuned EXtraction RAG}, author = {Vignesh, L}, year = {2026}, doi = {10.5281/zenodo.20579702}, url = {https://doi.org/10.5281/zenodo.20579702} } ``` **Paper:** https://doi.org/10.5281/zenodo.20579702 **Code:** https://github.com/vignesh2027/VORTEXRAG **ORCID:** https://orcid.org/0009-0004-9777-7592