🔍 RAG Visualization Dashboard

RAG Categories Distribution

RAG Types by Year

📅 Evolution Timeline: Year → RAG Type

RAG Workflow Flowcharts

Chunking Methods Distribution

Chunking Categories & Methods

📅 Chunking Methods Timeline

Chunking Method Flowcharts

🧮 Indexing Techniques (Algorithmic Level)

Algorithmic methods for organizing and searching vectors efficiently. These are the core algorithms that determine how vectors are structured and retrieved (e.g., Flat Index, ANN, IVF, HNSW, PQ/OPQ).

💾 Vector Databases (System Level)

Complete systems/engines that implement indexing techniques along with additional features like persistence, metadata support, distributed architecture, and APIs. They are the production-grade platforms (e.g., FAISS, Milvus, Pinecone, Weaviate).

Indexing Technique	Description	Workflow	Pros	Cons	Peer-Reviewed Reference
Flat Index (Brute-Force)	Exact similarity search over all vectors using distance metrics.	Embed documents → store vectors → compare query with all vectors → return top-k	Exact results Simple implementation	Not scalable High latency & memory cost	📄Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval‑augmented generation for knowledge‑intensive NLP tasks. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, & H. T. Lin (Eds.), Advances in Neural Information Processing Systems, 33, 9459–9474. Curran Associates, Inc.
ANN (Approximate Nearest Neighbor)	Fast similarity search with controlled approximation.	Embed documents → build ANN structure → retrieve approximate neighbors	Fast retrieval Scales to millions/billions	Approximate (not exact)	📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
IVF (Inverted File Index)	Partitions vector space into clusters; searches only relevant partitions.	K-means clustering → assign vectors → query nearest clusters → search inside	Good speed-accuracy trade-off Scalable	Requires tuning Cluster-quality sensitive	📄Johnson, J., Douze, M., & Jégou, H. (2019). Billion‑scale similarity search with GPUs. IEEE Transactions on Big Data, 7(3), 535–547. https://doi.org/10.1109/TBDATA.2019.2921572
HNSW	Graph-based ANN using hierarchical proximity graphs.	Build layered graph → incremental insertion → top-down graph traversal	Very fast High recall Dynamic updates	High memory usage Complex construction	📄Malkov, Y. A., & Yashunin, D. A. (2020). Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(4), 824–836. https://doi.org/10.1109/TPAMI.2018.2889473
PQ / OPQ	Compresses vectors into short codes for memory-efficient ANN.	Subspace split → quantization → store codes → approximate distance	Memory efficient Enables billion-scale RAG	Lossy compression Lower recall if misconfigured	📄Jégou, H., Douze, M., & Schmid, C. (2011). Product quantization for nearest neighbor search. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(1), 117–128. https://doi.org/10.1109/TPAMI.2010.57

Vector DB	Description	Underlying Index	Pros	Cons	Peer-Reviewed Reference
FAISS	Research-oriented vector similarity library widely used in RAG.	Flat, IVF, HNSW, PQ, OPQ	Highly optimized Flexible ANN Academic standard	Not a full DBMS Limited metadata	📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
Milvus	Distributed open-source vector database for large-scale RAG.	HNSW, IVF-PQ	Scalable Distributed Open-source	Deployment complexity Operational overhead	📄Rusum, G. P., & Anasuri, S. (2025). Vector databases in modern applications: Real‑time search, recommendations, and retrieval‑augmented generation (RAG). International Journal of AI, BigData, Computational and Management Studies, 5(4), Article 113. https://doi.org/10.63282/3050‑9416.IJAIBDCMS‑V5I4P113
Pinecone	Fully managed vector database for production RAG systems.	Proprietary ANN (HNSW-like)	Fully managed High availability Scalable	Closed-source Opaque indexing	📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
Weaviate	Open-source vector DB with schema & metadata-aware retrieval.	HNSW	Simple API Schema support	High memory usage Limited ANN tuning	📄Rusum, G. P., & Anasuri, S. (2025). Vector databases in modern applications: Real‑time search, recommendations, and retrieval‑augmented generation (RAG). International Journal of AI, BigData, Computational and Management Studies, 5(4), Article 113. https://doi.org/10.63282/3050‑9416.IJAIBDCMS‑V5I4P113
Elasticsearch (Vector)	Hybrid sparse-dense retrieval using BM25 + vectors.	HNSW + BM25	Hybrid retrieval Mature ecosystem	Higher latency Slower pure vector search	📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
Chroma	Lightweight developer-focused vector store for prototyping.	HNSW (ANN backends)	Easy integration Fast prototyping	Limited scalability Not enterprise-grade	📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997

🔍 RAG Visualization Dashboard

RAG Categories Distribution

RAG Types by Year

📅 Evolution Timeline: Year → RAG Type

RAG Workflow Flowcharts

📥 Indexing Workflow

🔄 Inference Workflow

✅ Key Benefits

⚠️ Challenges