Soha85 commited on
Commit
1c221a5
·
verified ·
1 Parent(s): ce4661b

Update index.html

Browse files
Files changed (1) hide show
  1. index.html +11 -11
index.html CHANGED
@@ -560,7 +560,7 @@
560
  <td>Not scalable<br>High latency & memory cost</td>
561
  <td class="timeline-reference">
562
  <a href="https://proceedings.neurips.cc/paper_files/paper/2020/file/6b493230205f780e1bc26945df7481e5-Paper.pdf" target="_blank">
563
- Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval‑augmented generation for knowledge‑intensive NLP tasks. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, & H. T. Lin (Eds.), Advances in Neural Information Processing Systems, 33, 9459–9474. Curran Associates, Inc.
564
  </a>
565
  </td>
566
  </tr>
@@ -575,7 +575,7 @@
575
  <td>Approximate (not exact)</td>
576
  <td class="timeline-reference">
577
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
578
- Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
579
  </a>
580
  </td>
581
  </tr>
@@ -590,7 +590,7 @@
590
  <td>Requires tuning<br>Cluster-quality sensitive</td>
591
  <td class="timeline-reference">
592
  <a href="https://arxiv.org/pdf/1702.08734" target="_blank">
593
- Johnson, J., Douze, M., & Jégou, H. (2019). Billion‑scale similarity search with GPUs. IEEE Transactions on Big Data, 7(3), 535–547. https://doi.org/10.1109/TBDATA.2019.2921572
594
  </a>
595
  </td>
596
  </tr>
@@ -605,7 +605,7 @@
605
  <td>High memory usage<br>Complex construction</td>
606
  <td class="timeline-reference">
607
  <a href="https://arxiv.org/abs/1603.09320" target="_blank">
608
- Malkov, Y. A., & Yashunin, D. A. (2020). Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(4), 824–836. https://doi.org/10.1109/TPAMI.2018.2889473
609
  </a>
610
  </td>
611
  </tr>
@@ -620,7 +620,7 @@
620
  <td>Lossy compression<br>Lower recall if misconfigured</td>
621
  <td class="timeline-reference">
622
  <a href="https://www.irisa.fr/texmex/people/jegou/papers/jegou_searching_with_quantization.pdf" target="_blank">
623
- Jégou, H., Douze, M., & Schmid, C. (2011). Product quantization for nearest neighbor search. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(1), 117–128. https://doi.org/10.1109/TPAMI.2010.57
624
  </a>
625
  </td>
626
  </tr>
@@ -649,7 +649,7 @@
649
  <td>Not a full DBMS<br>Limited metadata</td>
650
  <td class="timeline-reference">
651
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
652
- Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
653
  </a>
654
  </td>
655
  </tr>
@@ -662,7 +662,7 @@
662
  <td>Deployment complexity<br>Operational overhead</td>
663
  <td class="timeline-reference">
664
  <a href="https://ijaibdcms.org/index.php/ijaibdcms/article/view/257?utm_source=chatgpt.com" target="_blank">
665
- Rusum, G. P., & Anasuri, S. (2025). Vector databases in modern applications: Real‑time search, recommendations, and retrieval‑augmented generation (RAG). International Journal of AI, BigData, Computational and Management Studies, 5(4), Article 113. https://doi.org/10.63282/3050‑9416.IJAIBDCMS‑V5I4P113
666
  </a>
667
  </td>
668
  </tr>
@@ -675,7 +675,7 @@
675
  <td>Closed-source<br>Opaque indexing</td>
676
  <td class="timeline-reference">
677
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
678
- Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
679
  </a>
680
  </td>
681
  </tr>
@@ -688,7 +688,7 @@
688
  <td>High memory usage<br>Limited ANN tuning</td>
689
  <td class="timeline-reference">
690
  <a href="https://ijaibdcms.org/index.php/ijaibdcms/article/view/257?utm_source=chatgpt.com" target="_blank">
691
- Rusum, G. P., & Anasuri, S. (2025). Vector databases in modern applications: Real‑time search, recommendations, and retrieval‑augmented generation (RAG). International Journal of AI, BigData, Computational and Management Studies, 5(4), Article 113. https://doi.org/10.63282/3050‑9416.IJAIBDCMS‑V5I4P113
692
  </a>
693
  </td>
694
  </tr>
@@ -701,7 +701,7 @@
701
  <td>Higher latency<br>Slower pure vector search</td>
702
  <td class="timeline-reference">
703
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
704
- Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
705
  </a>
706
  </td>
707
  </tr>
@@ -714,7 +714,7 @@
714
  <td>Limited scalability<br>Not enterprise-grade</td>
715
  <td>
716
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
717
- Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
718
  </a>
719
  </td>
720
  </tr>
 
560
  <td>Not scalable<br>High latency & memory cost</td>
561
  <td class="timeline-reference">
562
  <a href="https://proceedings.neurips.cc/paper_files/paper/2020/file/6b493230205f780e1bc26945df7481e5-Paper.pdf" target="_blank">
563
+ 📄Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval‑augmented generation for knowledge‑intensive NLP tasks. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, & H. T. Lin (Eds.), Advances in Neural Information Processing Systems, 33, 9459–9474. Curran Associates, Inc.
564
  </a>
565
  </td>
566
  </tr>
 
575
  <td>Approximate (not exact)</td>
576
  <td class="timeline-reference">
577
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
578
+ 📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
579
  </a>
580
  </td>
581
  </tr>
 
590
  <td>Requires tuning<br>Cluster-quality sensitive</td>
591
  <td class="timeline-reference">
592
  <a href="https://arxiv.org/pdf/1702.08734" target="_blank">
593
+ 📄Johnson, J., Douze, M., & Jégou, H. (2019). Billion‑scale similarity search with GPUs. IEEE Transactions on Big Data, 7(3), 535–547. https://doi.org/10.1109/TBDATA.2019.2921572
594
  </a>
595
  </td>
596
  </tr>
 
605
  <td>High memory usage<br>Complex construction</td>
606
  <td class="timeline-reference">
607
  <a href="https://arxiv.org/abs/1603.09320" target="_blank">
608
+ 📄Malkov, Y. A., & Yashunin, D. A. (2020). Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(4), 824–836. https://doi.org/10.1109/TPAMI.2018.2889473
609
  </a>
610
  </td>
611
  </tr>
 
620
  <td>Lossy compression<br>Lower recall if misconfigured</td>
621
  <td class="timeline-reference">
622
  <a href="https://www.irisa.fr/texmex/people/jegou/papers/jegou_searching_with_quantization.pdf" target="_blank">
623
+ 📄Jégou, H., Douze, M., & Schmid, C. (2011). Product quantization for nearest neighbor search. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(1), 117–128. https://doi.org/10.1109/TPAMI.2010.57
624
  </a>
625
  </td>
626
  </tr>
 
649
  <td>Not a full DBMS<br>Limited metadata</td>
650
  <td class="timeline-reference">
651
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
652
+ 📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
653
  </a>
654
  </td>
655
  </tr>
 
662
  <td>Deployment complexity<br>Operational overhead</td>
663
  <td class="timeline-reference">
664
  <a href="https://ijaibdcms.org/index.php/ijaibdcms/article/view/257?utm_source=chatgpt.com" target="_blank">
665
+ 📄Rusum, G. P., & Anasuri, S. (2025). Vector databases in modern applications: Real‑time search, recommendations, and retrieval‑augmented generation (RAG). International Journal of AI, BigData, Computational and Management Studies, 5(4), Article 113. https://doi.org/10.63282/3050‑9416.IJAIBDCMS‑V5I4P113
666
  </a>
667
  </td>
668
  </tr>
 
675
  <td>Closed-source<br>Opaque indexing</td>
676
  <td class="timeline-reference">
677
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
678
+ 📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
679
  </a>
680
  </td>
681
  </tr>
 
688
  <td>High memory usage<br>Limited ANN tuning</td>
689
  <td class="timeline-reference">
690
  <a href="https://ijaibdcms.org/index.php/ijaibdcms/article/view/257?utm_source=chatgpt.com" target="_blank">
691
+ 📄Rusum, G. P., & Anasuri, S. (2025). Vector databases in modern applications: Real‑time search, recommendations, and retrieval‑augmented generation (RAG). International Journal of AI, BigData, Computational and Management Studies, 5(4), Article 113. https://doi.org/10.63282/3050‑9416.IJAIBDCMS‑V5I4P113
692
  </a>
693
  </td>
694
  </tr>
 
701
  <td>Higher latency<br>Slower pure vector search</td>
702
  <td class="timeline-reference">
703
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
704
+ 📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
705
  </a>
706
  </td>
707
  </tr>
 
714
  <td>Limited scalability<br>Not enterprise-grade</td>
715
  <td>
716
  <a href="https://simg.baai.ac.cn/paperfile/25a43194-c74c-4cd3-b60f-0a1f27f8b8af.pdf" target="_blank">
717
+ 📄Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval‑Augmented Generation for Large Language Models: A Survey (arXiv:2312.10997). arXiv Preprint. https://doi.org/10.48550/arXiv.2312.10997
718
  </a>
719
  </td>
720
  </tr>