File size: 1,172 Bytes
1ca85a0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
eb36a32
 
 
 
 
 
 
6142d69
eb36a32
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
---
language:
- en
- ur
- hi
- es
tags:
- rag
- retrieval-augmented-generation
- multilingual
- faiss
- llama
- mistral
license: apache-2.0
library_name: transformers
pipeline_tag: question-answering
---


# Multilingual Knowledge RAG Bot – Cross-Lingual Retrieval-Augmented Generation

This model is designed for **cross-lingual question answering** using Retrieval-Augmented Generation (RAG).  
It can take documents in multiple languages — Urdu, Hindi, Spanish, English — and answer in the same or different language.

## Key Features
- **LLM Used:** Meta-Llama-3-8B-Instruct
- **Embedding Model:** `sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2`
- **RAG Pipeline:** FAISS-based vector search + context injection
- **Training/Processing:** Implemented entirely in Google Colab using open-source tools only
- **Zero paid APIs** — 100% free and deployable

## Techniques Used
- **Vector Database:** FAISS for similarity search
- **Cross-Lingual Embeddings:** multilingual sentence transformers
- **Prompt Engineering:** Context-aware question answering
- **Open-Source Deployment Ready:** Hugging Face Spaces compatible

---

### License
Apache-2.0