File size: 1,829 Bytes
e820a8a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50

# ArabicRAG: Arabic Retrieval-Augmented Generation

### Project Structure
# arabic_legal_search/
# β”œβ”€β”€ config.py
# β”œβ”€β”€ document_processor.py
# β”œβ”€β”€ embedding_generator.py
# β”œβ”€β”€ search_engine.py
# β”œβ”€β”€ response_generator.py
# β”œβ”€β”€ utils.py
# β”œβ”€β”€ main.py
# └── requirements.txt
## Overview
ArabicRAG is an open-source project designed to leverage the power of retrieval-augmented generation for processing and understanding Arabic legal documents. The system integrates advanced NLP techniques to retrieve relevant documents and generate context-aware responses.

## Features
- **Document Processing**: Load and preprocess Arabic text documents efficiently.
- **Embedding Generation**: Utilize multilingual models to generate embeddings for Arabic text.
- **Efficient Search**: Leverage FAISS for fast and efficient similarity search in large document corpora.
- **Response Generation**: Use state-of-the-art transformer models to generate responses based on retrieved context.

## Installation
To set up your environment and run ArabicRAG, follow these steps:

1. Clone the repository:
   ```bash
   git clone https://github.com/maljefairi/arabicRAG
   ```
2. Install the required packages:
   ```bash
   pip install -r requirements.txt
   ```

## Usage
After installation, you can run the main script to start processing documents:
```bash
python main.py
```

## Contributing
Contributions are welcome! For major changes, please open an issue first to discuss what you would like to change. Please make sure to update tests as appropriate.

## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## Contact
- **Dr. Mohammed Al-Jefairi** - maljefairi@sidramail.com
- **GitHub**: [maljefairi](https://github.com/maljefairi/arabicRAG)