---
title: Semantic Book Search (2.4M)
emoji: 📚
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 6.1.0
app_file: app.py
pinned: false
license: mit
---

# 📚 Semantic Book Search Engine

Welcome to the **AI-powered Book Search Engine**.
Instead of matching exact keywords, this tool lets you find books by **describing the plot, the atmosphere, or the emotions** you are looking for.

The system indexes over **2.4 million books**, so you can uncover hidden gems through sentence embeddings and vector similarity search rather than keyword matching.

## 🚀 How to use it

### 1. 🔎 Search by Plot (Semantic Search)
Can't remember the title? Looking for a specific vibe?
* Try: *"A dystopian novel where books are banned and burned by firemen"*
* Try: *"A psychological thriller set in Victorian London with a plot twist"*
* The model encodes the **concept** behind your description as a vector and retrieves the most semantically similar books.
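
To make the retrieval step concrete, here is a minimal sketch of ranking by cosine similarity. It is an illustration only: the 3-dimensional vectors, the labels in `toy_index`, and the `search` helper are hypothetical stand-ins, and the real app relies on the embedding model and Qdrant rather than a Python loop.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def search(query_vector, index, top_k=2):
    """Return the top_k (label, score) pairs, most similar first."""
    scored = [(label, cosine_similarity(query_vector, vec))
              for label, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]

# Hypothetical toy embeddings; real sentence embeddings have hundreds of dimensions.
toy_index = {
    "Dystopia about censorship": [0.9, 0.1, 0.0],
    "Victorian thriller": [0.1, 0.9, 0.2],
    "Cookbook": [0.0, 0.1, 0.9],
}

# Stand-in for the vector the embedding model would produce for the query text.
toy_query = [0.8, 0.2, 0.1]
results = search(toy_query, toy_index)
```

Because the ranking depends only on vector direction, a description that shares no words with a book's blurb can still match it if the two embeddings point the same way.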

### 2. 📖 I liked... (Recommendation)
Did you love a specific book?
* Switch to the second tab.
* Search for a title (e.g., *"Harry Potter"*).
* The system retrieves that book's stored vector from the database and recommends the books whose vectors are closest to it in the embedding space, which tends to surface similar style, genre, and plot.
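
The recommendation flow can be sketched in the same spirit. The key difference from plot search is that no new text is embedded: the seed book's stored vector is reused as the query, and the seed itself is excluded from the results. The ids, vectors, and `recommend` helper below are hypothetical; this is a sketch under those assumptions, not the app's actual code.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def recommend(seed_id, stored_vectors, top_k=2):
    """Rank every other book by similarity to the seed book's stored vector."""
    seed = stored_vectors[seed_id]  # reuse the stored vector; no model call needed
    scored = [(book_id, cosine_similarity(seed, vec))
              for book_id, vec in stored_vectors.items()
              if book_id != seed_id]  # never recommend the seed itself
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [book_id for book_id, _ in scored[:top_k]]

# Hypothetical stored vectors keyed by book id.
stored = {
    "book-a": [0.9, 0.1, 0.1],
    "book-b": [0.8, 0.3, 0.0],
    "book-c": [0.0, 0.2, 0.9],
    "book-d": [0.1, 0.9, 0.1],
}
recommendations = recommend("book-a", stored)
```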

---

## 🛠️ Under the Hood (Technical Architecture)

This project is a showcase of **End-to-End AI Engineering**, designed to handle large-scale datasets in a **Low-Resource Environment**.

* **Dataset:** ~2.4 Million books processed and indexed.
* **AI Embedding Model:** `paraphrase-multilingual-MiniLM-L12-v2`.
* **Hybrid Retrieval Architecture:**
    * 🧠 **Qdrant (Vector DB):** Handles semantic similarity search. Vectors are compressed using **INT8 Scalar Quantization**.
    * 🗄️ **Turso (LibSQL):** Relational database for low-latency metadata retrieval (Title, Author, Year, Rating), keeping the vector payload lightweight.
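
The two ideas in the hybrid design can be illustrated with a short sketch. The first part shows a generic symmetric INT8 scalar quantization (one scale per vector), which is an assumption for illustration and not necessarily the scheme Qdrant uses internally. The second part shows the metadata lookup pattern, using Python's built-in `sqlite3` as a stand-in for Turso/LibSQL. The `books` table layout and the sample rows are hypothetical.

```python
import sqlite3

def quantize_int8(vector):
    """Map floats to integers in [-127, 127] using one scale per vector."""
    max_abs = max(abs(x) for x in vector) or 1.0  # avoid a zero scale
    scale = max_abs / 127.0
    codes = [round(x / scale) for x in vector]
    return codes, scale

def dequantize(codes, scale):
    """Approximate the original floats from the INT8 codes."""
    return [c * scale for c in codes]

def fetch_metadata(conn, book_ids):
    """Fetch metadata rows for the given ids, preserving the ranking order."""
    placeholders = ",".join("?" for _ in book_ids)
    rows = conn.execute(
        f"SELECT id, title, author, year, rating FROM books "
        f"WHERE id IN ({placeholders})",
        book_ids,
    ).fetchall()
    by_id = {row[0]: row for row in rows}
    return [by_id[i] for i in book_ids if i in by_id]

# Part 1: each float shrinks to one byte at the cost of a small rounding error.
original = [0.4, -1.0, 0.2]
codes, scale = quantize_int8(original)
restored = dequantize(codes, scale)

# Part 2: the vector store returns only ids; metadata lives in the relational DB.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE books (id INTEGER PRIMARY KEY, title TEXT, "
    "author TEXT, year INTEGER, rating REAL)"
)
conn.executemany(
    "INSERT INTO books VALUES (?, ?, ?, ?, ?)",
    [
        (1, "Example Title A", "Example Author A", 1990, 4.1),
        (2, "Example Title B", "Example Author B", 2005, 3.8),
        (3, "Example Title C", "Example Author C", 2018, 4.5),
    ],
)
hits = [3, 1]  # ids as a vector search might return them, best match first
metadata = fetch_metadata(conn, hits)
```

Keeping titles, authors, and ratings out of the vector store means each indexed point carries little more than an id and a compressed vector, which matters when 2.4 million of them must fit in a constrained environment.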

### 👨‍💻 Author
**Antonio Gagliostro**
* [GitHub Profile](https://github.com/ninooo96)
* [LinkedIn](https://www.linkedin.com/in/antonio-gagliostro-1b4751121)