File size: 1,418 Bytes
b163dc2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37


## Document Search by Metadata
<callout>PageIndex with metadata support is in closed beta. Fill out this form to request early access to this feature.</callout>

For documents that can be easily distinguished by metadata, we recommend using metadata to search the documents.
This method is ideal for the following document types:
- Financial reports categorized by company and time period
- Legal documents categorized by case type
- Medical records categorized by patient or condition
- And many others

In such cases, you can search documents by leveraging their metadata. A popular method is to use "Query to SQL" for document retrieval.


### Example Pipeline

#### PageIndex Tree Generation
Upload all documents into PageIndex to get their `doc_id`.

#### Set up SQL tables

Store documents along with their metadata and the PageIndex `doc_id` in a database table.

#### Query to SQL

Use an LLM to transform a user’s retrieval request into a SQL query to fetch relevant documents.

#### Retrieve with PageIndex

Use the PageIndex `doc_id` of the retrieved documents to perform further retrieval via the PageIndex retrieval API.

## 💬 Help & Community
Contact us if you need any advice on conducting document searches for your use case.

- 🤝 [Join our Discord](https://discord.gg/VuXuf29EUj)  
- 📨 [Leave us a message](https://ii2abc2jejf.typeform.com/to/meB40zV0)