Zaitounish commited on
Commit
cea4125
·
1 Parent(s): eb1ffb2

Upload My findings about Parent Document R.txt

Browse files
My findings about Parent Document R.txt ADDED
@@ -0,0 +1,27 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ My findings about "Parent Document Retriever"
2
+
3
+ 1) brief about the method/technique
4
+
5
+ if there is one specific splited document that is quite big then that means the embedding itself will be less specific. however, if it is small then the embedding itself will be more specific.
6
+ with split parent docs and the splited child docs, the embeddings themselves become more specific
7
+
8
+ The ParentDocumentRetriever strikes that balance by splitting and storing small chunks of data. During retrieval, it first fetches the small chunks but then looks up the parent IDs for those chunks and returns those larger documents
9
+
10
+ ********************************
11
+
12
+ 2) why do we need to use this technique
13
+
14
+ so basically what parent document retrieval does is look into the documents to find the most relevant answer to a query, and how it is being done is very simple that would be by dividing the large amount of data that we have ( which by the way could be complex too) and then have different parent documents and within the parent documents we do have child documents.
15
+ that being said the embeddings will become more specific
16
+
17
+ ********************************
18
+
19
+ 3) what would make this technique the best choice
20
+
21
+ so this technique really shines when we are dealing with large amounts of data and complex relationships between documents, and it is beneficial when information is scattered over many different sources or when the data isn't easily searchable.
22
+
23
+ ********************************
24
+
25
+ 4) what type of data would be there if we want to use that technique
26
+
27
+ it would be text-based documents which could be news articles, research papers, user reviews, or any other form of written document.