RAG_AIEXP_01 / utils.py

Commit History

query normalization
49bfa92

MrSimple07 commited on

fixing normalizing hypens
9c77451

MrSimple07 commited on

removed normalization
4834e86

MrSimple07 commited on

added the new function to replace latin crylic c25
11e130c

MrSimple07 commited on

big debug change
04f5154

MrSimple07 commited on

top k 200, 50 + max chunk size = 10 000, max chunk row = 40
7062aff

MrSimple07 commited on

top k 200, 150 + max chunk size = 10 000, max chunk row = 40
46dedf9

MrSimple07 commited on

added debugging functions for the c25
8d6a517

MrSimple07 commited on

top k = 50 + topk rerank = 20 + max chunk size is 4000 + max rows =30 + sim cut off = 0.25
03dd25b

MrSimple07 commited on

top k = 100 + topk rerank = 30 + max chunk size is 1024 + max rows =5 + sim cut off = 0.25
95bcac7

MrSimple07 commited on

top k = 100 + max chunk size is 6000 + max rows =30 + sim cut off = 0.45
0647d48

MrSimple07 commited on

new utils py fixed
b867de8

MrSimple07 commited on

new api = retrieve chunks + some more text fixing
33c996e

MrSimple07 commited on

new dedublication
1ca91bc

MrSimple07 commited on

new documents prep
63ebb90

MrSimple07 commited on

new documents prep
38ed4e9

MrSimple07 commited on

new documents prep
d1e7fd2

MrSimple07 commited on

eski holat with utils
a42e1ff

MrSimple07 commited on

eski holat with utils
40de98c

MrSimple07 commited on

api key added
c28dd72

MrSimple07 commited on

api key added
b395a0b

MrSimple07 commited on

api key added
729578d

MrSimple07 commited on

top k reranker = 20, max rows = 10, max chars= 2000 + new deduplication
ec64429

MrSimple07 commited on

top k reranker = 20, max rows = 10, max chars= 4000 + new deduplication
d577496

MrSimple07 commited on

top k reranker = 25, max rows = 5, max chars= 4000
c0c8ab9

MrSimple07 commited on

Much lower reranking threshold (-0.5 instead of 0.1) + detailed score logging
806f3f9

MrSimple07 commited on

removed normalization doc id
ad8e8ec

MrSimple07 commited on

removed normalization doc id
c33deff

MrSimple07 commited on

index retriever = 100 + 100
31659d7

MrSimple07 commited on

index retriever = 100 + 100
8114c87

MrSimple07 commited on

max_chars = 1500 + doc id retriever
ae5a669

MrSimple07 commited on

new keyword score based index retriever + answer question
dfc7ba2

MrSimple07 commited on

new keyword score based index retriever + answer question
d2e7d9e

MrSimple07 commited on

chunk size = 1024 + max chars = 1200 + deduplication variant
f79b229

MrSimple07 commited on

chunk siz = 1000, max_chars = 1500
703587b

MrSimple07 commited on

new embeeding model + new create_quer_engine with keyword matching
2d1ebe6

MrSimple07 commited on

max chars = 2000 for tables + new answer_question
7565a55

MrSimple07 commited on

max rows = 10 + new answer_question + reranking
2edec29

MrSimple07 commited on

removed query enhanced
5166f44

MrSimple07 commited on

simplest version
2595129

MrSimple07 commited on

simplest version
30be7bf

MrSimple07 commited on

simplest version
f3e59e1

MrSimple07 commited on

simplest version
57e4dbd

MrSimple07 commited on

simplest version
09fe356

MrSimple07 commited on

simplest version
0b6ee4f

MrSimple07 commited on

simplest version
c7a9dbd

MrSimple07 commited on

simplest version
123a5db

MrSimple07 commited on