RAG_AIEXP_01 / utils.py

Commit History

max chars = 3000 + max rows =20, topk 60+ 60 = 120, cut off = 0.55
b2055b1

MrSimple07 commited on

new normalizer C to Latin C
78e6c03

MrSimple07 commited on

Extracts weld type from query
52b85db

MrSimple07 commited on

old utils + topk 50 + 50, hybrid = 100, cut off = 0.45
f74c675

MrSimple07 commited on

old utils + topk 70 + 70, hybrid = 100, cut off = 0.45
118cb14

MrSimple07 commited on

removed normalizing query
c9c9c52

MrSimple07 commited on

added the normalization of functions + max chars = 2500+ max rows = 15
722ce17

MrSimple07 commited on

new normalizing + max chars 3000 max rows =15
fc9fe78

MrSimple07 commited on

added the 100 topk
75fe00d

MrSimple07 commited on

removed the part removing hyperh + top 80, cutoff = 0.55
429d2d4

MrSimple07 commited on

chunk size = 3000, 15 + normalized query
154e611

MrSimple07 commited on

normalized fixed + in header text as well
57a8908

MrSimple07 commited on

query normalization
49bfa92

MrSimple07 commited on

fixing normalizing hypens
9c77451

MrSimple07 commited on

removed normalization
4834e86

MrSimple07 commited on

added the new function to replace latin crylic c25
11e130c

MrSimple07 commited on

big debug change
04f5154

MrSimple07 commited on

top k 200, 50 + max chunk size = 10 000, max chunk row = 40
7062aff

MrSimple07 commited on

top k 200, 150 + max chunk size = 10 000, max chunk row = 40
46dedf9

MrSimple07 commited on

added debugging functions for the c25
8d6a517

MrSimple07 commited on

top k = 50 + topk rerank = 20 + max chunk size is 4000 + max rows =30 + sim cut off = 0.25
03dd25b

MrSimple07 commited on

top k = 100 + topk rerank = 30 + max chunk size is 1024 + max rows =5 + sim cut off = 0.25
95bcac7

MrSimple07 commited on

top k = 100 + max chunk size is 6000 + max rows =30 + sim cut off = 0.45
0647d48

MrSimple07 commited on

new utils py fixed
b867de8

MrSimple07 commited on

new api = retrieve chunks + some more text fixing
33c996e

MrSimple07 commited on

new dedublication
1ca91bc

MrSimple07 commited on

new documents prep
63ebb90

MrSimple07 commited on

new documents prep
38ed4e9

MrSimple07 commited on

new documents prep
d1e7fd2

MrSimple07 commited on

eski holat with utils
a42e1ff

MrSimple07 commited on

eski holat with utils
40de98c

MrSimple07 commited on

api key added
c28dd72

MrSimple07 commited on

api key added
b395a0b

MrSimple07 commited on

api key added
729578d

MrSimple07 commited on

top k reranker = 20, max rows = 10, max chars= 2000 + new deduplication
ec64429

MrSimple07 commited on

top k reranker = 20, max rows = 10, max chars= 4000 + new deduplication
d577496

MrSimple07 commited on

top k reranker = 25, max rows = 5, max chars= 4000
c0c8ab9

MrSimple07 commited on

Much lower reranking threshold (-0.5 instead of 0.1) + detailed score logging
806f3f9

MrSimple07 commited on

removed normalization doc id
ad8e8ec

MrSimple07 commited on

removed normalization doc id
c33deff

MrSimple07 commited on

index retriever = 100 + 100
31659d7

MrSimple07 commited on

index retriever = 100 + 100
8114c87

MrSimple07 commited on

max_chars = 1500 + doc id retriever
ae5a669

MrSimple07 commited on

new keyword score based index retriever + answer question
dfc7ba2

MrSimple07 commited on

new keyword score based index retriever + answer question
d2e7d9e

MrSimple07 commited on

chunk size = 1024 + max chars = 1200 + deduplication variant
f79b229

MrSimple07 commited on

chunk siz = 1000, max_chars = 1500
703587b

MrSimple07 commited on