Commit History
Refactor. (#4487) 83ee116
Kevin Hu committed on
Catch the exception while parsing pptx. (#4202) 74b229a
Kevin Hu committed on
fix chunk method "Table" losing content when the Excel file has multi… (#4123) 75faa66
Fix json file parse (#4004) a11ffc5
Added static check at PR CI (#3921) fe9b6b3
Fix errors detected by Ruff (#3918) 0404a52
Fix parsing JSON file error (#3829) 43bceb7
Fix: page_chars attribute does not exist in some formats of PDF (#3796) 3c857ed
Fix out of boundary. (#3786) 1275b47
Kevin Hu committed on
Edit chunk shall update instead of insert it (#3709) 1b2aab6
Always open text file for write with UTF-8 (#3688) 9a73da2
Added kb_id filter to knn. Fix #3458 (#3513) aebd986
Introduced beartype (#3460) eae0334
Move settings initialization after module init phase (#3438) 6101699
Use consistent log file names, introduced initLogger (#3403) 8bc2fc9
fix bug about fetching knowledge graph (#3394) 6a9fa6b
Kevin Hu committed on
Rework logging (#3358) 22fe41e
Fix:#3230 When parsing a docx file using the Book parsing method, to_page is always -1, resulting in a block count of 0 even if parsing is successful (#3249) ff43695
kuschzzp Kevin Hu committed on
search between multiple indices for team function (#3079) 95863fc
Kevin Hu committed on
bigger resolution for OCR (#2919) 7b6220c
Kevin Hu committed on
fix: torch dependency start error (#2777) 0de98c4
trivial (#2650) 03a3e60
Kevin Hu committed on
rename get_txt to get_text (#2649) d624a4e
Kevin Hu committed on
Add get_txt function (#2639) 82adb1c
yqkcn Kevin Hu committed on
Fix: renrank_model and pdf_parser bugs | Update: session API (#2601) 678763e
liuhua liuhua committed on
add lighten control (#2567) dbcbb17
Kevin Hu committed on
fix parsing spaces in russian language PDFs (#1987) (#2427) bac5213
optimize text parser (#2144) 13b285d
Kevin Hu committed on
Format file format from Windows/dos to Unix (#1949) aeb6dbc
fix parameter error (#1925) 1f75d02
Kevin Hu committed on
add sql to naive parser (#1908) ef2a724
Kevin Hu committed on
add support for eml file parser (#1768) 6ed07a9
Fix docx parser line bug (#1715) dda4c86
H Kevin Hu committed on
fix: When parsing the bold content in PDF, the result is duplicated. (#1729) 971f83c
Fix pdfparser content confusion (#1700) eec0415
H committed on
pypdf2 to pypdf (#1684) 10534c3
Kevin Hu committed on
fix bug of ragflowdocxpparser (#1642) fe5404c
Kevin Hu committed on
fix generate error (#1590) b5e86a6
Kevin Hu committed on
fix bug about divided by zero (#1482) 4e6516f
Kevin Hu committed on
fix: Delete hardcode (#1464) 789efbc
Yuhao Tsui committed on
fix pdf_parser char content confusion (#1462) 1164cba
H committed on
fix pdf_parser content confusion (#1458) ece4f03
H committed on
Support table for markdown file in general parser (#1278) 5bfd79c
Fix occasional errors in pdf table recognition (#1277) b4b278b
feat: support json file (#1217) 77b7e10
fix too long query exception (#1195) 9aa975e
KevinHuSh committed on
add self-rag (#1070) a49657b
KevinHuSh committed on
Update readme and add license (#1018) 9cba22c
fix bug in pdf parser (#986) eefeab4
KevinHuSh committed on