simple-rag-qa / pdf_loader.py
Matvii Hotovych
Added pdf data retrieving
6a7af6b
raw
history blame contribute delete
190 Bytes
def extract_text_from_pdf(pdf):
text = ""
for page in pdf.pages:
page_text = page.extract_text()
if page_text:
text += page_text + "\n"
return text