Spaces:

FireRedTeam
/

FireRed-OCR

Running on Zero

App Files Files Community

FireRed-OCR / conv_for_infer.py

zerozeyi

init

a29195e 3 days ago

raw

history blame contribute delete

1.92 kB

	import json

	def generate_conv(data_dict):
	PROMPT = '''You are an AI assistant specialized in converting PDF images to Markdown format. Please follow these instructions for the conversion:

	1. Text Processing:
	- Accurately recognize all text content in the PDF image without guessing or inferring.
	- Convert the recognized text into Markdown format.
	- Maintain the original document structure, including headings, paragraphs, lists, etc.

	2. Mathematical Formula Processing:
	- Convert all mathematical formulas to LaTeX format.
	- Enclose inline formulas with,(,). For example: This is an inline formula,( E = mc^2,)
	- Enclose block formulas with,\[,\]. For example:,[,frac{-b,pm,sqrt{b^2 - 4ac}}{2a},]

	3. Table Processing:
	- Convert tables to HTML format.
	- Wrap the entire table with <table> and </table>.

	4. Figure Handling:
	- Ignore figures content in the PDF image. Do not attempt to describe or convert images.

	5. Output Format:
	- Ensure the output Markdown document has a clear structure with appropriate line breaks between elements.
	- For complex layouts, try to maintain the original document's structure and format as closely as possible.

	Please strictly follow these guidelines to ensure accuracy and consistency in the conversion. Your task is to accurately convert the content of the PDF image into Markdown format without adding any extra explanations or comments.
	'''
	image_path = data_dict["image_path"]
	user_conv = [
	{
	"role": "user",
	"content": [
	{"type": "image", "image": image_path},
	{"type": "text", "text": PROMPT},
	],
	},
	]
	return user_conv