rag-w-binary-quant / src /rag_pipeline.py
serverdaun's picture
Implement collection management in Milvus: drop collection during cleanup and on startup if no documents exist. Update embedding generation to clarify data types and improve error handling. Refactor vector store collection creation logic to avoid unnecessary drops.
2f81d82
raw
history blame
792 Bytes
from dotenv import load_dotenv
from langchain.chat_models import init_chat_model
from langchain_core.messages import HumanMessage

# Load .env before importing config so environment-derived settings
# (API keys, overrides) are in place when .config module code runs.
# NOTE: the import order here is intentional — do not move load_dotenv
# below the .config import.
load_dotenv(override=True)
from .config import MODEL_NAME, MODEL_PROVIDER, PROMPT, TEMPERATURE

# Module-level chat model, initialized once at import time and shared
# by all calls to answer_question.
llm = init_chat_model(
MODEL_NAME, model_provider=MODEL_PROVIDER, temperature=TEMPERATURE
)
def answer_question(query: str, contexts: list[str]) -> str:
    """Generate an answer to *query* grounded in the given *contexts*.

    Fills the module-level PROMPT template with the contexts and query,
    sends it to the shared LLM as a single human message, and returns
    the model's text response.

    Args:
        query: The question to answer.
        contexts: Retrieved context passages to ground the answer.

    Returns:
        The model's answer as a string.
    """
    filled_prompt = PROMPT.format(contexts=contexts, query=query)
    messages = [HumanMessage(content=filled_prompt)]
    result = llm.invoke(messages)
    return result.content