AsherKnight committed
Commit ad2bdfb · 1 Parent(s): 3581003
.gitignore ADDED
@@ -0,0 +1 @@
+ .env
README.md CHANGED
@@ -13,6 +13,11 @@ short_description: Smart chatbot for tenancy and property image issues.
 
  # Multi-Agent Real Estate Chatbot – Detailed Explanation
 
+ ## Overview
+
+ This project is a multi-agent chatbot designed for the real estate domain. It intelligently handles both text-based tenancy FAQs and image-based property-issue troubleshooting. Users can interact with the chatbot by typing questions or uploading images, and the system automatically determines the best agent to respond, whether a legal assistant for tenancy issues or an image-based troubleshooting expert.
+
+
  ## Tools & Technologies Used
 
  ### 1. **Gradio**
@@ -93,4 +98,36 @@ short_description: Smart chatbot for tenancy and property image issues.
 
  ---
 
+ ## Storage and GPU Limitations: Compromises and Future Work
+
+ While designing and developing this system, we encountered several constraints due to limited computational resources, especially GPU memory, CPU power, and local/Colab-based VRAM and storage limits. These limitations affected multiple aspects of the solution architecture, leading to compromises in model choice and design. Below are the key instances where compromises were made, and the future work proposed to address them:
+
+ ---
+
+ ### 1. Model Selection for Text Generation
+
+ Initially, we aimed to use powerful large language models (LLMs) for text generation. Due to storage and compute limitations, we opted for a smaller variant of the LLaMA model: LLaMA models are known for strong performance on text-generation tasks and are open source, making them well suited to POC-level work.
+
+ - **Compromise**: Used a lightweight LLaMA variant for local compatibility.
+ - **Future Work**: Once resources are scaled, incorporate larger LLaMA models (e.g., `llama-3.1-8b-instruct`) or explore commercial models such as GPT-4o or Claude (Anthropic) for enhanced performance and more natural generated output.
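As a rough illustration of this trade-off, the upgrade path can be as small as swapping the model id. The helper name and VRAM thresholds below are illustrative assumptions, not measured requirements; the two model ids come from the project's code and README:

```python
# Hypothetical helper: choose a model id based on available VRAM.
# Thresholds are illustrative assumptions, not measured requirements.
SMALL_MODEL = "meta-llama/Llama-3.2-3B-Instruct"  # current lightweight choice
LARGE_MODEL = "meta-llama/Llama-3.1-8B-Instruct"  # future, resource-permitting

def pick_model_id(vram_gb: float) -> str:
    """Fall back to the small instruct model when VRAM is tight."""
    return LARGE_MODEL if vram_gb >= 24 else SMALL_MODEL
```

Keeping the rest of the pipeline unchanged, only this id would need to change once better hardware is available.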
+
+ ---
+
+ ### 2. Zero-shot Text Classification for Agent Routing
+
+ A critical planned feature was dynamic agent switching based on conversation context. For example, if a user, while discussing an image, begins asking tenancy-related questions, a classification pipeline would detect the intent and automatically switch to the relevant agent, passing along the full context. Initially, we used `facebook/bart-large-mnli` for this zero-shot classification task.
+
+ - **Compromise**: Due to low GPU/CPU/VRAM availability on local setups and Google Colab, we had to remove this functionality.
+ - **Future Work**: With access to more powerful hardware or inference APIs, we can reintegrate this feature, significantly improving conversation flow and user experience.
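A minimal sketch of how such routing might look, with the classifier passed in as a callable so the heavyweight model stays optional. Only `facebook/bart-large-mnli` and the tenancy agent come from the project; the candidate labels and agent names here are illustrative assumptions:

```python
def route_message(message, classify):
    """Pick an agent for a message via zero-shot classification.

    `classify(text, candidate_labels)` is expected to behave like the
    transformers zero-shot pipeline, e.g.:
        classify = pipeline("zero-shot-classification",
                            model="facebook/bart-large-mnli")
    which returns {"labels": [...], "scores": [...]} sorted by
    descending score.
    """
    labels = ["tenancy question", "property image issue"]  # illustrative
    result = classify(message, labels)
    top_label = result["labels"][0]
    # Agent names are hypothetical placeholders for the two agents.
    return "tenancy_faq_agent" if top_label == "tenancy question" else "image_issue_agent"
```

In production the real pipeline object would be passed as `classify`; injecting it this way also makes the routing logic testable with a stub.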
+
+ ---
+
+ ### 3. Multi-model Output Scoring Pipeline
+
+ To boost output quality, we planned to generate responses simultaneously with both LLaMA and Mistral models, then run a scoring mechanism to select the most relevant response.
+
+ - **Compromise**: Resource constraints made it infeasible to load and run multiple LLMs in parallel.
+ - **Future Work**: Revisit the multi-model setup once better hardware (or hosted services) is available, enabling ensemble-style approaches for higher-quality text generation and more reliable responses.
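The selection step itself is model-agnostic. A sketch with a stand-in relevance heuristic (the real pipeline would score LLaMA and Mistral outputs; the keyword-overlap scorer below is purely illustrative):

```python
def pick_best_response(candidates, score):
    """Return the candidate response with the highest score."""
    return max(candidates, key=score)

def keyword_overlap_score(query_terms):
    """Illustrative scorer: count how many query terms a response contains."""
    def score(response):
        words = response.lower().split()
        return sum(term in words for term in query_terms)
    return score
```

A learned reward model or an LLM-as-judge could replace `keyword_overlap_score` without touching the selection logic.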
+
+
  This detailed overview describes the tools and logic behind the switching mechanism that allows the chatbot to provide contextual and multimodal support effectively.
agents/agent2_tenancy_faq.py CHANGED
@@ -45,7 +45,12 @@ def handle_tenancy_query(user_query, user_context, history=[], location_method="
      if location:
          user_context["location"] = location
 
-     system_prompt = "You are a legal assistant specializing in tenancy laws."
+     system_prompt = (
+         "You are a legal assistant specializing in tenancy laws. "
+         "Your primary objective is to provide prompt advice and answers to the user. "
+         "Only ask follow-up questions if absolutely necessary and only when you are unclear about the user's request."
+     )
+
      prompt=""
      if location:
          prompt += f" The user is from {location}."
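For context, the system prompt and the location hint typically end up in separate chat roles when sent to an instruct model. A sketch with a hypothetical helper (the repo assembles its prompt string inline; this is not its code):

```python
def build_messages(system_prompt, user_query, location=None):
    """Combine the system prompt with a location-aware user turn."""
    user_text = ""
    if location:
        # Mirrors the agent's pattern of prepending the user's location.
        user_text += f"The user is from {location}. "
    user_text += user_query
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_text},
    ]
```

A messages list in this shape can be fed to `tokenizer.apply_chat_template` in transformers.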
requirements.txt CHANGED
@@ -5,4 +5,5 @@ Pillow
  gradio
  ultralytics
  spacy
- geotext
+ geotext
+ python-dotenv
utils/llm_utils.py CHANGED
@@ -1,13 +1,13 @@
  from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline
  import torch
  import os
-
+ from dotenv import load_dotenv
  class LLaMAHelper:
      def __init__(self, hf_token=None):
          self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
          self.model_id = "meta-llama/Llama-3.2-3B-Instruct"
-
-         hf_token = hf_token or os.getenv("HUGGINGFACE_TOKEN")
+         load_dotenv()
+         hf_token = hf_token or os.getenv("HF_TOKEN")
 
          self.tokenizer = AutoTokenizer.from_pretrained(self.model_id, token=hf_token)
          self.model = AutoModelForCausalLM.from_pretrained(
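The token-resolution order above (explicit argument first, then the environment, which `load_dotenv()` populates from `.env`) can be captured in a small stdlib-only sketch; the helper name is hypothetical:

```python
import os

def resolve_hf_token(explicit=None, env_var="HF_TOKEN"):
    """Explicit argument wins; otherwise fall back to the environment,
    which python-dotenv's load_dotenv() fills in from a .env file."""
    return explicit or os.getenv(env_var)
```

Because `.env` is now in `.gitignore`, the token stays out of version control while still reaching `os.getenv` at runtime.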