π΅οΈββοΈ GenAI Real Estate Forensic Detector (Track B)
Status: π Competition Submission for MenaML Winter School 2026
Track: B (Real Estate & Commercial Integrity)
Model Architecture: EfficientNetB0 (Fine-tuned)
Framework: TensorFlow / Keras
π Project Overview
As Generative AI becomes mainstream, "virtual staging" in real estate can mislead buyers by hiding structural flaws or fabricating non-existent features.
This project provides a Forensic Signal Detector capable of identifying GenAI-generated or heavily manipulated images in high-stakes public domains. It focuses on detecting:
- Texture Consistency: "Unnatural smoothness" in walls/skin.
- Compression Discrepancies: Digital splicing of objects.
- Frequency Domain Artifacts: Mathematical fingerprints left by GANs/Diffusion models.
π Evaluation Results
The model was trained on a dynamic stream of Places365 images with synthetic artifact injection.
| Metric | Best Value | Epoch Achieved |
|---|---|---|
| Training Accuracy | 99.07% | Epoch 1 |
| Training Loss | 0.0201 | Epoch 1 |
| Validation Stability | 97.34% | Epoch 5 |
π¦ Repository Contents
| File | Description |
|---|---|
forensic_model.h5 |
The Model. Fine-tuned EfficientNetB0 weights. |
predict.py |
Inference Script. Automates the detection process for a folder of images. |
hackathon_test_set.zip |
The Dataset. Contains 1,000 images (500 Real / 500 Fake) for testing. |
requirements.txt |
Dependencies. Required Python libraries. |
π» How to Run Inference
1. Installation
Clone the repository and install dependencies:
git clone [https://huggingface.co/](https://huggingface.co/)[FatimahEmadEldin]/{GenAI-RealEstate-Detector}
cd {GenAI-RealEstate-Detector}
pip install -r requirements.txt
2. Run Prediction
You can run the detection script on any folder of images:
python predict.py --input_dir ./path_to_images --output_file predictions.json
3. Output Format
The system produces a JSON report with authenticity scores and VLM-based reasoning:
{{
"image_name": "living_room_01.jpg",
"authenticity_score": 0.99,
"manipulation_type": "Synthetic/Inpainting",
"vlm_reasoning": "Forensic analysis detected high-frequency artifacts (0.99). Visual context suggests issues near: window reflection."
}}
π§ Training Strategy: "Self-Supervised Artifact Injection"
Instead of relying on a static dataset of "Fake" images (which may become outdated), we implemented an On-the-Fly Artifact Generator that corrupts real images (from Places365) during training.
The 3 Forensic Signals:
- Simulating Upscalers: Applied randomized Bi-cubic downsampling + Gaussian Blur.
- Simulating Diffusion: Injected High-Frequency Gaussian Noise distributions.
- Simulating GANs: Injected subtle Checkerboard Grid patterns (frequency domain fingerprints).
π License
This project uses the Places365 dataset (MIT CSAIL). The code is open-source under the MIT License.