A newer version of the Gradio SDK is available: 6.10.0
๐ QUICK START GUIDE - MVMยฒ
โก Fastest Way to Get Started
Step 1: Open Terminal in Project Directory
cd c:\Users\Varshith Dharmaj\Downloads\major\math_verification_mvp
Step 2: Run the Startup Script
.\start.ps1
Choose option 2 for quick demo (Dashboard Only)
๐ What You'll See
Dashboard opens at: http://localhost:8501
Two input modes:
- ๐ Text Input - Try the pre-filled example
- ๐ท Image Upload - Upload a handwritten math problem
Click "Verify Solution" to see results
๐งช Testing the System
Quick Test (No Services Required)
The dashboard will work in demo mode even without microservices running.
Full Test (All Services)
.\start.ps1
Choose option 1 - This opens 4 windows:
- OCR Service (Port 8001)
- SymPy Service (Port 8002)
- LLM Service (Port 8003)
- Dashboard (Port 8501)
๐ฏ Try These Examples
Example 1: Valid Solution โ
Problem: "Janet has 3 apples. She buys 2 more. She gives 1 away."
Steps:
Janet starts with 3 apples
She buys 2 more: 3 + 2 = 5 apples
She gives 1 away: 5 - 1 = 4 apples
Expected: VALID with high confidence
Example 2: Error Detection โ
Problem: "There are 5 boxes with 8 apples each."
Steps:
Number of boxes = 5
Apples per box = 8
Total = 5 ร 8 = 45
Expected: ERROR detected (5 ร 8 = 40, not 45)
๐ง Prerequisites
Required (Basic Demo)
- โ Python 3.10+
- โ Virtual environment (./start.ps1 creates this automatically)
Optional (Full Features)
- Tesseract OCR (for image processing)
- Gemini API Key (for LLM reasoning)
๐ฆ Installing Additional Components
Tesseract OCR (for Image Mode)
- Download: https://github.com/tesseract-ocr/tesseract
- Install and add to PATH
- Restart terminal
Gemini API Key (for LLM Features)
- Get free key: https://ai.google.dev/
- Copy
.env.templateto.env - Add:
GEMINI_API_KEY=your_key_here
๐ Troubleshooting
"Module not found"
.\venv\Scripts\Activate.ps1
pip install -r requirements.txt
"Port already in use"
Close any applications using ports 8001-8003, 8501
Services not connecting
- Check if all service windows are still open
- Look for error messages in service windows
- Restart the startup script
๐ What to Expect
Performance Metrics
- โฑ๏ธ Processing time: 1-5 seconds per problem
- ๐ฏ Accuracy: 68%+ on valid test cases
- ๐ Error detection: 78%+ when errors present
Features Working
- โ Text input verification
- โ Multi-model consensus
- โ Error detection and reporting
- โ Confidence scoring
- โ Agreement analysis
Image Input (Requires Tesseract)
- ๐ท Handwritten math problems
- ๐ Printed worksheets
- ๐ผ๏ธ Whiteboard photos
๐ Research Features Demonstrated
- Multimodal Input - Accept both text and images
- Weighted Consensus - Symbolic (40%), LLM (35%), ML (25%)
- OCR-Aware Calibration - Novel uncertainty propagation
- Real-time Processing - <5 second response time
๐ Next Steps
- โ Test basic functionality - Run the text examples
- โก Try image upload - If you have Tesseract installed
- ๐งช Run automated tests -
python tests/test_system.py - ๐ Collect data - Test with your own math problems
- ๐จ Customize - Modify weights, add more patterns
๐ Need Help?
Check the full README.md for:
- Detailed architecture
- API documentation
- Advanced configuration
- Deployment options
MVMยฒ - Making Mathematical Verification Multimodal
VNR VJIET Major Project 2025