hackathongenai / technical_overview.md
kushal2006's picture
Upload 15 files
3083de5 verified
# Technical Architecture
## Core Components
1. **Resume/JD Parser**: PyMuPDF, python-docx, spaCy
2. **Semantic Engine**: sentence-transformers, FAISS, cosine similarity
3. **Fuzzy Matcher**: RapidFuzz for skill variations
4. **LLM Integration**: OpenRouter + Grok for intelligent analysis
5. **Scoring Engine**: TF-IDF, weighted algorithms
6. **Web Interface**: FastAPI backend, Streamlit frontend
## Data Flow
1. File Upload β†’ Text Extraction
2. NLP Processing β†’ Entity Extraction
3. Multi-Stage Analysis:
- Hard Match (TF-IDF + Keywords)
- Semantic Match (Embeddings + Cosine)
- Fuzzy Match (Skill Variations)
- LLM Analysis (Context Understanding)
4. Weighted Scoring β†’ Final Verdict
5. Recommendations Generation β†’ Export Report
## Scalability Features
- RESTful API design
- Async processing
- Vector database integration
- Modular architecture
- Cloud deployment ready