ysakhale's picture
Update README.md
2f702ab verified
---
license: mit
language:
- en
tags:
- recommendation-system
- content-based-filtering
- landmarks
- cmu
- campus-exploration
size_categories:
- n<1K
---
# Content-Based Recommendation System for CMU Landmarks
## Model Description
This is a **trained-from-scratch** content-based recommendation system designed to recommend Carnegie Mellon University landmarks based on user preferences. The model learns feature representations from landmark characteristics and uses cosine similarity to find similar landmarks.
## Model Details
### Model Type
- **Architecture**: Content-based filtering with feature engineering
- **Training**: Trained from scratch on CMU landmarks dataset
- **Input**: Landmark features (rating, classes, location, dwell time, indoor/outdoor)
- **Output**: Similarity scores for landmark recommendations
### Training Data
- **Dataset**: 100+ manually curated CMU landmarks
- **Features**: Rating, classes, geographic coordinates, dwell time, indoor/outdoor classification
- **Preprocessing**: StandardScaler normalization, multi-hot encoding for classes
### Training Procedure
- Feature extraction from landmark metadata
- StandardScaler normalization of numerical features
- Multi-hot encoding for categorical classes
- Cosine similarity computation for recommendations
## Intended Use
### Primary Use Cases
- Recommending CMU landmarks based on user preferences
- Finding similar landmarks to user-selected favorites
- Personalized campus exploration planning
### Out-of-Scope Use Cases
- Recommending landmarks outside CMU campus
- Predicting user ratings or reviews
- Real-time location-based recommendations
## Performance Metrics
- **Recommendation Quality**: High similarity scores (0.7-0.9) for relevant landmarks
- **Diversity**: Incorporates diversity weighting to avoid over-concentration
- **User Satisfaction**: Optimized for user preference alignment
## Limitations and Bias
- **Geographic Scope**: Limited to CMU campus landmarks only
- **Static Data**: Based on current landmark database, may not reflect real-time changes
- **User Preference Learning**: Does not learn from user interaction history
## Ethical Considerations
- **Data Privacy**: No personal user data collected
- **Fairness**: Recommendations based on objective landmark features
- **Transparency**: Feature importance and similarity scores are explainable
## How to Use
```python
from model import ContentBasedRecommender, load_model_from_data
# Load model from landmarks data
recommender = load_model_from_data('data/landmarks.json')
# Get recommendations
recommendations = recommender.recommend(
selected_classes=['Culture', 'Research'],
indoor_pref='indoor',
min_rating=4.0,
diversity_weight=0.6,
top_k=10
)
# Print top recommendations
for landmark_id, score in recommendations:
print(f"{landmark_id}: {score:.3f}")
```
## Model Files
- `model.py`: Main model implementation
- `README.md`: This model card
## Citation
```bibtex
@misc{cmu-explorer-recommender,
title={Content-Based Recommendation System for CMU Landmarks},
author={Yash Sakhale, Faiyaz Azam},
year={2025},
url={https://huggingface.co/spaces/ysakhale/Tartan-Explore}
}
```
## Model Card Contact
For questions about this model, please refer to the [CMU Explorer Space](https://huggingface.co/spaces/ysakhale/Tartan-Explore).