zjkarina's picture
Update README.md
4bfa0cb verified
---
license: mit
datasets:
- seniichev/amazon-fashion-2023-full
language:
- en
base_model:
- Qwen/Qwen2.5-7B-Instruct
---
# zjkarina/omniRecsysLLM_semanticIDsmodality
Recommendation model with semantic IDs for Amazon Fashion.
## Description
This model uses VQ-VAE to create semantic item IDs, enabling a more accurate understanding of semantic relationships between products.
## Architecture
- **Base model**: Qwen2.5-Omni-7B
- **Item vocabulary size**: 709,036
- **ID embedding dimension**: 512
- **VQ-VAE codebook size**: 10,000
- **VQ-VAE codebook dimension**: 256
- **Dataset**: Amazon Fashion 2023 Full
## Usage
```python
from any2any_trainer.models.recommendation import SemanticIDRecommendationModel
# Load model
model = SemanticIDRecommendationModel.from_pretrained("zjkarina/omniRecsysLLM_semanticIDsmodality")
# Generate recommendations with semantic IDs
recommendations = model.predict_next_item(
text="The user bought jeans and a t-shirt",
id_ids=[12345, 67890], # Item IDs from purchase history
top_k=5,
use_semantic_ids=True
)
```
## Training
The model was trained on the Amazon Fashion 2023 dataset using semantic IDs generated via VQ-VAE.