omniRecsysLLM_semanticIDsmodality / README.md

zjkarina

Update README.md

4bfa0cb verified 4 months ago

preview code

raw

history blame contribute delete

1.18 kB

metadata

license: mit
datasets:
  - seniichev/amazon-fashion-2023-full
language:
  - en
base_model:
  - Qwen/Qwen2.5-7B-Instruct

zjkarina/omniRecsysLLM_semanticIDsmodality

Recommendation model with semantic IDs for Amazon Fashion.

Description

This model uses VQ-VAE to create semantic item IDs, enabling a more accurate understanding of semantic relationships between products.

Architecture

Base model: Qwen2.5-Omni-7B
Item vocabulary size: 709,036
ID embedding dimension: 512
VQ-VAE codebook size: 10,000
VQ-VAE codebook dimension: 256
Dataset: Amazon Fashion 2023 Full

Usage

from any2any_trainer.models.recommendation import SemanticIDRecommendationModel

# Load model
model = SemanticIDRecommendationModel.from_pretrained("zjkarina/omniRecsysLLM_semanticIDsmodality")

# Generate recommendations with semantic IDs
recommendations = model.predict_next_item(
    text="The user bought jeans and a t-shirt",
    id_ids=[12345, 67890],  # Item IDs from purchase history
    top_k=5,
    use_semantic_ids=True
)

Training

The model was trained on the Amazon Fashion 2023 dataset using semantic IDs generated via VQ-VAE.