YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Retail World Model

A transformer-based world model for retail sales prediction, trained on the Favorita store sales dataset.

Architecture

  • Encoder: T5 Encoder processes historical retail time-series
  • Latent Dynamics: LSTM-based world model that imagines future states
  • Decoder: Predicts future sales with probabilistic forecasting (mean + variance)
  • Fine-tuning: LoRA adapters on encoder attention layers

Features

  • Context Window: 60 days of historical sales data
  • Prediction Horizon: 14 days of future sales forecasts
  • Inputs: Sales history, promotion flags, day-of-week, month, product family
  • Output: Probabilistic predictions with 90% confidence intervals

Usage

Training

python train_v4.py

Inference

python inference.py --checkpoint /path/to/model --history history.csv --output predictions.csv

Dataset

Model

  • Base: T5 Encoder (tiny)
  • Parameters: ~30M base + LoRA adapters
  • Loss: Negative log-likelihood (probabilistic forecasting)

References

  • Chronos-2: Universal Time Series Forecasting (Amazon Science)
  • iTransformer: Inverted Transformers for Time Series (NeurIPS 2023)
  • TabFormer: Tabular Transformers for Sequential Data (IBM Research)
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support