domainTokenizer / notebooks

Commit History

Fix label leakage: temporal split - use first 70% of events as input, predict purchase in last 30%. Remove n_purchases/purchase_rate from features.
e4d8561
verified

rtferraz committed on

Fix model loading: use from_pretrained() instead of torch.load() for safetensors format
165b138
verified

rtferraz committed on

Add 03_ecommerce_finetune.ipynb - next-purchase prediction with JointFusion, LightGBM baseline comparison
857ec9a
verified

rtferraz committed on

Update 02_ecommerce notebook: add HF login, memory-free cell, subsample option for <64GB RAM machines
2410b7e
verified

rtferraz committed on

Add 02_ecommerce_pretrain.ipynb - REES46 e-commerce pre-training with sequential entropy check, wandb, push to hub
d60868a
verified

rtferraz committed on

Fix notebook: total_mem → total_memory, add hub_model_id push, add wandb logging support
65ecf7e
verified

rtferraz committed on

Add 01_finance_pretrain.ipynb - Phase 3.1 notebook for pre-training on 5M Nigerian financial transactions
2c3ddfa
verified

rtferraz committed on
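
Note on the label-leakage fix (e4d8561): the temporal split that commit describes could look roughly like the sketch below. This is not the notebook's code. The function names, the `(timestamp, event_type)` tuple layout, and the `"view"`/`"cart"`/`"purchase"` event types are assumptions made for illustration. Only the 70/30 split and the removal of purchase-derived features come from the commit message.

```python
from typing import Dict, List, Tuple

Event = Tuple[int, str]  # (timestamp, event_type); layout is an assumption


def temporal_split(events: List[Event], input_frac: float = 0.7) -> Tuple[List[Event], int]:
    """Split one user's events by time: the earliest part is model input,
    the label says whether a purchase occurs in the remaining part."""
    ordered = sorted(events, key=lambda e: e[0])
    cut = int(len(ordered) * input_frac)
    history, future = ordered[:cut], ordered[cut:]
    label = int(any(etype == "purchase" for _, etype in future))
    return history, label


def history_features(history: List[Event]) -> Dict[str, int]:
    """Features from the input window only. Purchase counts and rates are
    deliberately left out, mirroring the removal named in the commit."""
    return {
        "n_events": len(history),
        "n_views": sum(1 for _, t in history if t == "view"),
        "n_carts": sum(1 for _, t in history if t == "cart"),
    }


if __name__ == "__main__":
    user_events = [(t, "view") for t in range(8)] + [(8, "cart"), (9, "purchase")]
    history, label = temporal_split(user_events)
    print(history_features(history), label)
```

Because the label is computed only from events after the cut, and the features only from events before it, no information about the predicted purchase reaches the model input.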