Add finetune.py - finetune_domain_model (HF Trainer Pattern A, auto tabular_features passthrough) 46a6d37 verified rtferraz committed 9 days ago
Phase 2D: Fine-tuning pipeline - DomainFinetuneDataset, finetune_domain_model, 139 total tests passing 256963c verified rtferraz committed 9 days ago
Add pretrain.py - pretrain_domain_model with HF Trainer, cosine schedule, DataCollatorForLanguageModeling 6ccb9e6 verified rtferraz committed 9 days ago
Add data_pipeline.py - tokenize_user_sequences, pack_sequences, prepare_clm_dataset 1dfd4e2 verified rtferraz committed 9 days ago
Phase 2C: Pre-training pipeline - data pipeline, sequence packing, HF Trainer CLM, 124 total tests passing 28118c7 verified rtferraz committed 9 days ago