Commit History

Production-ready training script with all robustness improvements
157a91d

smarthillc committed on

Robust fix for LoRA training - disable gradient checkpointing and fix gradient flow
054d619

smarthillc committed on

Ensure train.py uses eval_strategy not evaluation_strategy
962e533

smarthillc committed on

Add debug output to capture training errors
b1b635d

smarthillc committed on

Restore training app with fixed train.py
61b3c92

smarthillc committed on

Fix GPU device placement issue
b2281c5

smarthillc committed on

Add diagnostic app to identify training issues
7c64e26

smarthillc committed on

Add debug output to capture training errors
1d37b91

smarthillc committed on

Fix Gradio Timer issue - use manual refresh instead
a798baf

smarthillc committed on

Add training app with Flan-T5 implementation and datasets
d96cedb

smarthillc committed on

initial commit
0e386ca

aoisfhdugbos committed on