Spaces:
Sleeping
Sleeping
Commit History
solves import error c238a70
non destructive dataset operations cf628aa
adds imports and forces monitoring 190d843
adds scheduler stuff and hopes for the best with track tonic d455d12
adds local and remote training monitors to config 71db310
improve dataset utils non destructive writes 924581c
adds parameters to medical config 81f39f1
adds parameters to medical config b4f1cb3
adds parameters to medical config c68717e
adds improved dev and system prompts to o1-medical fc29a51
adds improved launch for reasoning gpt-oss configs and new config for medical reasoning 0ded6bb
adds improved dataset utils in tracktonic b11b94b
adds optimizations for faster training 3331c7f
adds a100 memory optimized fa9560d
adds template files , adds non destructive dataset updates d47568c
adds flash attention 3 kernel cb276d8
adapt lr_scheduler according to trl version 976e218
sets min_lr 7f45871
increases batchsize and gradient accumulation steps in memory optimized 4e59f6d
increases max samples in memory optimized 8b56686
adds better launch.sh and eval / test splits auto 0fa6045
adds repoid only based on repo name, adds version-robust sfttrainer 665844a
coerce akk numeric config values to safe values c346dad
adds defensive programming (boo) and adaptations based on transformer versions 97dacc7
improves launcher with model family and defaults based on options, updates trl trainer , removes trl config paths by switching to trainingarguments class , tokenizer parameters updated to sfttrainer , resolves evaluation_strategy error 598357a
hide all tokens in logs, never persist to disk, remove max_seq_length from config, add to trainer eb9e91f
changes default model repo name + adds non default option c23e2f5
adds pythonpath based on chatgpt5 suggestion c3ab72f
adds harmony format , configurable gpt-oss parameters, launch.sh logic , improved templates for legml gpt-oss training, dynamic results directory and improve model pushing 59e57ff
adds single token logic read/write , adds gpt-oss demo space , adds spaces refactor , adds new version of track tonic , adds logic in launch.sh 75bcdb3
adds quantization configuration correctly c7cffbb
adds quantization configuration correctly fa7de39
adds memory optimized configuration 7181190
fixes custom trackio implementation dfcb060
adds gpt-oss support fcf2981
adds readme links at the top ce0d824
adds small readme improvements a8275b3
Update README.md 26641fd unverified
fixes typo in mermaid diagram links a552387
adds readme, removes quantization, adds readtoken logic, updates trackio , spaces 3c37508
cleanup a bit the files ad3b15d unverified
workaround for quantization and push 2432208 unverified
Tonic commited on
workaround for quantization and push b79fab9 unverified
Tonic commited on
adds quantize and push script e6ad96a unverified
Tonic commited on
solves model card formatting bug 41e9e02 unverified
Tonic commited on
solves model card formatting bug 2f866e6 unverified
Tonic commited on
Fix model recovery and deployment scripts - add safetensors support and Windows compatibility 22bb04c unverified
Tonic commited on
Fix model recovery and deployment scripts - add safetensors support and Windows compatibility fff73fc unverified
Tonic commited on
Fix model recovery and deployment scripts - add safetensors support and Windows compatibility d0d19b2 unverified
Tonic commited on