Commit History
adds flash attention 3 kernel cb276d8
adapt lr_scheduler according to trl version 976e218
sets min_lr 7f45871
increases batch size and gradient accumulation steps in memory optimized 4e59f6d
increases max samples in memory optimized 8b56686
adds better launch.sh and eval / test splits auto 0fa6045
adds repoid only based on repo name, adds version-robust sfttrainer 665844a
coerce all numeric config values to safe values c346dad
adds defensive programming (boo) and adaptations based on transformer versions 97dacc7
improves launcher with model family and defaults based on options, updates trl trainer, removes trl config paths by switching to trainingarguments class, tokenizer parameters updated to sfttrainer, resolves evaluation_strategy error 598357a
hide all tokens in logs, never persist to disk, remove max_seq_length from config, add to trainer eb9e91f
changes default model repo name + adds non default option c23e2f5
adds pythonpath based on chatgpt5 suggestion c3ab72f
adds harmony format, configurable gpt-oss parameters, launch.sh logic, improved templates for legml gpt-oss training, dynamic results directory and improve model pushing 59e57ff
adds single token logic read/write, adds gpt-oss demo space, adds spaces refactor, adds new version of track tonic, adds logic in launch.sh 75bcdb3
adds quantization configuration correctly c7cffbb
adds quantization configuration correctly fa7de39
adds memory optimized configuration 7181190
fixes custom trackio implementation dfcb060
adds gpt-oss support fcf2981
adds readme links at the top ce0d824
adds small readme improvements a8275b3
Update README.md 26641fd unverified
fixes typo in mermaid diagram links a552387
adds readme, removes quantization, adds readtoken logic, updates trackio, spaces 3c37508
clean up the files a bit ad3b15d unverified
workaround for quantization and push 2432208 unverified
Tonic committed on
workaround for quantization and push b79fab9 unverified
adds quantize and push script e6ad96a unverified
solves model card formatting bug 41e9e02 unverified
solves model card formatting bug 2f866e6 unverified
Fix model recovery and deployment scripts - add safetensors support and Windows compatibility 22bb04c unverified
Fix model recovery and deployment scripts - add safetensors support and Windows compatibility fff73fc unverified
Fix model recovery and deployment scripts - add safetensors support and Windows compatibility d0d19b2 unverified
matches experiment id for all metrics 08ed534 unverified
resolves the urls correctly c3f29a5 unverified
revert file rename 8cfe86a unverified
use gradio for better connection with the spaces f251d3d unverified
adds more compatibility with trl 1919b3b unverified
adds update attribute for trl compatibility bug fix 5fe0328 unverified
adds update attribute for trl compatibility 764a584 unverified
adds config attribute for trl compatibility fbc0479 unverified
adds default values to experiment name dbb337d unverified
adds monkey patch for trackio monitoring in torch and readme creator improvements 39db0ca unverified
fixes linter errors in setup hf dataset 2df26a0 unverified
fixes monitoring c61ed6b unverified
fixes variable cases sft/dpo d7d1377 unverified
fixes authentication 235d769 unverified
adds automation for hf cli using token 5d7656c unverified