Spaces:
Sleeping
Sleeping
Commit History
add configs to _init_.py
63ce7d6
solves import error
c238a70
non destructive dataset operations
cf628aa
adds imports and forces monitoring
190d843
adds scheduler stuff and hopes for the best with track tonic
d455d12
adds local and remote training monitors to config
71db310
improve dataset utils non destructive writes
924581c
adds parameters to medical config
81f39f1
adds parameters to medical config
b4f1cb3
adds parameters to medical config
c68717e
adds improved dev and system prompts to o1-medical
fc29a51
adds improved launch for reasoning gpt-oss configs and new config for medical reasoning
0ded6bb
adds improved dataset utils in tracktonic
b11b94b
adds optimizations for faster training
3331c7f
adds a100 memory optimized
fa9560d
adds template files , adds non destructive dataset updates
d47568c
adds flash attention 3 kernel
cb276d8
adapt lr_scheduler according to trl version
976e218
sets min_lr
7f45871
increases batchsize and gradient accumulation steps in memory optimized
4e59f6d
increases max samples in memory optimized
8b56686
adds better launch.sh and eval / test splits auto
0fa6045
adds repoid only based on repo name, adds version-robust sfttrainer
665844a
coerce akk numeric config values to safe values
c346dad
adds defensive programming (boo) and adaptations based on transformer versions
97dacc7
improves launcher with model family and defaults based on options, updates trl trainer , removes trl config paths by switching to trainingarguments class , tokenizer parameters updated to sfttrainer , resolves evaluation_strategy error
598357a
hide all tokens in logs, never persist to disk, remove max_seq_length from config, add to trainer
eb9e91f
changes default model repo name + adds non default option
c23e2f5
adds pythonpath based on chatgpt5 suggestion
c3ab72f
adds harmony format , configurable gpt-oss parameters, launch.sh logic , improved templates for legml gpt-oss training, dynamic results directory and improve model pushing
59e57ff
adds single token logic read/write , adds gpt-oss demo space , adds spaces refactor , adds new version of track tonic , adds logic in launch.sh
75bcdb3
adds quantization configuration correctly
c7cffbb
adds quantization configuration correctly
fa7de39
adds memory optimized configuration
7181190
fixes custom trackio implementation
dfcb060
adds gpt-oss support
fcf2981
adds readme links at the top
ce0d824
adds small readme improvements
a8275b3
Update README.md
26641fd
unverified
fixes typo in mermaid diagram links
a552387
adds readme, removes quantization, adds readtoken logic, updates trackio , spaces
3c37508
cleanup a bit the files
ad3b15d
unverified
workaround for quantization and push
2432208
unverified
Tonic
commited on
workaround for quantization and push
b79fab9
unverified
Tonic
commited on
adds quantize and push script
e6ad96a
unverified
Tonic
commited on
solves model card formatting bug
41e9e02
unverified
Tonic
commited on
solves model card formatting bug
2f866e6
unverified
Tonic
commited on
Fix model recovery and deployment scripts - add safetensors support and Windows compatibility
22bb04c
unverified
Tonic
commited on
Fix model recovery and deployment scripts - add safetensors support and Windows compatibility
fff73fc
unverified
Tonic
commited on