Commit History

fix: set kv_cache_dtype q4->float16 and torch_dtype float16
ce84ec6
verified

shreyask commited on

fix: add tokenizer_config with chat_template from base model
c22e42a
verified

shreyask commited on

add model_q4.onnx_data
0fb67a6
verified

shreyask commited on

add model_q4.onnx with correct external data ref
ee58b71
verified

shreyask commited on

fix: config with q4 dtype mapping
d99474e
verified

shreyask commited on

int4 WebGPU ONNX with patched external data refs
1fb46fb
verified

shreyask commited on

cleanup: remove fp16 files
5331d90
verified

shreyask commited on

fix: use full model config from merged model
549ce31
verified

shreyask commited on

fix: add fp16 dtype mapping to transformers.js_config
78d0126
verified

shreyask commited on

fp16 WebGPU ONNX (no quantization, compatible ops)
e059fae
verified

shreyask commited on

cleanup: remove old/duplicate ONNX files
bf6eb86
verified

shreyask commited on

fix: patch external data refs to underscore format (model.onnx_data)
66bbc2f
verified

shreyask commited on

fix: keep original ONNX filenames to match internal data references
09af0ba
verified

shreyask commited on

v3: WebGPU int4 ONNX from 6-epoch fine-tune (loss 1.059)
ecb6782
verified

shreyask commited on

Add generation_config.json
3ec92e5
verified

shreyask commited on

Restructure ONNX for Transformers.js compatibility (onnx/ subdir + config)
80ab998
verified

shreyask commited on

Add model card
aef9c84
verified

shreyask commited on

Upload ONNX int4 model via Xenova's LFM2 builder
0e2e2cb
verified

shreyask commited on

initial commit
f872498
verified

shreyask commited on