fix: unidrive vla quantization, oom failure during calib. add dynamic loading of calibration data, add max_pixel to limit mass loading. 945cba1 richard.lin commited on 5 days ago
fix: remove ONNX_NVFP4 model for UniDriveVLA_Nusc_Base_Stage1 model. 9403cf0 richard.lin commited on 9 days ago
fix: exclude QKV, attention, MatMuls, LayerNorm, MatMuls layers for int8 quant. 570b630 richard.lin commited on 23 days ago
feat: add prepare cloud env script. optimize script for limit network env. cb154ba richard.lin commited on 24 days ago
feat: save engine and calibration function, model quantize function. 5f63376 richard.lin commited on 26 days ago
add: model infer, model eval, within HuggingFace APIs, hello db9ebae richard.lin commited on about 1 month ago