Merge branch 'main' of https://huggingface.co/spaces/Tonic/Petite-LLM-3 53fac62 Tonic committed on Aug 2, 2025
adds flash attention 2 as the attention implementation, or eager on cpu d749fc8 Tonic committed on Jul 30, 2025
last try to download the model at build time using the scripts 5a6251e Tonic committed on Jul 29, 2025
last try: download int4 model from subfolder using HfFileSystem 8dbd7e8 Tonic committed on Jul 29, 2025