Commit History

docs: update model card for Marziel OS v1.0.1
58dbcad
verified

efops commited on

docs: update model card for Marziel OS v1.0.1 (take 4)
e0bedd5
verified

efops commited on

docs: update model card for Marziel OS v1.0.1
aa2c070
verified

efops commited on

Update README.md
143cb29
verified

efops commited on

revert accidental readme overwrite and insert v0.9.2 engine notes
eee5727

Efkan commited on

Upload README.md with huggingface_hub
0bad2c0
verified

efops commited on

Update model card for v0.9.0 with Hybrid RAG and TurboQuant KV cache details
186270e
verified

efops commited on

Delete marziel-8b-custom.gguf with huggingface_hub
9c9e653
verified

efops commited on

Upload README.md with huggingface_hub
9713b86
verified

efops commited on

Upload folder using huggingface_hub
a2619c2
verified

efops commited on

Upload marziel-v6-Q4_K_M.gguf with huggingface_hub
161e2b7
verified

efops commited on

Upload README.md with huggingface_hub
8ff59da
verified

efops commited on

Remove old Llama model files
3206819
verified

efops commited on

Remove old Llama model files
53c1ba9
verified

efops commited on

Remove old Llama model files
caa3812
verified

efops commited on

Remove old Llama model files
2441080
verified

efops commited on

Remove old Llama model files
e951f67
verified

efops commited on

Remove old Llama model files
147871c
verified

efops commited on

Remove old Llama model files
e7ede0a
verified

efops commited on

Remove old Llama model files
38d141f
verified

efops commited on

Remove old Llama model files
11c052e
verified

efops commited on

Remove old Llama model files
746d664
verified

efops commited on

v0.7.0: MLX 4-bit model for Apple Silicon (4.2GB, 4.5 bpw)
b2ba3a8
verified

efops commited on

Remove old chat_template.jinja — replaced by new v0.7.0 model
2616cd2
verified

efops commited on

Remove old recipe.yaml — replaced by new v0.7.0 model
6e54159
verified

efops commited on

Remove old model.safetensors — replaced by new v0.7.0 model
2455e45
verified

efops commited on

v0.7.0: Update model card
9c8caa9
verified

efops commited on

v0.7.0: Merged fp16 safetensors — v5 model
d786cbc
verified

efops commited on

v0.7.0: GGUF Q4_K_M — v5 model with new capabilities
14c3e01
verified

efops commited on

v0.6.0: auto-install semantic router
d9b73ad
verified

efops commited on

Restore original fused float16 model.safetensors for MLX (4.2GB)
0f60d3e
verified

efops commited on

Remove GPTQ model.safetensors.index.json
3fba9c1
verified

efops commited on

Remove GPTQ model-00002-of-00002.safetensors
5c95498
verified

efops commited on

Remove GPTQ model-00001-of-00002.safetensors
9ab0e79
verified

efops commited on

v0.5.9: semantic intent routing
9cd2a84
verified

efops commited on

v0.5.8: 3-tier inference (MLX/vLLM/llama.cpp)
be9c653
verified

efops commited on

v0.5.8: GPTQ W4A16 quantized model for vLLM CPU (~4GB)
6f37080
verified

efops commited on

Delete model.safetensors.index.json with huggingface_hub
3ccfa93
verified

efops commited on

Delete model-00004-of-00004.safetensors with huggingface_hub
7829d1c
verified

efops commited on

Delete model-00003-of-00004.safetensors with huggingface_hub
8a3dca8
verified

efops commited on

Delete model-00002-of-00004.safetensors with huggingface_hub
82c8c01
verified

efops commited on

Delete model-00001-of-00004.safetensors with huggingface_hub
2a6918f
verified

efops commited on

Delete model.safetensors with huggingface_hub
a356bcf
verified

efops commited on

v0.5.8: Replace MLX-quantized with proper dequantized safetensors for llm-compressor
a692ea7
verified

efops commited on

Fix config.json: remove invalid GGML quantization fields
37ba332
verified

efops commited on

v0.5.7
49979de
verified

efops commited on

Fix tokenizer_class: TokenizersBackend → PreTrainedTokenizerFast
47fef26
verified

efops commited on

v0.5.6
5385906
verified

efops commited on

v0.5.5
c4e795f
verified

efops commited on

v0.5.4: vLLM CPU pre-built wheel, bfloat16, TCMalloc
6996b5a
verified

efops commited on