Commit History

Add gguf_file parameter to tokenizer loading and introduce diagnostic script for GGUF validation
6f81ff7

ndc8 commited on

Fix GGUF filename in environment variable and update comment in requirements
a2a4e98

ndc8 commited on

Add protobuf requirement for GGUF model loading
0c37db0

ndc8 commited on

Refactor application to implement GGUF backend with native transformers support; update requirements and add GGUF-specific entry point
6e96e6e

ndc8 commited on

Refactor model loading to utilize accelerate for device management; add test script to verify loading fix and prevent device conflicts
8a3c5dd

ndc8 commited on

Refactor application to use lightweight backend; update requirements and add memory analysis script for optimized model configuration
a4ee3a6

ndc8 commited on

Update Dockerfile and application entry point for GGUF backend; optimize memory usage in model parameters and requirements
358e717

ndc8 commited on

d
994c0b4

ndc8 commited on

Refactor backend service to support Gemma 3n model and update requirements; remove obsolete test script and add new dependency tests
4b4e9ed

ndc8 commited on

c
4f67c26

ndc8 commited on

b
557b035

ndc8 commited on

l
819080c

ndc8 commited on

a
6496777

ndc8 commited on

Add metadata section to README for project details
ae13708

ndc8 commited on

aa
65edee9

ndc8 commited on

Add scripts for converting and generating UltraChat-style SFT dataset
7ecd130

ndc8 commited on

update
91181f3

ndc8 commited on

Update Dockerfile: Add missing system dependencies for build
9fe463f

ndc8 commited on

rabbit-ed
3960f0f

ndc8 commited on

Cleanup: Remove unnecessary files and update .gitignore
78b611a

ndc8 commited on

update
4ecf54e

ndc8 commited on

chg model
375ade4

ndc8 commited on

change to adapter
0c9134e

ndc8 commited on

try
db8cd85

ndc8 commited on

upd
cb5d5f8

ndc8 commited on

update to use unsloth + mistral
172b424

ndc8 commited on

change model
8208c22

ndc8 commited on

try #1
1ba257c

ndc8 commited on

upd
8d9c495

ndc8 commited on

update
3239c69

ndc8 commited on

Fix model dependencies and warnings
97bafdb

ndc8 commited on

Update model to unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
c6f7b75

ndc8 commited on

Revert model back to gemma-3n-E4B-it-GGUF
88c3c6c

ndc8 commited on

Fix: Update to valid HuggingFace model and fix deprecation warnings
04d695c

ndc8 commited on

upd
2cd680b

ndc8 commited on

fix
255b6fc

ndc8 commited on

fix
84eb396

ndc8 commited on

update
e46cec3

ndc8 commited on

upd
68f41f4

ndc8 commited on

update
83df634

ndc8 commited on

fix
4599528

ndc8 commited on

Ignore Hugging Face cache directory
385d87b

ndc8 commited on

fix
e6598e6

ndc8 commited on

Add API response endpoint and tests
09c9042

ndc8 commited on

Set gemma-3n-E4B-it-GGUF as main model for all text generation endpoints
8d962fd

ndc8 commited on

Update app.py
1f4eabe
verified

cong182 commited on

πŸ”§ Fix HuggingFace Space compatibility
b6cf19e

ndc8 commited on

πŸš€ Add multimodal AI capabilities with image-text-to-text pipeline
4e10023

ndc8 commited on

Update space
d3ad561

ndc8 commited on

initial commit
7559c6d
verified

cong182 commited on