Spaces:

chmielvu
/

general-reasoning-agent

Sleeping

App Files Files Community

general-reasoning-agent

Commit History

repurpose: HelpingAI/HELVETE-3B (nsfw-3b-q4_k_m.gguf)

6a6a4e3
verified

chmielvu commited on 17 days ago

repurpose: mradermacher/Gemma-3-Prompt-Coder-270m-it-Uncensored-GGUF (Gemma-3-Prompt-Coder-270m-it-Uncensored.Q4_K_S.gguf)

0e20523
verified

chmielvu commited on 17 days ago

repurpose: HelpingAI/HELVETE-3B (nsfw-3b-q4_k_m.gguf)

3adc25f
verified

chmielvu commited on 17 days ago

Update README.md

d0d5291
verified

chmielvu commited on 23 days ago

Update requirements.txt

4fffb1c
verified

chmielvu commited on 23 days ago

Update requirements.txt

4649786
verified

chmielvu commited on 23 days ago

Update README.md

ff13da7
verified

chmielvu commited on 23 days ago

Update requirements.txt

fdef4bc
verified

chmielvu commited on 23 days ago

fix: bypass from_pretrained path bug with hf_hub_download

c642bd7
verified

chmielvu commited on 23 days ago

fix: separate draft_model loading to avoid path construction bug

d68f000
verified

chmielvu commited on 23 days ago

fix: correct model filenames for SmolLM3 Q4_K_S and SmolLM2 draft model

b61af84
verified

chmielvu commited on 23 days ago

feat: add production refinements (Phase 1-3)

4454066
verified

chmielvu Claude Sonnet 4.5 commited on 23 days ago

SmolLM3-3B-Instruct streaming

a534bdc
verified

chmielvu commited on 26 days ago

Fix: Gradio 6.0 compatible ChatInterface with streaming

785124e
verified

chmielvu commited on 26 days ago

Fix: Gradio 6.0 compatibility - remove deprecated theme param and show_copy_button

2aa6a26
verified

chmielvu commited on 26 days ago

Major fix: Switch to transformers (no llama-cpp build). Use Qwen2.5-3B for fast CPU inference

371ac0a
verified

chmielvu commited on 26 days ago

Fix: Switch to standard llama-cpp-python package (remove editable install)

5371df2
verified

chmielvu commited on 26 days ago

Fix: Use editable git install for llama-cpp-python, remove version pins

c840812
verified

chmielvu commited on 26 days ago

Fix: Use prebuilt llama-cpp-python wheels, pin dependencies, extend timeout

da3ea1b
verified

chmielvu commited on 27 days ago

Initial deployment: General Reasoning Agent

f6f0051
verified

chmielvu commited on 27 days ago

initial commit

b4505ee
verified

chmielvu commited on 27 days ago

Commit History

repurpose: HelpingAI/HELVETE-3B (nsfw-3b-q4_k_m.gguf) 6a6a4e3 verified

repurpose: mradermacher/Gemma-3-Prompt-Coder-270m-it-Uncensored-GGUF (Gemma-3-Prompt-Coder-270m-it-Uncensored.Q4_K_S.gguf) 0e20523 verified

repurpose: HelpingAI/HELVETE-3B (nsfw-3b-q4_k_m.gguf) 3adc25f verified

Update README.md d0d5291 verified

Update requirements.txt 4fffb1c verified

Update requirements.txt 4649786 verified

Update README.md ff13da7 verified

Update requirements.txt fdef4bc verified

fix: bypass from_pretrained path bug with hf_hub_download c642bd7 verified

fix: separate draft_model loading to avoid path construction bug d68f000 verified

fix: correct model filenames for SmolLM3 Q4_K_S and SmolLM2 draft model b61af84 verified

feat: add production refinements (Phase 1-3) 4454066 verified

SmolLM3-3B-Instruct streaming a534bdc verified

Fix: Gradio 6.0 compatible ChatInterface with streaming 785124e verified

Fix: Gradio 6.0 compatibility - remove deprecated theme param and show_copy_button 2aa6a26 verified

Major fix: Switch to transformers (no llama-cpp build). Use Qwen2.5-3B for fast CPU inference 371ac0a verified

Fix: Switch to standard llama-cpp-python package (remove editable install) 5371df2 verified

Fix: Use editable git install for llama-cpp-python, remove version pins c840812 verified

Fix: Use prebuilt llama-cpp-python wheels, pin dependencies, extend timeout da3ea1b verified

Initial deployment: General Reasoning Agent f6f0051 verified

initial commit b4505ee verified

repurpose: HelpingAI/HELVETE-3B (nsfw-3b-q4_k_m.gguf)

6a6a4e3
verified

repurpose: mradermacher/Gemma-3-Prompt-Coder-270m-it-Uncensored-GGUF (Gemma-3-Prompt-Coder-270m-it-Uncensored.Q4_K_S.gguf)

0e20523
verified

repurpose: HelpingAI/HELVETE-3B (nsfw-3b-q4_k_m.gguf)

3adc25f
verified

Update README.md

d0d5291
verified

Update requirements.txt

4fffb1c
verified

Update requirements.txt

4649786
verified

Update README.md

ff13da7
verified

Update requirements.txt

fdef4bc
verified

fix: bypass from_pretrained path bug with hf_hub_download

c642bd7
verified

fix: separate draft_model loading to avoid path construction bug

d68f000
verified

fix: correct model filenames for SmolLM3 Q4_K_S and SmolLM2 draft model

b61af84
verified

feat: add production refinements (Phase 1-3)

4454066
verified

SmolLM3-3B-Instruct streaming

a534bdc
verified

Fix: Gradio 6.0 compatible ChatInterface with streaming

785124e
verified

Fix: Gradio 6.0 compatibility - remove deprecated theme param and show_copy_button

2aa6a26
verified

Major fix: Switch to transformers (no llama-cpp build). Use Qwen2.5-3B for fast CPU inference

371ac0a
verified

Fix: Switch to standard llama-cpp-python package (remove editable install)

5371df2
verified

Fix: Use editable git install for llama-cpp-python, remove version pins

c840812
verified

Fix: Use prebuilt llama-cpp-python wheels, pin dependencies, extend timeout

da3ea1b
verified

Initial deployment: General Reasoning Agent

f6f0051
verified

initial commit

b4505ee
verified