Commit History

fix: Switch ERNIE 21B IQ2_XXS to bartowski repo
c9df672

Luigi committed on

fix: Correct IQ2_XXS filename pattern for ERNIE 21B
4d1f944

Luigi committed on

feat: Add ERNIE 21B IQ2_XS variant - more stable quantization
813e74d

Luigi committed on

feat: Add thinking buffer for reasoning models - extends n_ctx by 50% of max_tokens when reasoning enabled
3893e85

Luigi committed on

fix: Make download component visible for JSON export
01620d7

Luigi committed on

fix: Download button now works - removed broken .then() chain
7df8d69

Luigi committed on

fix: Disable download button until content exists - prevent empty download field UX issue
79cb8cd

Luigi committed on

feat: Add comprehensive generation metrics to JSON export
c0889b4

Luigi committed on

Fix: Move update_reasoning_visibility to module level - was accidentally defined inside load_model function
451a43d

Luigi committed on

Invalidate Docker cache to force fresh app.py copy
a3a4921

Luigi committed on

Fix: Remove duplicate return statement in get_model_info
23d5038

Luigi committed on

UI/UX improvements: language selector, dynamic reasoning, copy buttons, adjustable temperature
6d54864

Luigi committed on

Add bilingual support: English and Traditional Chinese (zh-TW)
e78283f

Luigi committed on

Update CLAUDE.md with current implementation details
6be837f

Luigi committed on

Adjust ERNIE-4.5 21B Thinking generation settings for better summarization
ae53447

Luigi committed on

Update AGENTS.md with Gradio app coverage and deployment info
ddec8de

Luigi committed on

Add ERNIE-4.5-21B-Thinking (Q1_0) to model registry
80ca4af

Luigi and Claude Sonnet 4.5 committed on

Reorder models by total parameter count (smallest to largest)
bc08390

Luigi and Claude Sonnet 4.5 committed on

Add 3 new models: Granite 4.0 Tiny 7B, LFM2 2.6B Transcript, Qwen3 4B Thinking
1f857e7

Luigi and Claude Sonnet 4.5 committed on

Add model-specific inference settings and dynamic UI controls
8937994

Luigi committed on

Remove DeepSeek-R1 1.5B and add repeat penalty to prevent loops
2fa67ca

Luigi committed on

Add 3 new 1.5B/1.7B models with Q4 quantization and reasoning toggle
56794dd

Luigi committed on

Upgrade model precision to Q8_0 for better quality
bcda366

Luigi committed on

Remove Qwen3 Q2 model (low quality)
f98d497

Luigi committed on

Add multi-model selection with CPU optimizations
6716c6a

Luigi committed on

Fix: Restore tag-based thinking parser, remove slow marker scanning
5f1c65f

Luigi and Claude Opus 4.5 committed on

Major UI/UX improvements - Enhanced interface with modern design
8f0478f

Luigi committed on

Major UI/UX improvements
a54f798

Luigi committed on

Fix: Stream thinking tokens live instead of showing all at once
96d104f

Luigi and Claude Opus 4.5 committed on

Fix: Separate thinking and summary fields during streaming
7b4eca1

Luigi and Claude Sonnet 4.5 committed on

Fix: Support both <think> and <thinking> tag formats in parser
62499af

Luigi and Claude Sonnet 4.5 committed on

Test: Verify git workflow preserves commit messages
fc5ac33

Luigi committed on

Test commit: verify git push workflow to HF Spaces
b5d5f43

Luigi committed on

Add CLAUDE.md with comprehensive repository documentation
71a479f

Luigi and Claude Sonnet 4.5 committed on

Fix error handler yield types to match Gradio outputs
0bd3745

Luigi committed on

Fix streaming: show all output in thinking field, parsed in summary
1cb8ef5

Luigi committed on

Stream thinking token-by-token in real-time
4bee6d7

Luigi committed on

UI improvements: separate thinking/summary fields + markdown rendering
682522a

Luigi committed on

Add missing parse_thinking_blocks function
b27d02e

Luigi committed on

Remove show_copy_button parameter (not supported in Gradio 5.x)
6122a55

Luigi committed on

Fix streaming output issues
af2dd42

Luigi committed on

Add model info display to UI
0593fe6

Luigi committed on

Increase context window from 4096 to 32768 tokens
a16152d

Luigi committed on

Fix runtime error: add libgomp1 for OpenMP support
65e6b09

Luigi committed on

Use prebuilt llama-cpp-python wheel for faster builds
b3ec254

Luigi committed on

Exclude transcripts from HF Spaces deployment
2ca5026

Luigi committed on

Add HuggingFace Spaces demo with Gradio UI
10d339c

Luigi committed on

comprehensive model benchmark: 6 models evaluated for transcript summarization
f175554

Luigi and Claude Sonnet 4.5 committed on

separate Qwen3 thinking blocks into thinking.txt and summary.txt
8a9d263

Luigi and Claude Sonnet 4.5 committed on

add to zh-TW post-processing
3ec1246

Luigi committed on
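
Note on commit 3893e85: the message describes a "thinking buffer" that extends n_ctx by 50% of max_tokens when reasoning is enabled. The snippet below is only a minimal sketch of that arithmetic, not the code from app.py. The function name context_size and the parameter names base_n_ctx and reasoning are hypothetical, and rounding the buffer down to a whole token is an assumption.

```python
def context_size(base_n_ctx: int, max_tokens: int, reasoning: bool) -> int:
    """Return the context window to request from the model.

    Hypothetical helper: when reasoning is enabled, reserve an extra
    50% of max_tokens so thinking tokens do not crowd out the answer.
    """
    # Integer division rounds the buffer down (an assumption of this sketch).
    thinking_buffer = max_tokens // 2 if reasoning else 0
    return base_n_ctx + thinking_buffer


if __name__ == "__main__":
    # With a 32768-token window and max_tokens=2048, the buffer is 1024.
    print(context_size(32768, 2048, reasoning=True))
    print(context_size(32768, 2048, reasoning=False))
```

With reasoning disabled the window is left unchanged, so non-reasoning models do not pay any extra memory cost for the buffer.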