Spaces:

Luigi
/

tiny-scribe

Runtime error

App Files Files Community

tiny-scribe

Commit History

fix: Use official ERNIE 4.5 generation settings

bae5a29

Luigi commited on Feb 1

feat: Replace ERNIE 21B with IQ3_XXS from mradermacher

d335258

Luigi commited on Feb 1

fix: Switch ERNIE 21B IQ2_XXS to bartowski repo

c9df672

Luigi commited on Feb 1

fix: Correct IQ2_XXS filename pattern for ERNIE 21B

4d1f944

Luigi commited on Feb 1

feat: Add ERNIE 21B IQ2_XS variant - more stable quantization

813e74d

Luigi commited on Feb 1

feat: Add thinking buffer for reasoning models - extends n_ctx by 50% of max_tokens when reasoning enabled

3893e85

Luigi commited on Feb 1

fix: Make download component visible for JSON export

01620d7

Luigi commited on Feb 1

fix: Download button now works - removed broken .then() chain

7df8d69

Luigi commited on Feb 1

fix: Disable download button until content exists - prevent empty download field UX issue

79cb8cd

Luigi commited on Feb 1

feat: Add comprehensive generation metrics to JSON export

c0889b4

Luigi commited on Feb 1

Fix: Move update_reasoning_visibility to module level - was accidentally defined inside load_model function

451a43d

Luigi commited on Feb 1

Invalidate Docker cache to force fresh app.py copy

a3a4921

Luigi commited on Feb 1

Fix: Remove duplicate return statement in get_model_info

23d5038

Luigi commited on Feb 1

UI/UX improvements: language selector, dynamic reasoning, copy buttons, adjustable temperature

6d54864

Luigi commited on Feb 1

Add bilingual support: English and Traditional Chinese (zh-TW)

e78283f

Luigi commited on Jan 31

Update CLAUDE.md with current implementation details

6be837f

Luigi commited on Jan 31

Adjust ERNIE-4.5 21B Thinking generation settings for better summarization

ae53447

Luigi commited on Jan 31

Update AGENTS.md with Gradio app coverage and deployment info

ddec8de

Luigi commited on Jan 31

Add ERNIE-4.5-21B-Thinking (Q1_0) to model registry

80ca4af

Luigi Claude Sonnet 4.5 commited on Jan 31

Reorder models by total parameter count (smallest to largest)

bc08390

Luigi Claude Sonnet 4.5 commited on Jan 31

Add 3 new models: Granite 4.0 Tiny 7B, LFM2 2.6B Transcript, Qwen3 4B Thinking

1f857e7

Luigi Claude Sonnet 4.5 commited on Jan 31

Add model-specific inference settings and dynamic UI controls

8937994

Luigi commited on Jan 31

Remove DeepSeek-R1 1.5B and add repeat penalty to prevent loops

2fa67ca

Luigi commited on Jan 31

Add 3 new 1.5B/1.7B models with Q4 quantization and reasoning toggle

56794dd

Luigi commited on Jan 31

Upgrade model precision to Q8_0 for better quality

bcda366

Luigi commited on Jan 31

Remove Qwen3 Q2 model (low quality)

f98d497

Luigi commited on Jan 31

Add multi-model selection with CPU optimizations

6716c6a

Luigi commited on Jan 31

Fix: Restore tag-based thinking parser, remove slow marker scanning

5f1c65f

Luigi Claude Opus 4.5 commited on Jan 30

Major UI/UX improvements - Enhanced interface with modern design

8f0478f

Luigi commited on Jan 30

Major UI/UX improvements

a54f798

Luigi commited on Jan 30

Fix: Stream thinking tokens live instead of showing all at once

96d104f

Luigi Claude Opus 4.5 commited on Jan 30

Fix: Separate thinking and summary fields during streaming

7b4eca1

Luigi Claude Sonnet 4.5 commited on Jan 30

Fix: Support both <think> and <thinking> tag formats in parser

62499af

Luigi Claude Sonnet 4.5 commited on Jan 30

Test: Verify git workflow preserves commit messages

fc5ac33

Luigi commited on Jan 30

Test commit: verify git push workflow to HF Spaces

b5d5f43

Luigi commited on Jan 30

Add CLAUDE.md with comprehensive repository documentation

71a479f

Luigi Claude Sonnet 4.5 commited on Jan 30

Fix error handler yield types to match Gradio outputs

0bd3745

Luigi commited on Jan 30

Fix streaming: show all output in thinking field, parsed in summary

1cb8ef5

Luigi commited on Jan 30

Stream thinking token-by-token in real-time

4bee6d7

Luigi commited on Jan 30

UI improvements: separate thinking/summary fields + markdown rendering

682522a

Luigi commited on Jan 30

Add missing parse_thinking_blocks function

b27d02e

Luigi commited on Jan 30

Remove show_copy_button parameter (not supported in Gradio 5.x)

6122a55

Luigi commited on Jan 30

Fix streaming output issues

af2dd42

Luigi commited on Jan 30

Add model info display to UI

0593fe6

Luigi commited on Jan 30

Increase context window from 4096 to 32768 tokens

a16152d

Luigi commited on Jan 30

Fix runtime error: add libgomp1 for OpenMP support

65e6b09

Luigi commited on Jan 30

Use prebuilt llama-cpp-python wheel for faster builds

b3ec254

Luigi commited on Jan 30

Exclude transcripts from HF Spaces deployment

2ca5026

Luigi commited on Jan 30

Add HuggingFace Spaces demo with Gradio UI

10d339c

Luigi commited on Jan 30

comprehensive model benchmark: 6 models evaluated for transcript summarization

f175554

Luigi Claude Sonnet 4.5 commited on Jan 30

Commit History

fix: Use official ERNIE 4.5 generation settings bae5a29

feat: Replace ERNIE 21B with IQ3_XXS from mradermacher d335258

fix: Switch ERNIE 21B IQ2_XXS to bartowski repo c9df672

fix: Correct IQ2_XXS filename pattern for ERNIE 21B 4d1f944

feat: Add ERNIE 21B IQ2_XS variant - more stable quantization 813e74d

feat: Add thinking buffer for reasoning models - extends n_ctx by 50% of max_tokens when reasoning enabled 3893e85

fix: Make download component visible for JSON export 01620d7

fix: Download button now works - removed broken .then() chain 7df8d69

fix: Disable download button until content exists - prevent empty download field UX issue 79cb8cd

feat: Add comprehensive generation metrics to JSON export c0889b4

Fix: Move update_reasoning_visibility to module level - was accidentally defined inside load_model function 451a43d

Invalidate Docker cache to force fresh app.py copy a3a4921

Fix: Remove duplicate return statement in get_model_info 23d5038

UI/UX improvements: language selector, dynamic reasoning, copy buttons, adjustable temperature 6d54864

Add bilingual support: English and Traditional Chinese (zh-TW) e78283f

Update CLAUDE.md with current implementation details 6be837f

Adjust ERNIE-4.5 21B Thinking generation settings for better summarization ae53447

Update AGENTS.md with Gradio app coverage and deployment info ddec8de

Add ERNIE-4.5-21B-Thinking (Q1_0) to model registry 80ca4af

Reorder models by total parameter count (smallest to largest) bc08390

Add 3 new models: Granite 4.0 Tiny 7B, LFM2 2.6B Transcript, Qwen3 4B Thinking 1f857e7

Add model-specific inference settings and dynamic UI controls 8937994

Remove DeepSeek-R1 1.5B and add repeat penalty to prevent loops 2fa67ca

Add 3 new 1.5B/1.7B models with Q4 quantization and reasoning toggle 56794dd

Upgrade model precision to Q8_0 for better quality bcda366

Remove Qwen3 Q2 model (low quality) f98d497

Add multi-model selection with CPU optimizations 6716c6a

Fix: Restore tag-based thinking parser, remove slow marker scanning 5f1c65f

Major UI/UX improvements - Enhanced interface with modern design 8f0478f

Major UI/UX improvements a54f798

Fix: Stream thinking tokens live instead of showing all at once 96d104f

Fix: Separate thinking and summary fields during streaming 7b4eca1

Fix: Support both <think> and <thinking> tag formats in parser 62499af

Test: Verify git workflow preserves commit messages fc5ac33

Test commit: verify git push workflow to HF Spaces b5d5f43

Add CLAUDE.md with comprehensive repository documentation 71a479f

Fix error handler yield types to match Gradio outputs 0bd3745

Fix streaming: show all output in thinking field, parsed in summary 1cb8ef5

Stream thinking token-by-token in real-time 4bee6d7

UI improvements: separate thinking/summary fields + markdown rendering 682522a

Add missing parse_thinking_blocks function b27d02e

Remove show_copy_button parameter (not supported in Gradio 5.x) 6122a55

Fix streaming output issues af2dd42

Add model info display to UI 0593fe6

Increase context window from 4096 to 32768 tokens a16152d

Fix runtime error: add libgomp1 for OpenMP support 65e6b09

Use prebuilt llama-cpp-python wheel for faster builds b3ec254

Exclude transcripts from HF Spaces deployment 2ca5026

Add HuggingFace Spaces demo with Gradio UI 10d339c

comprehensive model benchmark: 6 models evaluated for transcript summarization f175554

fix: Use official ERNIE 4.5 generation settings

bae5a29

feat: Replace ERNIE 21B with IQ3_XXS from mradermacher

d335258

fix: Switch ERNIE 21B IQ2_XXS to bartowski repo

c9df672

fix: Correct IQ2_XXS filename pattern for ERNIE 21B

4d1f944

feat: Add ERNIE 21B IQ2_XS variant - more stable quantization

813e74d

feat: Add thinking buffer for reasoning models - extends n_ctx by 50% of max_tokens when reasoning enabled

3893e85

fix: Make download component visible for JSON export

01620d7

fix: Download button now works - removed broken .then() chain

7df8d69

fix: Disable download button until content exists - prevent empty download field UX issue

79cb8cd

feat: Add comprehensive generation metrics to JSON export

c0889b4

Fix: Move update_reasoning_visibility to module level - was accidentally defined inside load_model function

451a43d

Invalidate Docker cache to force fresh app.py copy

a3a4921

Fix: Remove duplicate return statement in get_model_info

23d5038

UI/UX improvements: language selector, dynamic reasoning, copy buttons, adjustable temperature

6d54864

Add bilingual support: English and Traditional Chinese (zh-TW)

e78283f

Update CLAUDE.md with current implementation details

6be837f

Adjust ERNIE-4.5 21B Thinking generation settings for better summarization

ae53447

Update AGENTS.md with Gradio app coverage and deployment info

ddec8de

Add ERNIE-4.5-21B-Thinking (Q1_0) to model registry

80ca4af

Reorder models by total parameter count (smallest to largest)

bc08390

Add 3 new models: Granite 4.0 Tiny 7B, LFM2 2.6B Transcript, Qwen3 4B Thinking

1f857e7

Add model-specific inference settings and dynamic UI controls

8937994

Remove DeepSeek-R1 1.5B and add repeat penalty to prevent loops

2fa67ca

Add 3 new 1.5B/1.7B models with Q4 quantization and reasoning toggle

56794dd

Upgrade model precision to Q8_0 for better quality

bcda366

Remove Qwen3 Q2 model (low quality)

f98d497

Add multi-model selection with CPU optimizations

6716c6a

Fix: Restore tag-based thinking parser, remove slow marker scanning

5f1c65f

Major UI/UX improvements - Enhanced interface with modern design

8f0478f

Major UI/UX improvements

a54f798

Fix: Stream thinking tokens live instead of showing all at once

96d104f

Fix: Separate thinking and summary fields during streaming

7b4eca1

Fix: Support both <think> and <thinking> tag formats in parser

62499af

Test: Verify git workflow preserves commit messages

fc5ac33

Test commit: verify git push workflow to HF Spaces

b5d5f43

Add CLAUDE.md with comprehensive repository documentation

71a479f

Fix error handler yield types to match Gradio outputs

0bd3745

Fix streaming: show all output in thinking field, parsed in summary

1cb8ef5

Stream thinking token-by-token in real-time

4bee6d7

UI improvements: separate thinking/summary fields + markdown rendering

682522a

Add missing parse_thinking_blocks function

b27d02e

Remove show_copy_button parameter (not supported in Gradio 5.x)

6122a55

Fix streaming output issues

af2dd42

Add model info display to UI

0593fe6

Increase context window from 4096 to 32768 tokens

a16152d

Fix runtime error: add libgomp1 for OpenMP support

65e6b09

Use prebuilt llama-cpp-python wheel for faster builds

b3ec254

Exclude transcripts from HF Spaces deployment

2ca5026

Add HuggingFace Spaces demo with Gradio UI

10d339c

comprehensive model benchmark: 6 models evaluated for transcript summarization

f175554