SummarizerApp/app/services/structured_summarizer.py

Commit History

style: apply ruff formatting to structured_summarizer
bd7d2c1

ming Claude committed on

docs: add comprehensive V4 API documentation and optimize inference with SDPA
0072188

ming Claude committed on

chore: code formatting improvements and update gitignore
db3b809

ming committed on

Add V4 local server setup with MPS optimization for Android testing
45b6536

ming Claude committed on

Remove outlines library and all related code
d25a17f

ming committed on

Migrate to Ruff for linting/formatting and add comprehensive import tests
29ed661

ming committed on

Fix Outlines API usage for V4 JSON streaming endpoint
b47201f

ming committed on

Fix Outlines API usage - handle different calling patterns
4c4036e

ming committed on

Fix Outlines import - use generator instead of generate
9452571

ming committed on

Change Outlines debug log to info level for visibility
5fa0ba2

ming committed on

Add detailed Outlines API exploration and logging
f099d0c

ming committed on

Fix Outlines import - use correct API for installed version
e8ab865

ming committed on

Improve Outlines import error handling and logging
33cd483

ming committed on

Improve error messaging for Outlines unavailability
734e281

ming committed on

Fix Python 3.10 requirement and torch_dtype deprecation
6b859f2

ming committed on

Add Outlines JSON streaming endpoint for V4 structured summarization
441f66b

ming committed on

Fix JSON parsing errors in V4 NDJSON stream
85dcd04

ming committed on

Implement Option 3: Use FP16 for 2-3x faster inference
7fff563

ming committed on

Optimize V4 generation speed: greedy decoding + reduced max_tokens
fd2a8c1

ming committed on

Fix buffer parsing and strengthen brevity constraints
d112a13

ming committed on

Optimize V4 output verbosity and generation speed
17499f7

ming committed on

Fix bitsandbytes UID error with getpass patch
dd29a6d

ming committed on

Switch V4 to GPU INT4 quantization with Qwen-1.5B
a36f560

ming committed on

debug: Add comprehensive logging to diagnose 4-token issue
df75294

ming committed on

feat: Guarantee complete V4 NDJSON summaries with fallback
b321440

ming committed on

fix: Use Qwen chat template and harden NDJSON parsing
bf21a65

ming committed on

feat: Change device_map to auto for V4 model
d0701b0

ming committed on

feat: Add V4 NDJSON patch-based structured summarization
93c9664

ming committed on