fix: add missing meeting_summarizer module to Dockerfile for HF Spaces deployment
- AGENTS.md   +26 -74
- Dockerfile  +1 -0
- app.py      +1 -2
AGENTS.md (CHANGED)
@@ -22,7 +22,6 @@ python app.py # Starts on port 7860
 **Linting (if ruff installed):**
 ```bash
 ruff check .
-ruff check --select I . # Import sorting
 ruff format . # Auto-format code
 ```
 
@@ -32,20 +31,24 @@ mypy summarize_transcript.py
 mypy app.py
 ```
 
-**Running tests:**
+**Running tests (root project tests):**
 ```bash
-#
-
-
+# Run E2E test
+python test_e2e.py
+
+# Run advanced mode test
+python test_advanced_mode.py
+
+# Run LFM2 extraction test
+python test_lfm2_extract.py
 ```
 
-**
+**llama-cpp-python submodule tests:**
 ```bash
-
-cd llama-cpp-python && pytest tests/test_llama.py::test_function_name -v
+cd llama-cpp-python && pip install ".[test]" && pytest tests/test_llama.py -v
 
-# Run
-cd llama-cpp-python && pytest --full-trace -v
+# Run specific test
+cd llama-cpp-python && pytest tests/test_llama.py::test_function_name -v
 ```
 
 ## Code Style Guidelines
@@ -53,13 +56,13 @@ cd llama-cpp-python && pytest --full-trace -v
 **Formatting:**
 - Use 4 spaces for indentation
 - Line length: 100 characters max
-- Use double quotes for docstrings
+- Use double quotes for docstrings
 - Two blank lines before function definitions
 - One blank line after docstrings
 
-**Imports:**
+**Imports (ordered):**
 ```python
-# Standard library
+# Standard library
 import os
 import argparse
 import re
@@ -73,9 +76,9 @@ import gradio as gr
 ```
 
 **Type Hints:**
-- Use type hints for
+- Use type hints for parameters and return values
 - Use `Optional[]` for nullable types
-- Use `Generator[str, None, None]` for
+- Use `Generator[str, None, None]` for generators
 - Example: `def load_model(repo_id: str, filename: str, cpu_only: bool = False) -> Llama:`
 
 **Naming Conventions:**
@@ -86,8 +89,8 @@ import gradio as gr
 
 **Docstrings:**
 - Use triple quotes for all public functions
+- Keep first line as brief summary
 - Include Args/Returns sections for complex functions
-- Keep first line as a brief summary
 
 **Error Handling:**
 - Use explicit error messages with f-strings
@@ -113,15 +116,13 @@ import gradio as gr
 ```
 tiny-scribe/
 ├── summarize_transcript.py # Main CLI script
-├── app.py # Gradio web app
+├── app.py # Gradio web app
 ├── requirements.txt # Python dependencies
-├── Dockerfile # HF Spaces deployment config
 ├── transcripts/ # Input transcript files
-
-
+├── test_e2e.py # E2E test
+├── test_advanced_mode.py # Advanced mode test
+├── test_lfm2_extract.py # LFM2 extraction test
 ├── llama-cpp-python/ # Git submodule
-│   ├── tests/ # Test suite
-│   └── vendor/llama.cpp/ # Core C++ library
 └── README.md # Project documentation
 ```
 
@@ -148,63 +149,14 @@ stream = llm.create_chat_completion(
 )
 ```
 
-**Thinking Block Parsing:**
-```python
-# Extract thinking/reasoning blocks from model output
-THINKING_PATTERN = re.compile(r'<think(?:ing)?>(.*?)</think(?:ing)?>', re.DOTALL)
-
-for chunk in stream:
-    delta = chunk["choices"][0]["delta"]
-    if content := delta.get("content", ""):
-        buffer += content
-        thinking_match = THINKING_PATTERN.search(buffer)
-        if thinking_match:
-            thinking = thinking_match.group(1).strip()
-            buffer = buffer[:thinking_match.start()] + buffer[thinking_match.end():]
-```
-
-**Chinese Text Conversion (zh-TW mode only):**
-```python
-# Convert Simplified Chinese to Traditional Chinese (Taiwan)
-converter = OpenCC('s2twp') # s2twp = Simplified to Traditional (Taiwan)
-# Only apply when output_language == "zh-TW"
-if output_language == "zh-TW":
-    traditional_text = converter.convert(simplified_text)
-else:
-    traditional_text = simplified_text # Skip conversion for English
-```
-
 ## Notes for AI Agents
 
-- This is a simple utility project; no formal CI/CD or test suite in root
-- When modifying, maintain the existing streaming output pattern
 - Always call `llm.reset()` after completion to ensure state isolation
 - Model format: `repo_id:quant` (e.g., `unsloth/Qwen3-1.7B-GGUF:Q2_K_L`)
-- Default language output is English (zh-TW available via `-l zh-TW`)
+- Default language output is English (zh-TW available via `-l zh-TW` or web UI)
 - OpenCC conversion only applied when output_language is "zh-TW"
 - HuggingFace cache at `~/.cache/huggingface/hub/` - clean periodically
 - HF Spaces runs on CPU tier with 2 vCPUs, 16GB RAM
 - Keep model sizes under 4GB for reasonable performance on free tier
-
-
-
-```bash
-# Initialize/update submodules
-git submodule update --init --recursive
-
-# Update llama-cpp-python to latest
-cd llama-cpp-python && git pull origin main && cd .. && git add llama-cpp-python
-```
-
-## Docker/HuggingFace Spaces Deployment
-
-```bash
-# Build locally
-docker build -t tiny-scribe .
-
-# Run locally
-docker run -p 7860:7860 tiny-scribe
-
-# Deploy script
-./deploy.sh # Commits, pushes, and triggers HF Spaces rebuild
-```
+- Tests exist in root (test_e2e.py, test_advanced_mode.py, test_lfm2_extract.py)
+- Submodule tests in llama-cpp-python/tests/
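The notes kept by this diff still pin down the runtime contract: stream the completion, then call `llm.reset()` for state isolation. A minimal sketch of that pattern, assuming the model id from the notes' own `repo_id:quant` example; the prompt and output handling are illustrative:

```python
# Stream-then-reset pattern described in "Notes for AI Agents".
# Model id from the notes' example; prompt and printing are illustrative.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="unsloth/Qwen3-1.7B-GGUF",
    filename="*Q2_K_L.gguf",  # the quant half of the repo_id:quant format
)

try:
    stream = llm.create_chat_completion(
        messages=[{"role": "user", "content": "Summarize this transcript: ..."}],
        stream=True,
    )
    for chunk in stream:
        delta = chunk["choices"][0]["delta"]
        if content := delta.get("content", ""):
            print(content, end="", flush=True)
finally:
    llm.reset()  # state isolation: the next request must not see this context
```

Putting `reset()` in a `finally` block keeps the isolation guarantee even when a stream aborts midway.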
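The new submodule test commands are shell one-liners; a single Python entry point can wrap them for agents. The wrapper below is a hypothetical convenience, not a file in the repo; it only encodes the commands documented above:

```python
# run_submodule_tests.py (hypothetical helper, not part of the repo).
import subprocess
import sys


def run_submodule_tests(test_expr: str = "tests/test_llama.py") -> int:
    """Run llama-cpp-python tests; pass 'tests/test_llama.py::test_name' for one test."""
    return subprocess.call(
        [sys.executable, "-m", "pytest", test_expr, "-v"],
        cwd="llama-cpp-python",  # mirrors 'cd llama-cpp-python' in AGENTS.md
    )


if __name__ == "__main__":
    sys.exit(run_submodule_tests(*sys.argv[1:2]))
```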
Dockerfile (CHANGED)
@@ -19,6 +19,7 @@ RUN pip install --no-cache-dir -r requirements.txt
 
 # Copy application files
 COPY app.py .
+COPY meeting_summarizer/ meeting_summarizer/
 
 # Pre-download model on build (optional, speeds up first run)
 # RUN python -c "from huggingface_hub import hf_hub_download; hf_hub_download(repo_id='unsloth/Qwen3-0.6B-GGUF', filename='Qwen3-0.6B-Q4_K_M.gguf', local_dir='./models')"
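The one-line `COPY` is the actual deployment fix: the image previously shipped `app.py` without the package it imports, so the Space would fail at startup (a `ModuleNotFoundError` for `meeting_summarizer` is the expected symptom, though the commit includes no log). A throwaway smoke check along these lines, with a hypothetical script name and module list, can catch the regression before pushing:

```python
# smoke_check.py (hypothetical): run inside the built image, e.g.
#   docker run --rm tiny-scribe python smoke_check.py
import importlib
import sys

# Modules app.py needs at import time; meeting_summarizer is the one this
# commit adds to the image. The list is an assumption, not from the commit.
for module in ("gradio", "meeting_summarizer"):
    try:
        importlib.import_module(module)
    except ModuleNotFoundError as err:
        sys.exit(f"image is missing a module: {err}")

print("all imports resolve inside the image")
```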
app.py (CHANGED)
@@ -3417,6 +3417,5 @@ if __name__ == "__main__":
         server_name="0.0.0.0",
         server_port=7860,
         share=False,
-        show_error=True
-        css=custom_css
+        show_error=True
     )
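The `app.py` change trims the `launch()` call: `css=custom_css` is removed and `show_error=True` stays as the final argument. In Gradio, custom CSS is a constructor argument on `gr.Blocks`, not a `launch()` parameter, so the sketch below shows the split this fix implies; the CSS string and UI body are placeholders:

```python
# css belongs on gr.Blocks(); launch() keeps only server options.
import gradio as gr

custom_css = ".gradio-container { max-width: 960px; }"  # placeholder CSS

with gr.Blocks(css=custom_css) as demo:
    gr.Markdown("tiny-scribe")  # stand-in for the real UI

if __name__ == "__main__":
    demo.launch(
        server_name="0.0.0.0",
        server_port=7860,
        share=False,
        show_error=True,
    )
```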