Plasmoxy committed on
Commit 5dda5dc · 1 Parent(s): 4477bd7

Add dcsum code

Files changed (1): app.py (+222 -4)
app.py CHANGED
@@ -1,7 +1,225 @@
-def greet(name):
-    return "Hello " + name + "!!"
-
-demo = gr.Interface(fn=greet, inputs="text", outputs="text")
-demo.launch()
 
+"""
+DiscordSum - Hugging Face Space Gradio App
+Conversation summarization using Qwen3-0.6B-DiscordSum-mini-v1
+"""
+
 import gradio as gr
+import torch
+import time
+import re
+from typing import Dict, Any
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+# Model configuration
+MODEL_NAME = "Plasmoxy/Qwen3-0.6B-DiscordSum-mini-v1"
+
+# Sample conversation for demo
+SAMPLE_CONVERSATION = """[TechLead_Sarah]: Good morning team! We need to discuss the upcoming Q1 release. There are some critical issues that came up during yesterday's sprint review.
+[Backend_Mike]: Morning! Yeah, I noticed the authentication service is having intermittent failures in staging. We're seeing about 5% of login attempts timing out.
+[DevOps_Chen]: I can confirm that. The logs show connection pool exhaustion during peak load. We might need to increase the max connections or implement better connection recycling.
+[Frontend_Emma]: That explains the user complaints we've been getting. Is this affecting the password reset flow too?
+[Backend_Mike]: Good question. Let me check... yes, it looks like any endpoint that touches the auth service is affected. Password resets, token refreshes, and social login callbacks.
+[TechLead_Sarah]: This is a P0 issue then. Mike, can you take the lead on fixing this? We need it resolved before the release.
+[Backend_Mike]: Absolutely. I'll start by profiling the connection usage patterns. Chen, can you help me analyze the infrastructure metrics?
+[DevOps_Chen]: Sure thing. I'll pull the CloudWatch data and set up a dashboard. Should have something by end of day.
+[QA_Alex]: While we're on critical issues, I found a data corruption bug in the export feature. When users export large datasets (>10k rows), some columns are getting scrambled.
+[Backend_Mike]: Oh no, that sounds serious. Do you have reproduction steps?
+[QA_Alex]: Yes, I documented everything in JIRA ticket ENG-2847. Happens consistently with the customer data export when you select more than 5 columns and filter by date range.
+[Frontend_Emma]: I worked on that feature last month. Let me take a look at the ticket. It might be related to how we're chunking the data before sending it to the backend.
+[TechLead_Sarah]: Emma, pair with Alex on this one. We can't ship with data corruption issues. What's the ETA on a fix?
+[Frontend_Emma]: Give me a few hours to investigate. If it's what I think it is, should be a quick fix in the data serialization logic.
+[Product_Manager_Lisa]: Just joining - are these issues going to delay our release? We have customer commitments for next Friday.
+[TechLead_Sarah]: Too early to say definitively, but we're treating both as blockers. Lisa, can you give us until tomorrow morning to assess the scope?
+[Product_Manager_Lisa]: Tomorrow morning works. I'll prepare a communication plan for customers in case we need to push back the date.
+[DevOps_Chen]: One more thing - our staging environment is going to undergo scheduled maintenance tonight from 11 PM to 2 AM EST. Just a heads up for anyone planning to work late.
+[Backend_Mike]: Thanks for the notice. I'll do my connection pool testing before then.
+[Security_James]: Hey folks, not to pile on, but I need to mention that our security audit identified some concerns with how we're handling API keys in the logging system. We're potentially exposing sensitive tokens in debug logs.
+[TechLead_Sarah]: James, is this something that needs immediate attention or can it wait until after the release?
+[Security_James]: It's not being actively exploited, but it's a significant vulnerability. I'd recommend we fix it this sprint. I can prepare a PR that redacts sensitive data from logs.
+[Backend_Mike]: I can review that PR. Should be straightforward - we just need to update our logging middleware.
+[Frontend_Emma]: Sarah, should we schedule a follow-up meeting to go through all these items in detail?
+[TechLead_Sarah]: Yes, let's do a quick sync at 2 PM today. I'll send out a calendar invite. Priority items: auth service failures, data export corruption, and security logging issue.
+[QA_Alex]: I'll prepare a full regression test plan for the auth service fix. We need to make sure we don't break anything else.
+[DevOps_Chen]: I'll also set up automated load testing for the auth service so we can catch these issues earlier in the future.
+[Product_Manager_Lisa]: Appreciate everyone jumping on this. I'll be in the 2 PM meeting with updates from the customer success team.
+[Backend_Mike]: Quick question - do we have any insight into when the auth issues started? Was it after the last deployment?
+[DevOps_Chen]: Looking at the metrics now... it started appearing about 4 days ago, which coincides with our database migration to the new instance type.
+[Backend_Mike]: Ah! That's a crucial data point. The new instance might have different connection limits or network characteristics.
+[DevOps_Chen]: Exactly what I was thinking. I'll check the RDS configuration and compare it with our old setup.
+[TechLead_Sarah]: Great detective work. Let's keep this thread updated with findings. Mike and Chen, prioritize the auth issue. Emma and Alex, focus on the export bug. James, get that security PR ready for review.
+[Security_James]: Will do. I'll have it ready by noon.
+[Frontend_Emma]: Alex, I'm looking at your ticket now. Can you jump on a quick call to walk me through the reproduction?
+[QA_Alex]: Sure, sending you a Zoom link now.
+[TechLead_Sarah]: Thanks everyone for the quick response. Let's crush these bugs and get back on track for the release!"""
+
+# Global model and tokenizer
+model = None
+tokenizer = None
+
+
+def load_model():
+    """Load model and tokenizer"""
+    global model, tokenizer
+
+    print(f"Loading model: {MODEL_NAME}")
+
+    tokenizer = AutoTokenizer.from_pretrained(
+        MODEL_NAME,
+        trust_remote_code=True,
+        padding_side="right"
+    )
+
+    if tokenizer.pad_token is None:
+        tokenizer.pad_token = tokenizer.eos_token
+        tokenizer.pad_token_id = tokenizer.eos_token_id
+
+    model = AutoModelForCausalLM.from_pretrained(
+        MODEL_NAME,
+        device_map="auto",
+        torch_dtype=torch.float32,
+        trust_remote_code=True,
+    )
+
+    model.eval()
+
+    print("Model loaded successfully!")
+
+
+def format_inference_prompt(conversation: str) -> str:
+    """Format inference prompt using chat template"""
+    messages = [
+        {
+            "role": "system",
+            "content": "Summarize Discord conversations into a paragraph capturing key points, decisions, and action items."
+        },
+        {
+            "role": "user",
+            "content": f"Summarize the following conversation:\n\n{conversation}"
+        }
+    ]
+
+    formatted = tokenizer.apply_chat_template(
+        messages,
+        tokenize=False,
+        add_generation_prompt=True,
+        enable_thinking=False
+    )
+
+    # Clean up chat template output
+    formatted = re.sub(r'<think>[\s\S]*?</think>', '', formatted)
+    formatted = re.sub(r'(<\|im_end\|>)(?=<\|im_start\|>)', r'\1\n', formatted)
+    formatted = re.sub(r'(<\|im_start\|>[^<>\n]+)\s*\n\s*\n', r'\1\n', formatted)
+    formatted = re.sub(r'\n{3,}', '\n\n', formatted)
+    formatted = formatted.strip()
+
+    return formatted
+
+
+def extract_summary(response: str) -> str:
+    """Extract summary from model response"""
+    match = re.search(r'Summary:\s*(.*?)(?:<\|im_end\|>|$)', response, re.DOTALL)
+    if match:
+        return match.group(1).strip()
+    return response.strip()
+
+
+def summarize_conversation(conversation: str):
+    """Summarize conversation using the model"""
+    if not conversation or not conversation.strip():
+        return "Error: Conversation cannot be empty", None
+
+    try:
+        start_time = time.time()
+
+        # Format prompt
+        prompt = format_inference_prompt(conversation)
+
+        # Tokenize
+        inputs = tokenizer(
+            prompt,
+            return_tensors="pt",
+            truncation=True,
+            max_length=2048
+        ).to(model.device)
+
+        input_tokens = inputs["input_ids"].shape[1]
+        warmup_time = time.time() - start_time
+
+        # Generate
+        generation_start = time.time()
+
+        with torch.no_grad():
+            outputs = model.generate(
+                **inputs,
+                max_new_tokens=200,
+                temperature=0.7,
+                top_p=0.9,
+                do_sample=True,
+                pad_token_id=tokenizer.pad_token_id,
+                eos_token_id=tokenizer.eos_token_id,
+            )
+
+        inference_time = time.time() - generation_start
+
+        # Decode
+        response = tokenizer.decode(
+            outputs[0][input_tokens:],
+            skip_special_tokens=True
+        )
+
+        # Extract summary
+        summary = extract_summary(response)
+
+        # Calculate stats
+        output_tokens = outputs.shape[1] - input_tokens
+        total_time = time.time() - start_time
+        tokens_per_second = output_tokens / inference_time if inference_time > 0 else 0
+
+        # Create stats table data
+        stats_data = [
+            ["Inference Time", f"{inference_time:.2f}s"],
+            ["Warmup Time", f"{warmup_time:.2f}s"],
+            ["Total Time", f"{total_time:.2f}s"],
+            ["Tokens/Second", f"{tokens_per_second:.1f}"],
+            ["Input Tokens", str(input_tokens)],
+            ["Output Tokens", str(output_tokens)],
+            ["Total Tokens", str(outputs.shape[1])],
+        ]
+
+        return summary, stats_data
+    except Exception as e:
+        return f"Error: {str(e)}", None
+
+
+# Load model on startup
+load_model()
 
+# Create Gradio interface
+demo = gr.Interface(
+    fn=summarize_conversation,
+    inputs=gr.Textbox(
+        label="Discord Conversation",
+        placeholder="Paste your Discord conversation here...",
+        lines=15,
+        value=SAMPLE_CONVERSATION
+    ),
+    outputs=[
+        gr.Textbox(
+            label="Summary",
+            lines=10
+        ),
+        gr.Dataframe(
+            label="Statistics",
+            headers=["Metric", "Value"],
+            datatype=["str", "str"],
+            row_count=7,
+            column_count=2,
+        )
+    ],
+    title="DiscordSum - Conversation Summarizer",
+    description="Summarize Discord conversations into short paragraphs. Runs [Plasmoxy/Qwen3-0.6B-DiscordSum-mini-v1](https://huggingface.co/Plasmoxy/Qwen3-0.6B-DiscordSum-mini-v1).",
+    examples=[[SAMPLE_CONVERSATION]],
+)
 
+if __name__ == "__main__":
+    demo.launch()
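Note: the `extract_summary` helper added in this commit is pure string processing, so it can be exercised in isolation without loading the model. The sketch below copies the regex verbatim from the diff; the sample inputs are made up for illustration:

```python
import re


def extract_summary(response: str) -> str:
    """Extract the text after a 'Summary:' marker, dropping a trailing <|im_end|> token."""
    match = re.search(r'Summary:\s*(.*?)(?:<\|im_end\|>|$)', response, re.DOTALL)
    if match:
        return match.group(1).strip()
    return response.strip()


# The marker and the end-of-turn token are both stripped:
print(extract_summary("Summary: The team triaged three blockers.<|im_end|>"))
# → "The team triaged three blockers."

# A response without the marker falls through and is returned stripped:
print(extract_summary("  no marker here  "))
# → "no marker here"
```

Because the pattern allows `$` as an alternative terminator, responses where generation stopped before emitting `<|im_end|>` are still captured.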