Spaces:

Divyonko
/

LivePulse

Sleeping

App Files Files Community

DivYonko commited on Apr 15

Commit

bcca570

2 Parent(s): a4612d4 58cbb3a

fix: remove unsupported width='stretch' from buttons

Browse files

Files changed (7) hide show

.gitattributes +38 -0
CHANGELOG.md +177 -0
Dockerfile +23 -0
README.md +20 -0
app.py +25 -23
requirements.txt +6 -0
src/streamlit_app.py +40 -0

.gitattributes CHANGED Viewed

@@ -1,4 +1,42 @@
 *.safetensors filter=lfs diff=lfs merge=lfs -text
 *.bin filter=lfs diff=lfs merge=lfs -text
 *.pt filter=lfs diff=lfs merge=lfs -text
 *.pth filter=lfs diff=lfs merge=lfs -text

+<<<<<<< HEAD
 *.safetensors filter=lfs diff=lfs merge=lfs -text
 *.bin filter=lfs diff=lfs merge=lfs -text
 *.pt filter=lfs diff=lfs merge=lfs -text
 *.pth filter=lfs diff=lfs merge=lfs -text
+=======
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+>>>>>>> 58cbb3ad16a724133db9fe31bffce8783a85648a

CHANGELOG.md ADDED Viewed

	@@ -0,0 +1,177 @@

+# LivePulse — Development Changelog
+**Date:** April 14, 2026
+**Session summary:** Dashboard UX upgrades, multi-stream comparison, analytics features, performance optimizations, and bug fixes.
+---
+## Files Modified
+| File | Original Lines | Final Lines | Change |
+|------|---------------|-------------|--------|
+| `frontend/streamlit_app.py` | ~540 | 1354 | +814 |
+| `backend/scraper.py` | 115 | 135 | +20 |
+| `requirements.txt` | 22 | 35 | +13 |
+---
+## 1. Dashboard UX Upgrades (`frontend/streamlit_app.py`)
+### 1.1 Sentiment Heatmap Over Time
+- Added `build_heatmap_data()` — buckets all messages into 1-minute intervals and counts Positive / Neutral / Negative per bucket
+- Rendered as a stacked bar chart (Plotly) showing mood volume over the full stream lifetime
+- Includes "View data" toggle and CSV export
+### 1.2 Sentiment Velocity
+- Added `compute_velocity()` — compares positive ratio of last 20 messages vs previous 20
+- Displayed as a 5th stat card alongside cumulative counts
+- Three states: ↑ Rising (green), → Stable (yellow), ↓ Falling (red)
+- Shows delta percentage shift
+### 1.3 Notification / Alert System
+- **Negative spike alert** — pulsing red banner when negative % in rolling window exceeds configurable threshold (default 40%)
+- **Spam surge alert** — separate orange banner when spam topic % exceeds configurable threshold (default 30%)
+- Both alerts are dismissable with a ✕ button and re-arm automatically when new messages arrive
+- Alert window size and thresholds configurable from sidebar sliders
+### 1.4 Pinned Messages
+- Every message in the live feed has a 📍 pin button
+- Pinned messages appear in a dedicated "Pinned Messages" section above the feed with gold highlight styling
+- Individual unpin buttons per message
+- Sidebar shows pin count and a "Clear pins" button
+- Pin state persists across auto-refreshes via `st.session_state`
+### 1.5 Multi-Stream Comparison (fully rebuilt)
+- Sidebar now manages up to **5 independent stream slots** (A–E), each with its own color, video ID field, Redis key field, and Start/Stop buttons
+- **＋ Add stream / － Remove last** buttons to dynamically add/remove slots
+- Comparison section appears automatically when 2+ streams have data — no toggle needed
+- Renders sentiment bar charts in rows of 3
+- Overlay line chart shows rolling positive % for all active streams on the same axis
+- Fixed Streamlit widget re-render bug: widget keys used as single source of truth instead of `value=` overrides
+---
+## 2. Analytics & Insights Features (`frontend/streamlit_app.py`)
+### 2.1 Engagement Score
+- `compute_engagement()` — composite 0–100 score from:
+  - Message rate (msgs/min) — 40% weight
+  - Positive ratio — 40% weight
+  - Question density — 20% weight
+- Displayed as a large score card with a fill bar and grade (🔥 High / ⚡ Medium / 💤 Low)
+- Three supporting metric tiles: Msgs/min, Positive ratio, Question density
+### 2.2 Top Contributors Leaderboard
+- `compute_top_contributors()` — ranks authors by message count, tracks per-author sentiment breakdown
+- Left panel: ranked list with 🥇🥈🥉 medals, progress bar, colored sentiment dots per author
+- Right panel: stacked horizontal bar chart showing sentiment % for top 5 authors
+- CSV export of full leaderboard
+### 2.3 Word Cloud
+- `compute_word_freq()` — extracts top 60 words after removing stopwords (English + common Hinglish filler words)
+- Filterable by sentiment (All / Positive / Neutral / Negative) and topic
+- Renders word cloud image via `wordcloud` library using `wc.to_array()` directly (no matplotlib pipeline)
+- Top-20 frequency bar chart shown below the cloud
+- Falls back to bar chart only if `wordcloud` not installed
+### 2.4 Spam Rate Alert
+- `check_spam_alert()` — monitors spam topic ratio in rolling window
+- Separate dismissable banner distinct from the negative sentiment alert
+- Configurable threshold and window from sidebar
+---
+## 3. Backend: Multi-Stream Scraper (`backend/scraper.py`)
+### Changes
+- Added `argparse` CLI interface with two arguments:
+  - `--video_id` — YouTube video ID to scrape (defaults to `config.py` value)
+  - `--redis_key` — Redis list key to write messages to (defaults to `chat_messages`)
+- `run()` function now accepts `video_id` and `redis_key` as parameters instead of reading globals
+- Redis connection moved inside `run()` so each scraper instance is fully independent
+- Each stream writes to its own Redis key, enabling true parallel multi-stream operation
+**Usage:**
+```bash
+# Stream A (default)
+python -m backend.scraper --video_id ABC123 --redis_key chat_messages
+# Stream B
+python -m backend.scraper --video_id XYZ789 --redis_key chat_messages_b
+# Stream C
+python -m backend.scraper --video_id DEF456 --redis_key chat_messages_c
+```
+---
+## 4. Performance Optimizations (`frontend/streamlit_app.py`)
+### 4.1 Redis Read Deduplication
+- `load_stream_data("chat_messages")` called **once** per refresh cycle
+- Windowed slice (`data = all_data[-msg_limit:]`) derived in-memory instead of a second Redis read
+- Multi-stream comparison reuses cached data instead of calling `load_stream_data` twice per stream
+### 4.2 `st.cache_data` on Heavy Functions
+| Function | TTL | Benefit |
+|----------|-----|---------|
+| `load_stream_data()` | 5s | Prevents redundant Redis reads within same refresh |
+| `compute_velocity()` | 10s | Skips recompute if data unchanged |
+| `build_heatmap_data()` | 10s | Skips full groupby on every refresh |
+| `compute_engagement()` | 10s | Skips recompute if data unchanged |
+| `compute_top_contributors()` | 10s | Skips recompute if data unchanged |
+| `compute_word_freq()` | 10s | Skips word counting on every refresh |
+### 4.3 Cache-Compatible Function Signatures
+- `compute_velocity()` and `build_heatmap_data()` refactored to accept JSON strings instead of DataFrames — `st.cache_data` requires hashable arguments and DataFrames are not hashable
+### 4.4 DataFrame Construction
+- `all_df` built once from `all_data`, `df` sliced from it — no duplicate parsing
+---
+## 5. Bug Fixes
+### 5.1 Multi-Stream Widget Re-render Bug
+- **Problem:** `st.text_input(value=stream["video_id"])` was resetting the field to the old value on every Streamlit rerun, so video IDs typed for Stream B/C were wiped before the Start button handler could read them
+- **Fix:** Widget keys (`vid_0`, `rkey_0`, etc.) initialized once via `st.session_state[key] = ...` and used as the sole source of truth. `value=` parameter removed entirely.
+### 5.2 Active Stream Detection
+- **Problem:** `r.exists(key)` returns an integer (0 or 1), not a bool, and returns 1 for any existing key including empty lists
+- **Fix:** Changed to `r.llen(key) > 0` which correctly checks for actual message data
+### 5.3 WordCloud Crash
+- **Problem:** `background_color="transparent"` is not a valid PIL color specifier, causing `ValueError: unknown color specifier: 'transparent'`
+- **Fix:** Changed to `background_color="white"` and render via `wc.to_array()` directly — removes the matplotlib pipeline entirely
+### 5.4 Streamlit Deprecation Warning
+- **Problem:** `use_container_width=True/False` deprecated, removed after 2025-12-31
+- **Fix:** All 21 occurrences replaced with `width='stretch'` / `width='content'`
+---
+## 6. Dependencies Added (`requirements.txt`)
+```
+matplotlib
+wordcloud
+```
+---
+## Architecture Overview (Post-Session)
+```
+Redis
+ ├── chat_messages        ← Stream A scraper writes here
+ ├── chat_messages_b      ← Stream B scraper writes here
+ ├── chat_messages_c      ← Stream C scraper writes here
+ ├── chat_messages_d      ← Stream D scraper writes here
+ ├── chat_messages_e      ← Stream E scraper writes here
+ └── video_title          ← Stream A title for page header
+backend/scraper.py        ← One process per stream, --video_id + --redis_key args
+backend/main.py           ← FastAPI REST API (reads from chat_messages)
+frontend/streamlit_app.py ← Dashboard (reads from all active Redis keys)
+ml/sentiment_model.py     ← 3-model ensemble (MuRIL + XLM-R + Multilingual)
+ml/topic_model.py         ← Keyword fast-path + BART zero-shot fallback
+```

Dockerfile CHANGED Viewed

@@ -1,3 +1,4 @@
 FROM python:3.11-slim
 WORKDIR /app
@@ -8,3 +9,25 @@ COPY . .
 EXPOSE 7860
 CMD ["streamlit", "run", "app.py", "--server.port", "7860", "--server.address", "0.0.0.0"]

+<<<<<<< HEAD
 FROM python:3.11-slim
 WORKDIR /app
 EXPOSE 7860
 CMD ["streamlit", "run", "app.py", "--server.port", "7860", "--server.address", "0.0.0.0"]
+=======
+FROM python:3.13.5-slim
+WORKDIR /app
+RUN apt-get update && apt-get install -y \
+    build-essential \
+    curl \
+    git \
+    && rm -rf /var/lib/apt/lists/*
+COPY requirements.txt ./
+COPY src/ ./src/
+RUN pip3 install -r requirements.txt
+EXPOSE 8501
+HEALTHCHECK CMD curl --fail http://localhost:8501/_stcore/health
+ENTRYPOINT ["streamlit", "run", "src/streamlit_app.py", "--server.port=8501", "--server.address=0.0.0.0"]
+>>>>>>> 58cbb3ad16a724133db9fe31bffce8783a85648a

README.md CHANGED Viewed

@@ -1,5 +1,6 @@
 ---
 title: LivePulse
 emoji: 📡
 colorFrom: purple
 colorTo: indigo
@@ -31,3 +32,22 @@ Real-time Hinglish sentiment and topic analysis for YouTube live streams.
 1. Paste a YouTube live video ID or URL in the **Stream Control** section in the sidebar
 2. Click **▶ Start** — the scraper launches in the background
 3. The dashboard auto-refreshes and shows live sentiment + topic data

 ---
 title: LivePulse
+<<<<<<< HEAD
 emoji: 📡
 colorFrom: purple
 colorTo: indigo
 1. Paste a YouTube live video ID or URL in the **Stream Control** section in the sidebar
 2. Click **▶ Start** — the scraper launches in the background
 3. The dashboard auto-refreshes and shows live sentiment + topic data
+=======
+emoji: 🚀
+colorFrom: red
+colorTo: red
+sdk: docker
+app_port: 8501
+tags:
+- streamlit
+pinned: false
+short_description: YoutubeLive Comment Analysis
+---
+# Welcome to Streamlit!
+Edit `/src/streamlit_app.py` to customize this app to your heart's desire. :heart:
+If you have any questions, checkout our [documentation](https://docs.streamlit.io) and [community
+forums](https://discuss.streamlit.io).
+>>>>>>> 58cbb3ad16a724133db9fe31bffce8783a85648a

app.py CHANGED Viewed

@@ -1,4 +1,4 @@
-# -*- coding: utf-8 -*-
 """
 app.py — Hugging Face Spaces adaptation of frontend/streamlit_app.py
 All features identical; infrastructure layer uses in-memory deque store
@@ -684,7 +684,7 @@ with st.sidebar:
         sc1, sc2 = st.columns(2)
         with sc1:
-            if st.button("▶ Start", key=f"start_{idx}", width='stretch'):
                 vid  = extract_video_id(st.session_state[vid_skey])
                 rkey = st.session_state[rkey_skey].strip() or f"chat_messages_{label.lower()}"
                 if vid:
@@ -703,7 +703,7 @@ with st.sidebar:
                 else:
                     st.error("Invalid video ID or URL")
         with sc2:
-            if st.button("⏹ Stop", key=f"stop_{idx}", width='stretch'):
                 if is_scraper_running(idx):
                     stop_scraper(idx)
                     st.session_state.streams[idx]["proc"] = None
@@ -722,7 +722,7 @@ with st.sidebar:
     add_col, rem_col = st.columns(2)
     with add_col:
         if len(st.session_state.streams) < MAX_STREAMS:
-            if st.button("＋ Add stream", width='stretch'):
                 n = len(st.session_state.streams)
                 st.session_state.streams.append({
                     "video_id":  "",
@@ -733,7 +733,7 @@ with st.sidebar:
                 st.rerun()
     with rem_col:
         if len(st.session_state.streams) > 1:
-            if st.button("－ Remove last", width='stretch'):
                 removed = st.session_state.streams.pop()
                 removed_idx = len(st.session_state.streams)
                 stop_scraper(removed_idx)
@@ -745,14 +745,14 @@ with st.sidebar:
     st.markdown('<p style="font-size:0.68rem;font-weight:700;color:var(--accent);text-transform:uppercase;letter-spacing:0.1em;margin-bottom:8px;">Pinned Messages</p>', unsafe_allow_html=True)
     pin_count = len(st.session_state.pinned_messages)
     st.markdown(f'<div style="font-size:0.78rem;color:var(--text-3);">{pin_count} message{"s" if pin_count != 1 else ""} pinned</div>', unsafe_allow_html=True)
-    if pin_count > 0 and st.button("🗑 Clear pins", width='stretch'):
         st.session_state.pinned_messages = []
         st.rerun()
     st.divider()
     # ── Danger Zone ──
     st.markdown('<p style="font-size:0.68rem;font-weight:700;color:#ef4444;text-transform:uppercase;letter-spacing:0.1em;margin-bottom:8px;">Danger Zone</p>', unsafe_allow_html=True)
-    if st.button("🗑 Clear all data", width='stretch'):
         for s in st.session_state.streams:
             store_delete(s["redis_key"])
         st.session_state.pinned_messages = []
@@ -952,7 +952,7 @@ with col_l:
         hovertemplate="<b>%{x}</b><br>Count: %{y}<extra></extra>",
     ))
     fig_bar.update_layout(**plotly_layout(260))
-    st.plotly_chart(fig_bar, width='stretch', config={"displayModeBar": False})
     bar_hdr, bar_dl = st.columns([1, 1])
     with bar_hdr:
         show_bar_data = st.checkbox("View data", key="show_bar")
@@ -960,7 +960,7 @@ with col_l:
         bar_df = pd.DataFrame({"Sentiment": ["Positive", "Neutral", "Negative"], "Count": [pos, neu, neg]})
         csv_download(bar_df, "Download CSV", "sentiment_distribution.csv")
     if show_bar_data:
-        st.dataframe(bar_df, width='stretch', hide_index=True)
     st.markdown('</div>', unsafe_allow_html=True)
 with col_r:
@@ -979,7 +979,7 @@ with col_r:
            "showlegend": True,
            "legend": dict(orientation="h", y=-0.08, font=dict(size=11))}
     )
-    st.plotly_chart(fig_pie, width='stretch', config={"displayModeBar": False})
     pie_hdr, pie_dl = st.columns([1, 1])
     with pie_hdr:
         show_pie_data = st.checkbox("View data", key="show_pie")
@@ -991,7 +991,7 @@ with col_r:
         })
         csv_download(pie_df, "Download CSV", "sentiment_breakdown.csv")
     if show_pie_data:
-        st.dataframe(pie_df, width='stretch', hide_index=True)
     st.markdown('</div>', unsafe_allow_html=True)
 # ── Confidence trend ──────────────────────────────────────────
@@ -1012,7 +1012,7 @@ if "confidence" in df.columns:
     ))
     fig_line.update_layout(**plotly_layout(180))
     fig_line.update_yaxes(range=[0, 1])
-    st.plotly_chart(fig_line, width='stretch', config={"displayModeBar": False})
     conf_hdr, conf_dl = st.columns([1, 1])
     with conf_hdr:
         show_conf_data = st.checkbox("View data", key="show_conf")
@@ -1021,7 +1021,7 @@ if "confidence" in df.columns:
         conf_export.columns = ["message_index", "confidence"]
         csv_download(conf_export, "Download CSV", "confidence_trend.csv")
     if show_conf_data:
-        st.dataframe(conf_export, width='stretch', hide_index=True)
     st.markdown('</div>', unsafe_allow_html=True)
@@ -1054,7 +1054,7 @@ if not heatmap_data.empty:
     layout["legend"] = dict(orientation="h", y=1.08, font=dict(size=11))
     layout["xaxis"]["tickformat"] = "%H:%M"
     fig_heat.update_layout(**layout)
-    st.plotly_chart(fig_heat, width='stretch', config={"displayModeBar": False})
     heat_hdr, heat_dl = st.columns([1, 1])
     with heat_hdr:
@@ -1062,7 +1062,7 @@ if not heatmap_data.empty:
     with heat_dl:
         csv_download(heatmap_data.rename(columns={"bucket": "time_bucket"}), "Download CSV", "sentiment_heatmap.csv")
     if show_heat_data:
-        st.dataframe(heatmap_data.rename(columns={"bucket": "time_bucket"}), width='stretch', hide_index=True)
     st.markdown('</div>', unsafe_allow_html=True)
 else:
     st.info("Not enough timestamped data for heatmap yet.")
@@ -1105,7 +1105,7 @@ fig_topic = go.Figure(go.Bar(
     hovertemplate="<b>%{x}</b><br>Count: %{y}<extra></extra>",
 ))
 fig_topic.update_layout(**plotly_layout(250))
-st.plotly_chart(fig_topic, width='stretch', config={"displayModeBar": False})
 topic_hdr, topic_dl = st.columns([1, 1])
 with topic_hdr:
     show_topic_data = st.checkbox("View data", key="show_topic")
@@ -1113,7 +1113,7 @@ with topic_dl:
     topic_df = pd.DataFrame({"Topic": TOPIC_LABELS, "Count": [topic_counts[l] for l in TOPIC_LABELS]})
     csv_download(topic_df, "Download CSV", "topic_distribution.csv")
 if show_topic_data:
-    st.dataframe(topic_df, width='stretch', hide_index=True)
 st.markdown('</div>', unsafe_allow_html=True)
@@ -1208,7 +1208,7 @@ if contributors:
         layout_lb["xaxis"]["range"] = [0, 100]
         layout_lb["xaxis"]["ticksuffix"] = "%"
         fig_lb.update_layout(**layout_lb)
-        st.plotly_chart(fig_lb, width='stretch', config={"displayModeBar": False})
     contrib_df = pd.DataFrame(contributors)
     csv_download(contrib_df, "Download CSV", "top_contributors.csv")
@@ -1248,7 +1248,7 @@ if word_freq:
         ).generate_from_frequencies(freq_dict)
         st.markdown('<div class="chart-wrap">', unsafe_allow_html=True)
-        st.image(wc.to_array(), width="stretch")
         st.markdown('</div>', unsafe_allow_html=True)
         top20 = word_freq[:20]
@@ -1261,7 +1261,7 @@ if word_freq:
         ))
         layout_wf = plotly_layout(180)
         fig_wf.update_layout(**layout_wf)
-        st.plotly_chart(fig_wf, width='stretch', config={"displayModeBar": False})
     except ImportError:
         top20 = word_freq[:20]
@@ -1272,7 +1272,7 @@ if word_freq:
             marker_line_width=0,
         ))
         fig_wf.update_layout(**plotly_layout(200))
-        st.plotly_chart(fig_wf, width='stretch', config={"displayModeBar": False})
 else:
     st.info("Not enough text data yet.")
@@ -1328,7 +1328,7 @@ if len(active_streams) > 1:
                     f'Stream {slabel} — {stream["redis_key"]}</span>',
                     unsafe_allow_html=True
                 )
-                st.plotly_chart(fig, width='stretch', config={"displayModeBar": False})
                 st.markdown(
                     f'<div style="font-size:0.78rem;color:var(--text-3);margin-bottom:8px;">'
                     f'{t} msgs · <span style="color:#22c55e;">{p/t*100:.1f}% pos</span> · '
@@ -1363,7 +1363,7 @@ if len(active_streams) > 1:
     layout_ov["legend"] = dict(orientation="h", y=1.1, font=dict(size=11))
     layout_ov["yaxis"]["range"] = [0, 100]
     fig_overlay.update_layout(**layout_ov)
-    st.plotly_chart(fig_overlay, width='stretch', config={"displayModeBar": False})
     st.markdown('</div>', unsafe_allow_html=True)
 elif len(st.session_state.streams) > 1:
@@ -1483,3 +1483,5 @@ for i, (_, row) in enumerate(filtered.iloc[::-1].iterrows()):
 if auto_refresh:
     time.sleep(refresh_rate)
     st.rerun()

+# -*- coding: utf-8 -*-
 """
 app.py — Hugging Face Spaces adaptation of frontend/streamlit_app.py
 All features identical; infrastructure layer uses in-memory deque store
         sc1, sc2 = st.columns(2)
         with sc1:
+            if st.button("▶ Start", key=f"start_{idx}"):
                 vid  = extract_video_id(st.session_state[vid_skey])
                 rkey = st.session_state[rkey_skey].strip() or f"chat_messages_{label.lower()}"
                 if vid:
                 else:
                     st.error("Invalid video ID or URL")
         with sc2:
+            if st.button("⏹ Stop", key=f"stop_{idx}"):
                 if is_scraper_running(idx):
                     stop_scraper(idx)
                     st.session_state.streams[idx]["proc"] = None
     add_col, rem_col = st.columns(2)
     with add_col:
         if len(st.session_state.streams) < MAX_STREAMS:
+            if st.button("＋ Add stream"):
                 n = len(st.session_state.streams)
                 st.session_state.streams.append({
                     "video_id":  "",
                 st.rerun()
     with rem_col:
         if len(st.session_state.streams) > 1:
+            if st.button("－ Remove last"):
                 removed = st.session_state.streams.pop()
                 removed_idx = len(st.session_state.streams)
                 stop_scraper(removed_idx)
     st.markdown('<p style="font-size:0.68rem;font-weight:700;color:var(--accent);text-transform:uppercase;letter-spacing:0.1em;margin-bottom:8px;">Pinned Messages</p>', unsafe_allow_html=True)
     pin_count = len(st.session_state.pinned_messages)
     st.markdown(f'<div style="font-size:0.78rem;color:var(--text-3);">{pin_count} message{"s" if pin_count != 1 else ""} pinned</div>', unsafe_allow_html=True)
+    if pin_count > 0 and st.button("🗑 Clear pins"):
         st.session_state.pinned_messages = []
         st.rerun()
     st.divider()
     # ── Danger Zone ──
     st.markdown('<p style="font-size:0.68rem;font-weight:700;color:#ef4444;text-transform:uppercase;letter-spacing:0.1em;margin-bottom:8px;">Danger Zone</p>', unsafe_allow_html=True)
+    if st.button("🗑 Clear all data"):
         for s in st.session_state.streams:
             store_delete(s["redis_key"])
         st.session_state.pinned_messages = []
         hovertemplate="<b>%{x}</b><br>Count: %{y}<extra></extra>",
     ))
     fig_bar.update_layout(**plotly_layout(260))
+    st.plotly_chart(fig_bar, config={"displayModeBar": False})
     bar_hdr, bar_dl = st.columns([1, 1])
     with bar_hdr:
         show_bar_data = st.checkbox("View data", key="show_bar")
         bar_df = pd.DataFrame({"Sentiment": ["Positive", "Neutral", "Negative"], "Count": [pos, neu, neg]})
         csv_download(bar_df, "Download CSV", "sentiment_distribution.csv")
     if show_bar_data:
+        st.dataframe(bar_df, hide_index=True)
     st.markdown('</div>', unsafe_allow_html=True)
 with col_r:
            "showlegend": True,
            "legend": dict(orientation="h", y=-0.08, font=dict(size=11))}
     )
+    st.plotly_chart(fig_pie, config={"displayModeBar": False})
     pie_hdr, pie_dl = st.columns([1, 1])
     with pie_hdr:
         show_pie_data = st.checkbox("View data", key="show_pie")
         })
         csv_download(pie_df, "Download CSV", "sentiment_breakdown.csv")
     if show_pie_data:
+        st.dataframe(pie_df, hide_index=True)
     st.markdown('</div>', unsafe_allow_html=True)
 # ── Confidence trend ──────────────────────────────────────────
     ))
     fig_line.update_layout(**plotly_layout(180))
     fig_line.update_yaxes(range=[0, 1])
+    st.plotly_chart(fig_line, config={"displayModeBar": False})
     conf_hdr, conf_dl = st.columns([1, 1])
     with conf_hdr:
         show_conf_data = st.checkbox("View data", key="show_conf")
         conf_export.columns = ["message_index", "confidence"]
         csv_download(conf_export, "Download CSV", "confidence_trend.csv")
     if show_conf_data:
+        st.dataframe(conf_export, hide_index=True)
     st.markdown('</div>', unsafe_allow_html=True)
     layout["legend"] = dict(orientation="h", y=1.08, font=dict(size=11))
     layout["xaxis"]["tickformat"] = "%H:%M"
     fig_heat.update_layout(**layout)
+    st.plotly_chart(fig_heat, config={"displayModeBar": False})
     heat_hdr, heat_dl = st.columns([1, 1])
     with heat_hdr:
     with heat_dl:
         csv_download(heatmap_data.rename(columns={"bucket": "time_bucket"}), "Download CSV", "sentiment_heatmap.csv")
     if show_heat_data:
+        st.dataframe(heatmap_data.rename(columns={"bucket": "time_bucket"}), hide_index=True)
     st.markdown('</div>', unsafe_allow_html=True)
 else:
     st.info("Not enough timestamped data for heatmap yet.")
     hovertemplate="<b>%{x}</b><br>Count: %{y}<extra></extra>",
 ))
 fig_topic.update_layout(**plotly_layout(250))
+st.plotly_chart(fig_topic, config={"displayModeBar": False})
 topic_hdr, topic_dl = st.columns([1, 1])
 with topic_hdr:
     show_topic_data = st.checkbox("View data", key="show_topic")
     topic_df = pd.DataFrame({"Topic": TOPIC_LABELS, "Count": [topic_counts[l] for l in TOPIC_LABELS]})
     csv_download(topic_df, "Download CSV", "topic_distribution.csv")
 if show_topic_data:
+    st.dataframe(topic_df, hide_index=True)
 st.markdown('</div>', unsafe_allow_html=True)
         layout_lb["xaxis"]["range"] = [0, 100]
         layout_lb["xaxis"]["ticksuffix"] = "%"
         fig_lb.update_layout(**layout_lb)
+        st.plotly_chart(fig_lb, config={"displayModeBar": False})
     contrib_df = pd.DataFrame(contributors)
     csv_download(contrib_df, "Download CSV", "top_contributors.csv")
         ).generate_from_frequencies(freq_dict)
         st.markdown('<div class="chart-wrap">', unsafe_allow_html=True)
+        st.image(wc.to_array())
         st.markdown('</div>', unsafe_allow_html=True)
         top20 = word_freq[:20]
         ))
         layout_wf = plotly_layout(180)
         fig_wf.update_layout(**layout_wf)
+        st.plotly_chart(fig_wf, config={"displayModeBar": False})
     except ImportError:
         top20 = word_freq[:20]
             marker_line_width=0,
         ))
         fig_wf.update_layout(**plotly_layout(200))
+        st.plotly_chart(fig_wf, config={"displayModeBar": False})
 else:
     st.info("Not enough text data yet.")
                     f'Stream {slabel} — {stream["redis_key"]}</span>',
                     unsafe_allow_html=True
                 )
+                st.plotly_chart(fig, config={"displayModeBar": False})
                 st.markdown(
                     f'<div style="font-size:0.78rem;color:var(--text-3);margin-bottom:8px;">'
                     f'{t} msgs · <span style="color:#22c55e;">{p/t*100:.1f}% pos</span> · '
     layout_ov["legend"] = dict(orientation="h", y=1.1, font=dict(size=11))
     layout_ov["yaxis"]["range"] = [0, 100]
     fig_overlay.update_layout(**layout_ov)
+    st.plotly_chart(fig_overlay, config={"displayModeBar": False})
     st.markdown('</div>', unsafe_allow_html=True)
 elif len(st.session_state.streams) > 1:
 if auto_refresh:
     time.sleep(refresh_rate)
     st.rerun()

requirements.txt CHANGED Viewed

@@ -1,3 +1,4 @@
 # Core ML
 torch>=2.0.0
 transformers>=4.38.0
@@ -16,3 +17,8 @@ pandas>=2.0.0
 plotly>=5.18.0
 wordcloud>=1.9.3
 matplotlib>=3.8.0

+<<<<<<< HEAD
 # Core ML
 torch>=2.0.0
 transformers>=4.38.0
 plotly>=5.18.0
 wordcloud>=1.9.3
 matplotlib>=3.8.0
+=======
+altair
+pandas
+streamlit
+>>>>>>> 58cbb3ad16a724133db9fe31bffce8783a85648a

src/streamlit_app.py ADDED Viewed

	@@ -0,0 +1,40 @@

+import altair as alt
+import numpy as np
+import pandas as pd
+import streamlit as st
+"""
+# Welcome to Streamlit!
+Edit `/streamlit_app.py` to customize this app to your heart's desire :heart:.
+If you have any questions, checkout our [documentation](https://docs.streamlit.io) and [community
+forums](https://discuss.streamlit.io).
+In the meantime, below is an example of what you can do with just a few lines of code:
+"""
+num_points = st.slider("Number of points in spiral", 1, 10000, 1100)
+num_turns = st.slider("Number of turns in spiral", 1, 300, 31)
+indices = np.linspace(0, 1, num_points)
+theta = 2 * np.pi * num_turns * indices
+radius = indices
+x = radius * np.cos(theta)
+y = radius * np.sin(theta)
+df = pd.DataFrame({
+    "x": x,
+    "y": y,
+    "idx": indices,
+    "rand": np.random.randn(num_points),
+})
+st.altair_chart(alt.Chart(df, height=700, width=700)
+    .mark_point(filled=True)
+    .encode(
+        x=alt.X("x", axis=None),
+        y=alt.Y("y", axis=None),
+        color=alt.Color("idx", legend=None, scale=alt.Scale()),
+        size=alt.Size("rand", legend=None, scale=alt.Scale(range=[1, 150])),
+    ))