Spaces:

cmu-adcs
/

videogenic

Runtime error

App Files Files Community

chuanenlin commited on Aug 12, 2022

Commit

5a11d0a

1 Parent(s): f58a6c0

Upload files

Browse files

Files changed (44) hide show

.DS_Store +0 -0
.gitattributes +3 -0
README.md +5 -7
files/.DS_Store +0 -0
files/skydiving.npy +3 -0
files/skydiving_features.npy +3 -0
files/surfing.npy +3 -0
files/surfing_features.npy +3 -0
music/.DS_Store +0 -0
music/and-it-sounds-like.mp3 +3 -0
music/and-it-went-like.mp3 +3 -0
music/comfort-chain.mp3 +3 -0
music/coming-in-hot.mp3 +3 -0
music/loop.mp3 +3 -0
music/lovewave.mp3 +3 -0
music/ready-set.mp3 +3 -0
music/sheesh.mp3 +3 -0
music/thinking-out-loud.mp3 +3 -0
photos/.DS_Store +0 -0
photos/skydiving/AdobeStock_10001953_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_120216166_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_138896480_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_166023598_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_279780585_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_33345390_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_348814707_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_350837731_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_7005042_Preview.jpeg +3 -0
photos/skydiving/AdobeStock_96129011_Preview.jpeg +3 -0
photos/surfing/AdobeStock_185663731_Preview.jpeg +3 -0
photos/surfing/AdobeStock_211437413_Preview.jpeg +3 -0
photos/surfing/AdobeStock_220162637_Preview.jpeg +3 -0
photos/surfing/AdobeStock_220164473_Preview.jpeg +3 -0
photos/surfing/AdobeStock_328826367_Preview.jpeg +3 -0
photos/surfing/AdobeStock_415484898_Preview.jpeg +3 -0
photos/surfing/AdobeStock_46444136_Preview.jpeg +3 -0
photos/surfing/AdobeStock_495442848_Preview.jpeg +3 -0
photos/surfing/AdobeStock_54024377_Preview.jpeg +3 -0
photos/surfing/AdobeStock_70293058_Preview.jpeg +3 -0
requirements.txt +11 -0
videogenic.py +607 -0
videos/.DS_Store +0 -0
videos/skydiving.mp4 +3 -0
videos/surfing.mp4 +3 -0

.DS_Store ADDED Viewed

Binary file (8.2 kB). View file

.gitattributes CHANGED Viewed

@@ -29,3 +29,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.mp3 filter=lfs diff=lfs merge=lfs -text
+*.mp4 filter=lfs diff=lfs merge=lfs -text
+*.jpeg filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -1,12 +1,10 @@
 ---
 title: Videogenic
-emoji: 📉
-colorFrom: blue
-colorTo: red
 sdk: streamlit
-sdk_version: 1.10.0
-app_file: app.py
 pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
 title: Videogenic
+emoji: ✨
+colorFrom: purple
+colorTo: pink
 sdk: streamlit
+sdk_version: 1.11.0
+app_file: videogenic.py
 pinned: false
 ---

files/.DS_Store ADDED Viewed

Binary file (6.15 kB). View file

files/skydiving.npy ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:62d9ee5c22ab5d17a4713ff64796d545b81bcfd40cb7d238cc7228434e5b8f3e
+size 870930480

files/skydiving_features.npy ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a0ec76c93a1b5b4fd293ddd66340a9f6aff9f62a5c5adecfcfbbdd20f66de310
+size 645248

files/surfing.npy ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d246ebd959b39c63681a1ddd61912a5866b91990b01d31dcbde00a6db4e37036
+size 859871040

files/surfing_features.npy ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:20cb9e6d3818cc93280d14b53ff31c98c7eb4d09968d8e1e8992eae7e597a3c6
+size 637056

music/.DS_Store ADDED Viewed

Binary file (6.15 kB). View file

music/and-it-sounds-like.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1147e33873dd2ddfb8414c50e9b91fd6f88813a8fd4493a66d4d45d21107bb61
+size 287354

music/and-it-went-like.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c9ea37f421d2ebea3a35876e3b3a710496aa0c80adbeea6152169adef29cba90
+size 297360

music/comfort-chain.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fd12feddd5ca89f8b3ac5cc5332d0f9eae6287fe10040b57225de75a67ac3b63
+size 295632

music/coming-in-hot.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:25d27fa1d10ed3ed59a2a6ed706a9df319645a0b3208066fcabd2225c2f67814
+size 229719

music/loop.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9655e2b062b6aaf54e709b2c6e53f7774686fd04f93629730bcc95946696e3d9
+size 61269030

music/lovewave.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7d3b788ea8a983b36f01cd31a765e840e1bff906d84e16efd7b641351021a8d6
+size 219936

music/ready-set.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6d11d286777853692d0ce7f88306af9e119335d9ddc40dd9dea4ee03361fb7ba
+size 380324

music/sheesh.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e370d0a9f0e261cdd2e0fbfece30a2a35b5c23e30b11964d812bfb615a017c90
+size 317043

music/thinking-out-loud.mp3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0839c8ced02007f098b2295c177ecb8d0335d55dbec77ae33bab06de9346afea
+size 134543

photos/.DS_Store ADDED Viewed

Binary file (6.15 kB). View file

photos/skydiving/AdobeStock_10001953_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 0a315ef51f608d01d67a911edb137c0c225af197b59ddeaf20b98ed114bac319
Pointer size: 131 Bytes
Size of remote file: 147 kB

photos/skydiving/AdobeStock_120216166_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 72925aae1b640e4ec4d3de9af99135f8f6081f400af126af6acbacfbf7c9fdd0
Pointer size: 131 Bytes
Size of remote file: 234 kB

photos/skydiving/AdobeStock_138896480_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 564303243268fff80ffb21421421b686ef488b5ae4cfbd24bb21d243f819ef19
Pointer size: 131 Bytes
Size of remote file: 228 kB

photos/skydiving/AdobeStock_166023598_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: bd586ef2ad1421067e8a6d4d0179b7891782c456bd6ea546d78afb83ba0e532f
Pointer size: 131 Bytes
Size of remote file: 230 kB

photos/skydiving/AdobeStock_279780585_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 94c8d8c4d9ea258aaf3a5fc9ecb9efd5f8cc2a12dfef69b3ae21893155cf4c99
Pointer size: 131 Bytes
Size of remote file: 275 kB

photos/skydiving/AdobeStock_33345390_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 7f10b904e67ab0398c0bd633ebdb5061627b0a3d3aaa1e56b57d4663af05240d
Pointer size: 131 Bytes
Size of remote file: 229 kB

photos/skydiving/AdobeStock_348814707_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 555d3e22eca3dafd8de0266252c288823925b28b0f830a2fcb5728f6eaf5e6a6
Pointer size: 131 Bytes
Size of remote file: 206 kB

photos/skydiving/AdobeStock_350837731_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 63e3513159602dc7e24bfb023a8d810dd2d24e5f230efaeb399b9d52d81b4bab
Pointer size: 131 Bytes
Size of remote file: 109 kB

photos/skydiving/AdobeStock_7005042_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 6675396bfc2ccd8006ff54efa9ff9428ecd36912ca223d530facb9dde8579acd
Pointer size: 131 Bytes
Size of remote file: 312 kB

photos/skydiving/AdobeStock_96129011_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: eff50fd1d4bdc12ef4c6c2eb1e4b15c7d0b096b36af2869ad583b1257b72ea45
Pointer size: 131 Bytes
Size of remote file: 168 kB

photos/surfing/AdobeStock_185663731_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: c8bd75ca95b75641bff10d2cb8327dde4073416e1af66183f2dab6767422d90f
Pointer size: 131 Bytes
Size of remote file: 427 kB

photos/surfing/AdobeStock_211437413_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: eab1a3c219bf7d6da517a302605f40cd932d06f4206907b7ad115f7000f00cad
Pointer size: 131 Bytes
Size of remote file: 312 kB

photos/surfing/AdobeStock_220162637_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: b60eebd3b8b2c3d02344e0656a5dc57259e2296b369deb941f5da89c00b31551
Pointer size: 131 Bytes
Size of remote file: 316 kB

photos/surfing/AdobeStock_220164473_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: da3ef19dcccbb8b61a605f83cafb0a803bd0a0aed2e9d638084584758d5a9bc3
Pointer size: 131 Bytes
Size of remote file: 391 kB

photos/surfing/AdobeStock_328826367_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: ad051a7a2adf85234d1e56dab267033e9d8c962c0430ab208595dc44bb96ed7e
Pointer size: 131 Bytes
Size of remote file: 295 kB

photos/surfing/AdobeStock_415484898_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 52917b362d75a89d79337ad84eae406d1900b84a233fd4dc6b43b66e1abb169d
Pointer size: 131 Bytes
Size of remote file: 248 kB

photos/surfing/AdobeStock_46444136_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 85986bbdf57872395a11011acb44a40fc41b6febd6de2be6317f225bb78e4995
Pointer size: 131 Bytes
Size of remote file: 477 kB

photos/surfing/AdobeStock_495442848_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: b1fcdf9c2a1708c2435a3be71bd097d2c3dbeb0428e4f414a013457306ef8800
Pointer size: 131 Bytes
Size of remote file: 399 kB

photos/surfing/AdobeStock_54024377_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 5df855e201893f1d751d8c3b5cf1c5a2fe3f8ac11061d300ce58d70cf5325ebb
Pointer size: 131 Bytes
Size of remote file: 440 kB

photos/surfing/AdobeStock_70293058_Preview.jpeg ADDED Viewed

Git LFS Details

SHA256: 2838f0b9f681254fe0a66faf1d932ff5ade35a770d3c150fed7f738ef8012629
Pointer size: 131 Bytes
Size of remote file: 328 kB

requirements.txt ADDED Viewed

	@@ -0,0 +1,11 @@

+streamlit
+streamlit_vega_lite
+opencv-python
+Pillow
+torch
+numpy
+decord
+moviepy
+altair
+pandas
+glob2

videogenic.py ADDED Viewed

	@@ -0,0 +1,607 @@

+import streamlit as st
+# from pytube import YouTube
+# from pytube import extract
+import cv2
+from PIL import Image
+import clip as openai_clip
+import torch
+import math
+import numpy as np
+import tempfile
+# from humanfriendly import format_timespan
+import json
+import sys
+from random import randrange
+import logging
+# from pyunsplash import PyUnsplash
+import requests
+import io
+from io import BytesIO
+import base64
+import altair as alt
+from streamlit_vega_lite import altair_component
+import pandas as pd
+from datetime import timedelta
+import math
+from decord import VideoReader, cpu, gpu
+from moviepy.video.io.VideoFileClip import VideoFileClip
+from moviepy.audio.io.AudioFileClip import AudioFileClip
+from moviepy.video.io.ffmpeg_tools import ffmpeg_extract_subclip
+from moviepy.editor import *
+import glob
+def fetch_video(url):
+  yt = YouTube(url)
+  streams = yt.streams.filter(adaptive=True, subtype='mp4', resolution='360p', only_video=True)
+  length = yt.length
+  if length >= 300:
+    st.error('Please find a YouTube video shorter than 5 minutes. Sorry about this, the server capacity is limited for the time being.')
+    st.stop()
+  video = streams[0]
+  return video, video.url
+# @st.cache()
+# def extract_frames(video):
+#   frames = []
+#   capture = cv2.VideoCapture(video)
+#   fps = capture.get(cv2.CAP_PROP_FPS)
+#   current_frame = 0
+#   while capture.isOpened():
+#     ret, frame = capture.read()
+#     if ret == True:
+#       frames.append(Image.fromarray(frame[:, :, ::-1]))
+#     else:
+#       break
+#     current_frame += fps
+#     capture.set(cv2.CAP_PROP_POS_FRAMES, current_frame)
+#   # print(f'Frames extracted: {len(frames)}')
+#   return frames, fps
+# @st.cache()
+def video_to_frames(video):
+  vr = VideoReader(video)
+  frames = []
+  frame_count = len(vr)
+  fps = vr.get_avg_fps()
+  for i in range(0, frame_count, int(fps)):
+  # for i in range(0, frame_count):
+    frame = vr[i].asnumpy()
+    y_dim = frame.shape[0]
+    x_dim = frame.shape[1]
+    frames.append(Image.fromarray(frame))
+  return frames, fps, x_dim, y_dim
+def video_to_info(video):
+  vr = VideoReader(video)
+  frames = []
+  frame_count = len(vr)
+  fps = vr.get_avg_fps()
+  frame = vr[0].asnumpy()
+  y_dim = frame.shape[0]
+  x_dim = frame.shape[1]
+  return fps, x_dim, y_dim
+# @st.cache()
+def encode_frames(video_frames):
+  batch_size = 256
+  batches = math.ceil(len(video_frames) / batch_size)
+  video_features = torch.empty([0, 512], dtype=torch.float16).to(st.session_state.device)
+  for i in range(batches):
+    batch_frames = video_frames[i*batch_size : (i+1)*batch_size]
+    batch_preprocessed = torch.stack([st.session_state.preprocess(frame) for frame in batch_frames]).to(st.session_state.device)
+    with torch.no_grad():
+      batch_features = st.session_state.model.encode_image(batch_preprocessed)
+      batch_features /= batch_features.norm(dim=-1, keepdim=True)
+    video_features = torch.cat((video_features, batch_features))
+  # print(f'Features: {video_features.shape}')
+  return video_features
+def classify_activity(video_features, activities_list):
+	text = torch.cat([openai_clip.tokenize(
+		f'{activity}') for activity in activities_list]).to(st.session_state.device)
+	with torch.no_grad():
+		text_features = st.session_state.model.encode_text(text)
+		text_features /= text_features.norm(dim=-1, keepdim=True)
+	logit_scale = st.session_state.model.logit_scale.exp()
+	video_features = torch.from_numpy(video_features)
+	similarities = (logit_scale * video_features @
+                 text_features.t()).softmax(dim=-1)
+	probs, word_idxs = similarities[0].topk(5)
+	primary_activity = []
+	for prob, word_idx in zip(probs, word_idxs):
+		primary_activity.append(activities_list[word_idx])
+	# primary_activity = activities_list[word_idx]
+	return primary_activity
+def encode_photos(photos):
+  batch_size = 256
+  batches = math.ceil(len(photos) / batch_size)
+  video_features = torch.empty([0, 512], dtype=torch.float16).to(st.session_state.device)
+  for i in range(batches):
+    batch_frames = photos[i*batch_size : (i+1)*batch_size]
+    batch_preprocessed = torch.stack([st.session_state.preprocess(Image.open(frame)) for frame in batch_frames]).to(st.session_state.device)
+    with torch.no_grad():
+      batch_features = st.session_state.model.encode_image(batch_preprocessed)
+      batch_features /= batch_features.norm(dim=-1, keepdim=True)
+    video_features = torch.cat((video_features, batch_features))
+  # print(f'Features: {video_features.shape}')
+  return video_features
+def img_to_bytes(img):
+  img_byte_arr = io.BytesIO()
+  img.save(img_byte_arr, format='JPEG')
+  img_byte_arr = img_byte_arr.getvalue()
+  return img_byte_arr
+def normalize(vector):
+  return (vector - np.min(vector)) / (np.max(vector) - np.min(vector))
+def format_img(img):
+  size = 150, 150
+  # img = Image.fromarray(img)
+  img.thumbnail(size, Image.Resampling.LANCZOS)
+  output = io.BytesIO()
+  img.save(output, format='PNG')
+  encoded_string = f'data:image/png;base64,{base64.b64encode(output.getvalue()).decode()}'
+  return encoded_string
+def get_photos(keyword):
+  photo_collection = []
+  for filename in glob.glob(f'photos/{st.session_state.domain.lower()}/*.jpeg'):
+    photo = Image.open(filename)
+    photo_collection.append(photo)
+  return photo_collection
+  # # api_key = 'hzcKZ0e4we95wSd8_ip2zTB3m2DrOMWehAxrYjqjwg0'
+  # api_key = 'fZ1nE7Y4NC-iYGmqgv-WuyM8m9p0LroCdAOZOR6tyho'
+  # unsplash_search = PyUnsplash(api_key=api_key)
+  # logging.getLogger('pyunsplash').setLevel(logging.DEBUG)
+  # search = unsplash_search.search(type_='photos', query=keyword) # per_page
+  # photo_collection = []
+  # # st.markdown(f'**Unsplash photos for `{keyword}`**')
+  # for result in search.entries:
+  #   photo_url = result.link_download
+  #   response = requests.get(photo_url)
+  #   photo = Image.open(BytesIO(response.content))
+  #   # st.image(photo, width=200)
+  #   photo_collection.append(photo)
+  # return photo_collection
+def display_results(best_photo_idx):
+  st.markdown('**Top 10 highlights**')
+  result_arr = []
+  for frame_id in best_photo_idx:
+    result = st.session_state.video_frames[frame_id]
+    st.image(result)
+  return result_arr
+def make_df(similarities):
+  similarities = similarities
+  df = pd.DataFrame()
+  df['keyword'] = [keyword] * len(similarities)
+  df['x'] = [i for i, _ in enumerate(similarities)]
+  df['y'] = normalize(np.power(similarities, 8))
+  df['image'] = [format_img(frame) for frame in st.session_state.video_frames]
+  return df
+# @st.cache()
+def compute_scores(search_query, video_features, text_query, display_results_count=10):
+  sum_photo = torch.zeros(1, 512)
+  for photo in search_query:
+    with torch.no_grad():
+      image_features = st.session_state.model.encode_image(st.session_state.preprocess(photo).unsqueeze(0).to(st.session_state.device))
+      image_features /= image_features.norm(dim=-1, keepdim=True)
+      sum_photo += sum_photo + image_features
+  avg_photo = sum_photo / len(search_query)
+  video_features = torch.from_numpy(video_features)
+  similarities = (100.0 * video_features @ avg_photo.T)
+  # values, best_photo_idx = similarities.topk(display_results_count, dim=0)
+  # display_results(best_photo_idx)
+  return similarities.cpu().numpy()
+def avenir():
+    font = 'Avenir'
+    return {
+        'config' : {
+             'title': {'font': font},
+             'axis': {
+                  'labelFont': font,
+                  'titleFont': font
+             }
+        }
+    }
+alt.themes.register('avenir', avenir)
+alt.themes.enable('avenir')
+# TODO: Make playhead scores and average according to keyword
+# TODO: Maximum interval selection
+# TODO: Interactive legend https://altair-viz.github.io/gallery/interactive_legend.html
+# TODO: Multi-line highlight https://altair-viz.github.io/gallery/multiline_highlight.html
+@st.cache
+def draw_chart(df, mode):
+  if st.session_state.mode == 'Automatic':
+    nearest = alt.selection(type='single', nearest=True, on='mouseover', empty='none')
+    line = alt.Chart(df).mark_line().encode(
+      x=alt.X('x:Q', axis=alt.Axis(labels=True, tickSize=0, title='')),
+      y=alt.Y('y', axis=alt.Axis(labels=False, tickSize=0, title='')),
+      # color=alt.Color('keyword:N', scale=alt.Scale(scheme='tableau20')),
+      color=alt.value('#00C7BE'),
+      # color=alt.Color('#9b59b6'),
+    )
+    selectors = alt.Chart(df).mark_point().encode(
+      x='x:Q',
+      opacity=alt.value(0),
+    ).add_selection(
+        nearest
+    )
+    rules = alt.Chart(df).mark_rule(color='black').encode(
+      x='x:Q',
+    ).transform_filter(
+      nearest
+    )
+    points = line.mark_point().encode(
+      opacity=alt.condition(nearest, alt.value(1), alt.value(0))
+    )
+    text = line.mark_text(align='center', yOffset=-110, fontSize=16).encode(
+      text=alt.condition(nearest, 'y:N', alt.value(' ')),
+      color=alt.value('#000000'),
+      # fontSize=30
+    ).transform_calculate(y=f'format(datum.y, ".2f")')
+    image = line.mark_image(align='center', width=150, height=150, yOffset=-60).encode(
+      url=alt.condition(nearest, 'image', alt.value(' '))
+    )
+    chart = alt.layer(line, selectors, points, rules, text, image)
+  elif st.session_state.mode == 'brush':
+    brush = alt.selection(type='interval', encodings=['x'])
+    line = alt.Chart(df).mark_line().encode( # https://www.rdocumentation.org/packages/vegalite/versions/0.6.1/topics/mark_line
+      x=alt.X('x:Q', axis=alt.Axis(labels=True, tickSize=0, title='')),
+      y=alt.Y('y:Q', axis=alt.Axis(labels=False, tickSize=0, title='')),
+      # color=alt.Color('keyword:N', scale=alt.Scale(scheme='tableau20')),
+      color=alt.value('#00C7BE'),
+    ).add_selection(
+      brush
+    )
+    text = alt.Chart(df).transform_filter(brush).mark_text(
+      align='right',
+      # baseline='top',
+      # dx=1500
+      dx=750,
+      dy=-12,
+      fontSize=24,
+      fontWeight=800,
+    ).encode(
+      # x='max(x):Q',
+      y='mean(y):Q',
+      # dy=alt.value(10),
+      text=alt.Text('mean(y):Q', format='.2f'),
+    )
+    average = alt.Chart(df).mark_rule(color='black', strokeDash=[5, 5]).encode(
+      y='mean(y):Q',
+      # size=alt.SizeValue(3),
+    ).transform_filter(
+      brush
+    )
+    # chart = alt.layer(line, average, text)
+    chart = line
+  elif st.session_state.mode == 'User selection':
+    brush = alt.selection(type='interval', encodings=['x'])
+    line = alt.Chart(df).mark_line().encode( # https://www.rdocumentation.org/packages/vegalite/versions/0.6.1/topics/mark_line
+      x=alt.X('x:Q', axis=alt.Axis(labels=True, tickSize=0, title='')),
+      y=alt.Y('y:Q', axis=alt.Axis(labels=False, tickSize=0, title='')),
+      # color=alt.Color('keyword:N', scale=alt.Scale(scheme='tableau20')),
+      color=alt.value('#00C7BE'),
+    ).add_selection(
+      brush
+    )
+    text = alt.Chart(df).transform_filter(brush).mark_text(
+      align='right',
+      # baseline='top',
+      # dx=1500
+      dx=750,
+      dy=-12,
+      fontSize=24,
+      fontWeight=800,
+    ).encode(
+      # x='max(x):Q',
+      y='mean(y):Q',
+      # dy=alt.value(10),
+      text=alt.Text('mean(y):Q', format='.2f'),
+    )
+    average = alt.Chart(df).mark_rule(color='black', strokeDash=[5, 5]).encode(
+      y='mean(y):Q',
+      # size=alt.SizeValue(3),
+    ).transform_filter(
+      brush
+    )
+    # chart = alt.layer(line, average, text)
+    chart = line
+  return chart.properties(width=1250, height=500).configure_axis(grid=False, domain=False).configure_view(strokeOpacity=0)
+  # return line
+def max_subarray(arr, k):
+  n = len(arr)
+  if (n < k):
+    st.write('Video too short')
+  res = 0
+  left = 0
+  right = k
+  for i in range(k):
+    res += arr[i]
+  curr_sum = res
+  for i in range(k, n):
+    curr_sum += arr[i] - arr[i - k]
+    if curr_sum > res:
+      res = curr_sum
+      left = i - k
+      right = i
+  return res, left, right
+def edit_video(template, df_all):
+  video_path = f'videos/{st.session_state.domain.lower()}.mp4'
+  if template == 'Coming In Hot by Andy Mineo & Lecrae (hype, 7 seconds)':
+    res, left, right = max_subarray(df_all['y'].tolist(), 7)
+    video = VideoFileClip(video_path).subclip(t_start=left, t_end=right)
+    fps = video.fps
+    x_dim = st.session_state.x_dim
+    y_dim = st.session_state.y_dim
+    music_path = 'music/coming-in-hot.mp3'
+    blank1 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.6)
+    flash1 = video.subclip(t_start=0, t_end=1.2)
+    blank2 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    flash2 = video.subclip(t_start=1.3, t_end=1.4)
+    blank3 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    flash3 = video.subclip(t_start=1.5, t_end=3.3)
+    blank4 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    flash4 = video.subclip(t_start=3.4, t_end=3.5)
+    blank5 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    flash5 = video.subclip(t_start=3.6, t_end=4.6)
+    blank6 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    flash6 = video.subclip(t_start=4.7, t_end=4.8)
+    blank7 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    highlight = video.subclip(t_start=4.9, t_end=6.384)
+    output = concatenate_videoclips([blank1, flash1, blank2, flash2, blank3, flash3, blank4, flash4, blank5, flash5, blank6, flash6, blank7, highlight])
+  elif template == 'Thinking Out Loud Cypher by Jermsego (hype, 8 seconds)':
+    res, left, right = max_subarray(df_all['y'].tolist(), 7)
+    video = VideoFileClip(video_path).subclip(t_start=left, t_end=right)
+    fps = video.fps
+    x_dim = st.session_state.x_dim
+    y_dim = st.session_state.y_dim
+    music_path = 'music/thinking-out-loud.mp3'
+    blank = ColorClip((x_dim, y_dim), (0, 0, 0), duration=1.6)
+    highlight = video.subclip(t_start=0, t_end=6.852)
+    output = concatenate_videoclips([blank, highlight])
+  elif template == 'Sheesh by Surfaces (upbeat, 10 seconds)':
+    res, left, right = max_subarray(df_all['y'].tolist(), 8)
+    video = VideoFileClip(video_path).subclip(t_start=left, t_end=right)
+    fps = video.fps
+    x_dim = st.session_state.x_dim
+    y_dim = st.session_state.y_dim
+    music_path = 'music/sheesh.mp3'
+    blank1 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=3.5)
+    flash1 = video.subclip(t_start=0, t_end=0.1)
+    blank2 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    flash2 = video.subclip(t_start=0.2, t_end=0.3)
+    blank3 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    flash3 = video.subclip(t_start=0.4, t_end=0.5)
+    blank4 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.1)
+    flash4 = video.subclip(t_start=0.6, t_end=0.7)
+    blank5 = ColorClip((x_dim, y_dim), (0, 0, 0), duration=0.9)
+    highlight = video.subclip(t_start=1.6, t_end=7.18408163265)
+    output = concatenate_videoclips([blank1, flash1, blank2, flash2, blank3, flash3, blank4, flash4, blank5, highlight])
+  elif template == 'Moon by Kid Francescoli (tranquil, 10 seconds)':
+    res, left, right = max_subarray(df_all['y'].tolist(), 9)
+    video = VideoFileClip(video_path).subclip(t_start=left, t_end=right)
+    fps = video.fps
+    x_dim = st.session_state.x_dim
+    y_dim = st.session_state.y_dim
+    music_path = 'music/and-it-went-like.mp3'
+    blank = ColorClip((x_dim, y_dim), (0, 0, 0), duration=1.9)
+    highlight = video.subclip(t_start=0, t_end=8.132)
+    output = concatenate_videoclips([blank, highlight])
+  elif template == 'Ready Set by Joey Valence & Brae (old school, 10 seconds)':
+    res, left, right = max_subarray(df_all['y'].tolist(), 11)
+    video = VideoFileClip(video_path).subclip(t_start=left, t_end=right)
+    fps = video.fps
+    x_dim = st.session_state.x_dim
+    y_dim = st.session_state.y_dim
+    music_path = 'music/ready-set.mp3'
+    highlight = video.subclip(t_start=0, t_end=10.512)
+    output = highlight
+  elif template == 'Lovewave by The 1-800 (tranquil, 13 seconds)':
+    res, left, right = max_subarray(df_all['y'].tolist(), 12)
+    video = VideoFileClip(video_path).subclip(t_start=left, t_end=right)
+    fps = video.fps
+    x_dim = st.session_state.x_dim
+    y_dim = st.session_state.y_dim
+    music_path = 'music/lovewave.mp3'
+    blank = ColorClip((x_dim, y_dim), (0, 0, 0), duration=2.1)
+    highlight = video.subclip(t_start=0, t_end=11.58)
+    output = concatenate_videoclips([blank, highlight])
+  elif template == 'And It Sounds Like by Forrest Nolan (tranquil, 17 seconds)':
+    res, left, right = max_subarray(df_all['y'].tolist(), 16)
+    video = VideoFileClip(video_path).subclip(t_start=left, t_end=right)
+    fps = video.fps
+    x_dim = st.session_state.x_dim
+    y_dim = st.session_state.y_dim
+    music_path = 'music/and-it-sounds-like.mp3'
+    blank = ColorClip((x_dim, y_dim), (0, 0, 0), duration=2)
+    highlight = video.subclip(t_start=0, t_end=15.928)
+    output = concatenate_videoclips([blank, highlight])
+  elif template == 'Comfort Chain by Instupendo (lofi, 18 seconds)':
+    res, left, right = max_subarray(df_all['y'].tolist(), 19)
+    video = VideoFileClip(video_path).subclip(t_start=left, t_end=right)
+    fps = video.fps
+    x_dim = st.session_state.x_dim
+    y_dim = st.session_state.y_dim
+    music_path = 'music/comfort-chain.mp3'
+    highlight = video.subclip(t_start=0, t_end=18.432000000000002)
+    output = highlight
+  # st.write(res, left, right)
+  song = AudioFileClip(music_path)
+  output = output.set_audio(song)
+  output.write_videofile('output.mp4', temp_audiofile='temp.m4a', remove_temp=True, audio_codec='aac', logger=None, fps=fps)
+  st.video('output.mp4')
+  # return output
+def crop_video(df_all, left, right):
+  video_path = f'videos/{st.session_state.domain.lower()}.mp4'
+  video = VideoFileClip(video_path)
+  fps = video.fps
+  music_path = 'music/loop.mp3'
+  song = AudioFileClip(music_path)
+  video = video.set_audio(song)
+  output = video.subclip(t_start=left, t_end=right)
+  output.write_videofile('output.mp4', temp_audiofile='temp.m4a', remove_temp=True, audio_codec='aac', logger=None, fps=fps)
+  st.video('output.mp4')
+  # return output
+st.set_page_config(page_title='Videogenic', page_icon = '✨', layout = 'wide', initial_sidebar_state = 'collapsed')
+hide_streamlit_style = """
+                      <style>
+                      #MainMenu {visibility: hidden;}
+                      footer {visibility: hidden;}
+                      * {font-family: Avenir; cursor: pointer;}
+                      .css-gma2qf {display: flex; justify-content: center; font-size: 42px; font-weight: bold;}
+                      a:link {text-decoration: none;}
+                      a:hover {text-decoration: none;}
+                      .st-ba {font-family: Avenir;}
+                      </style>
+                      """
+st.markdown(hide_streamlit_style, unsafe_allow_html=True)
+# clustrmaps = """
+#             <a href="https://clustrmaps.com/site/1bham" target="_blank" title="Visit tracker"><img src="//www.clustrmaps.com/map_v2.png?d=NhNk5g9hy6Y06nqo7RirhHvZSr89uSS8rPrt471wAXw&cl=ffffff" width="0" height="0"></a>
+#             """
+# st.markdown(clustrmaps, unsafe_allow_html=True)
+# ss = SessionState.get(url=None, id=None, input=None, file_name=None, video=None, video_name=None, video_frames=None, video_features=None, fps=None, mode=None, query=None, progress=1)
+st.title('Videogenic ✨')
+if 'progress' not in st.session_state:
+  st.session_state.progress = 1
+# mode = 'play'
+# mode = 'brush'
+# mode = 'select'
+if st.session_state.progress == 1:
+  device = 'cuda' if torch.cuda.is_available() else 'cpu'
+  model, preprocess = openai_clip.load('ViT-B/32', device=device)
+  if 'model' not in st.session_state:
+    st.session_state.model = model
+    st.session_state.preprocess = preprocess
+    st.session_state.device = device
+  st.session_state.model = model
+  st.session_state.preprocess = preprocess
+  st.session_state.device = device
+  domain = st.selectbox('Select video',('Skydiving', 'Surfing')) # Entire journey, montage, vlog
+  if 'domain' not in st.session_state:
+    st.session_state.domain = domain
+  st.session_state.domain = domain
+  if st.button('Process video'):
+    video_name = f'videos/{st.session_state.domain.lower()}.mp4'
+    video_file = open(video_name, 'rb')
+    video_bytes = video_file.read()
+    if 'video' not in st.session_state:
+      st.session_state.video = video_bytes
+    st.session_state.video = video_bytes
+    # st.video(st.session_state.video)
+    # video_frames, fps, x_dim, y_dim = video_to_frames(video_name) # first run; video_to_info
+    # np.save(f'files/{st.session_state.domain.lower()}.npy', video_frames)
+    fps, x_dim, y_dim = video_to_info(video_name)
+    video_frames = np.load(f'files/{st.session_state.domain.lower()}.npy', allow_pickle=True)
+    if 'video_frames' not in st.session_state:
+      st.session_state.video_frames = video_frames
+      st.session_state.fps = fps
+      st.session_state.x_dim = x_dim
+      st.session_state.y_dim = y_dim
+    st.session_state.video_frames = video_frames
+    st.session_state.fps = fps
+    st.session_state.x_dim = x_dim
+    st.session_state.y_dim = y_dim
+    print('Extracted frames')
+    # encoded_frames = encode_frames(video_frames) # first run
+    # np.save(f'files/{st.session_state.domain.lower()}_features.npy', encoded_frames)
+    encoded_frames = np.load(f'files/{st.session_state.domain.lower()}_features.npy', allow_pickle=True)
+    if 'video_features' not in st.session_state:
+      # st.session_state.video_features = encoded_frames
+      st.session_state.video_features = encoded_frames
+    st.session_state.video_features = encoded_frames
+    print('Encoded frames')
+    st.session_state.progress = 2
+# with open('activities.txt') as f:
+#   activities_list = [line.rstrip('\n') for line in f]
+# keywords = classify_activity(st.session_state.video_features, activities_list)
+# st.write(keywords)
+if st.session_state.progress == 2:
+  mode = st.radio('Select mode', ('Automatic', 'User selection'))
+  if 'mode' not in st.session_state:
+    st.session_state.mode = mode
+  st.session_state.mode = mode
+  # keywords = list(st.text_input('Enter topic').split(','))
+  # if st.button('Compute scores') and keywords is not None:
+  keyword = st.session_state.domain.lower()
+  df_list = []
+  # for keyword in keywords:
+  img_set = get_photos(keyword)
+  similarities = compute_scores(img_set, st.session_state.video_features, keyword)
+  # st.write(similarities)
+  df = make_df(similarities)
+  df_list.append(df)
+  df_all = pd.concat(df_list, ignore_index=True, sort=False)
+  if 'df_all' not in st.session_state:
+    st.session_state.df_all = df_all
+  st.session_state.df_all = df_all
+  # st.write(df_all)
+  # highlight_length = 7.033
+  # st.write(st.session_state.fps)
+  selection = altair_component(draw_chart(df_all, st.session_state.mode))
+  print(selection)
+# if '_vgsid_' in selection:
+#   # the ids start at 1
+#   st.write(df.iloc[[selection['_vgsid_'][0] - 1]])
+# else:
+#   st.info('Hover over the chart above to see details about the Penguin here.')
+  # if 'x' in selection:
+  #   # the ids start at 1
+  #   st.write(selection['x'])
+    # chart = draw_chart(df_all, mode)
+    # st.altair_chart(chart, use_container_width=False)
+    # st.session_state.progress = 3
+  # if st.session_state.progress == 3:
+  if st.session_state.mode == 'Automatic':
+    # template = st.selectbox('Select template', ['Coming In Hot by Andy Mineo & Lecrae (hype, 7 seconds)', 'Thinking Out Loud Cypher by Jermsego (hype, 8 seconds)', 'Sheesh by Surfaces (upbeat, 10 seconds)',
+    #                           'Moon by Kid Francescoli (tranquil, 10 seconds)', 'Ready Set by Joey Valence & Brae (old school, 10 seconds)', 'Lovewave by The 1-800 (tranquil, 13 seconds)',
+    #                           'And It Sounds Like by Forrest Nolan (tranquil, 17 seconds)', 'Comfort Chain by Instupendo (lofi, 18 seconds)'])
+    template = st.selectbox('Select template', ['Coming In Hot by Andy Mineo & Lecrae (hype, 7 seconds)', 'Sheesh by Surfaces (upbeat, 10 seconds)', 'Lovewave by The 1-800 (tranquil, 13 seconds)'])
+    if st.button('Generate video'):
+      edit_video(template, st.session_state.df_all)
+  elif st.session_state.mode == 'User selection':
+    if st.button('Generate video'):
+      left = selection['x'][0]
+      right = selection['x'][1]
+      crop_video(st.session_state.df_all, left, right)
+      # res, left, right = max_subarray(df_all['y'].tolist(), 8)
+      # if 'left' not in st.session_state:
+      #   st.session_state.left = left
+      #   st.session_state.right = right
+    # video_path = f'videos/{domain.lower()}.mp4'
+    # music_path = 'music/sheesh.wav'
+    # video = VideoFileClip(video_path).subclip(t_start=st.session_state.left, t_end=st.session_state.right)
+    # fps = video.fps
+    # x_dim = st.session_state.x_dim
+    # y_dim = st.session_state.y_dim
+    # song = AudioFileClip(music_path)
+    # output = edit_video(video, template)
+    # st.video('output.mp4')
+  # np.save('skydiving_features', st.session_state.video_features)
+  # np.save('skydiving_frames', st.session_state.video_frames)

videos/.DS_Store ADDED Viewed

Binary file (6.15 kB). View file

videos/skydiving.mp4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:96534459fdba80dd076a7c4de5e6d9553db55640baf3ff5956450de3efd0586b
+size 79814669

videos/surfing.mp4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b566c3d0c6894193302096f069b2970f402f0048b4640a6f764889ebe3dfa817
+size 81639070