Spaces: omm7/lip_reader

omm7 committed on Mar 19

Commit 798fdc2 (verified) · 1 Parent(s): 6163fd7

Upload README.md with huggingface_hub

Files changed (1)

README.md CHANGED

@@ -1,19 +1,37 @@
 ---
-title: Lip Reader
-emoji: 🚀
-colorFrom: red
-colorTo: red
 sdk: docker
-app_port: 8501
-tags:
-- streamlit
 pinned: false
-short_description: Streamlit template space
 ---
-# Welcome to Streamlit!
-Edit `/src/streamlit_app.py` to customize this app to your heart's desire. :heart:
-If you have any questions, checkout our [documentation](https://docs.streamlit.io) and [community
-forums](https://discuss.streamlit.io).

 ---
+title: LipNet Silent Speech Recognition
+emoji: 👄
+colorFrom: purple
+colorTo: indigo
 sdk: docker
 pinned: false
 ---
+# LipNet — Silent Speech Recognition
+Reads lips from video and predicts spoken text — no audio required.
+## File Structure
+```
+├── Dockerfile
+├── requirements.txt
+├── README.md
+├── models/
+│   └── checkpoint.weights.h5      ← upload your weights here
+└── app/
+    ├── app.py
+    ├── modelutil.py
+    ├── utils.py
+    └── data/
+        ├── s1/
+        │   └── *.mpg              ← sample videos from GRID corpus
+        └── alignments/
+            └── s1/
+                └── *.align        ← alignment files
+```
+## Model
+- **Input**: 75 frames, mouth crop 46×140px, grayscale, z-score normalized
+- **Architecture**: Conv3D × 3 → Reshape → BiLSTM × 2 → Dense(41) → CTC
+- **Dataset**: GRID Corpus Speaker S1
+- **Vocab**: a–z, 1–9, `'`, `?`, `!`, space (40 chars + CTC blank = 41)
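
The Vocab and Architecture lines of the new README imply a mapping from the 41 output classes back to characters. The sketch below is one plausible reading, not the Space's actual code: it enumerates the listed characters (which come to 39, so the README's count of 40 presumably includes one reserved entry such as an out-of-vocabulary slot) and applies greedy CTC decoding to per-frame class indices. The index layout (slot 0 reserved, last index as the CTC blank) and the names `VOCAB`, `RESERVED`, `BLANK`, and `ctc_greedy_decode` are assumptions made for illustration.

```python
import string

# Characters listed in the README's Vocab line: a-z, 1-9, ', ?, !, space.
VOCAB = list(string.ascii_lowercase) + list("123456789") + ["'", "?", "!", " "]

# Assumed layout (not confirmed by the README): index 0 is a reserved
# out-of-vocabulary slot, indices 1..39 are the characters above, and the
# last index is the CTC blank, giving the 41 classes of Dense(41).
RESERVED = 0
INDEX_TO_CHAR = {i + 1: ch for i, ch in enumerate(VOCAB)}
BLANK = len(VOCAB) + 1
NUM_CLASSES = BLANK + 1


def ctc_greedy_decode(frame_classes):
    """Collapse consecutive repeats, then drop blank and reserved indices."""
    chars = []
    prev = None
    for idx in frame_classes:
        # Emit a character only when the class changes and is a real symbol.
        if idx != prev and idx not in (BLANK, RESERVED):
            chars.append(INDEX_TO_CHAR[idx])
        prev = idx
    return "".join(chars)


if __name__ == "__main__":
    # Hypothetical per-frame argmax output for the word "bin".
    frames = [2, 2, BLANK, 9, 9, BLANK, BLANK, 14]
    print(len(VOCAB), NUM_CLASSES, ctc_greedy_decode(frames))
```

In a real pipeline the per-frame indices would come from taking the argmax over the model's 41-way output for each of the 75 frames. A blank between two identical indices is what lets a doubled letter survive the collapse step.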