omm7 committed on
Commit 798fdc2 · verified · 1 Parent(s): 6163fd7

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +30 -12
README.md CHANGED
@@ -1,19 +1,37 @@
  ---
- title: Lip Reader
- emoji: 🚀
- colorFrom: red
- colorTo: red
  sdk: docker
- app_port: 8501
- tags:
- - streamlit
  pinned: false
- short_description: Streamlit template space
  ---

- # Welcome to Streamlit!

- Edit `/src/streamlit_app.py` to customize this app to your heart's desire. :heart:

- If you have any questions, checkout our [documentation](https://docs.streamlit.io) and [community
- forums](https://discuss.streamlit.io).
  ---
+ title: LipNet Silent Speech Recognition
+ emoji: 👄
+ colorFrom: purple
+ colorTo: indigo
  sdk: docker
  pinned: false
  ---

+ # LipNet: Silent Speech Recognition

+ Reads lips from video and predicts spoken text; no audio required.

+ ## File Structure
+ ```
+ ├── Dockerfile
+ ├── requirements.txt
+ ├── README.md
+ ├── models/
+ │   └── checkpoint.weights.h5   ← upload your weights here
+ └── app/
+     ├── app.py
+     ├── modelutil.py
+     ├── utils.py
+     └── data/
+         ├── s1/
+         │   └── *.mpg           ← sample videos from GRID corpus
+         └── alignments/
+             └── s1/
+                 └── *.align     ← alignment files
+ ```
+
+ ## Model
+ - **Input**: 75 frames, mouth crop 46×140 px, grayscale, z-score normalized
+ - **Architecture**: Conv3D × 3 → Reshape → BiLSTM × 2 → Dense(41) → CTC
+ - **Dataset**: GRID Corpus Speaker S1
+ - **Vocab**: a–z, 1–9, `'`, `?`, `!`, space (39 characters; one further index, presumably an out-of-vocabulary token, brings this to 40, and the CTC blank makes 41)
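
The vocabulary and CTC output described in the Model section can be illustrated with a small, dependency-free sketch. The characters listed there add up to 39, so to reach the 41 outputs of `Dense(41)` this sketch assumes one reserved index (for example, an out-of-vocabulary token) plus the CTC blank. The index layout, the names `CHAR_TO_INDEX` and `greedy_ctc_decode`, and the use of greedy decoding are all illustrative assumptions; the README does not say how `app.py` or `modelutil.py` actually decode predictions.

```python
import string
from itertools import groupby

# Characters from the README's Vocab line: a-z, 1-9, ', ?, !, space.
VOCAB = list(string.ascii_lowercase) + list("123456789") + ["'", "?", "!", " "]

# Assumption (not stated in the README): index 0 is reserved, e.g. for an
# out-of-vocabulary token, and the last index is the CTC blank.
RESERVED_INDEX = 0
CHAR_TO_INDEX = {ch: i + 1 for i, ch in enumerate(VOCAB)}
INDEX_TO_CHAR = {i: ch for ch, i in CHAR_TO_INDEX.items()}
BLANK_INDEX = len(VOCAB) + 1
NUM_CLASSES = BLANK_INDEX + 1  # matches Dense(41) under the assumption above


def greedy_ctc_decode(frame_indices):
    """Greedy CTC decoding: collapse repeated indices, then drop non-characters.

    `frame_indices` holds one predicted class index per video frame
    (75 per clip, according to the README).
    """
    collapsed = [idx for idx, _ in groupby(frame_indices)]
    # The blank and the reserved index are not in INDEX_TO_CHAR, so they vanish.
    return "".join(INDEX_TO_CHAR[i] for i in collapsed if i in INDEX_TO_CHAR)


if __name__ == "__main__":
    b = BLANK_INDEX
    frames = [CHAR_TO_INDEX[c] for c in "bbi"] + [b] + [CHAR_TO_INDEX[c] for c in "nn"] + [b]
    print(NUM_CLASSES, repr(greedy_ctc_decode(frames)))
```

A blank between two identical indices is what lets CTC emit a doubled letter: `[l, blank, l]` decodes to `ll`, whereas `[l, l]` collapses to a single `l`.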