wissemkarous committed on
Commit 7a8a215 · verified · 1 parent: 6b8d8cb

Update README.md

Files changed (1): README.md (+4 -4)
@@ -12,14 +12,14 @@ Lipreading is an advanced neural network model designed for accurate lip reading
 
  - **Dual Input System**: Utilizes both raw image sequences and corresponding lip landmark coordinates for improved context.
  - **Enhanced Spatial Resolution**: Improved spatial analysis of lip movements through detailed landmark tracking.
- - **State-of-the-Art Performance**: Outperforms the original LipNet, as well as VIPL's [PyTorch implementation of LipNet](https://github.com/VIPL-Audio-Visual-Speech-Understanding/LipNet-PyTorch).
+ - **State-of-the-Art Performance**: Outperforms the original Lipreading, as well as LipReading-final-year-project[ Implementation of LipReading](https://github.com/wissemkarous/Lip-reading-Final-Year-Project).
  | Scenario | Image Size (W x H) | CER | WER |
  | :-------------------------------: | :----------------: | :--: | :---: |
  | Unseen speakers (Original) | 100 x 50 | 6.7% | 13.6% |
  | Overlapped speakers (Original) | 100 x 50 | 2.0% | 5.6% |
- | Unseen speakers (VIPL LipNet) | 128 x 64 | 6.7% | 13.3% |
- | Overlapped speakers (VIPL LipNet) | 128 x 64 | 1.9% | 4.6% |
- | Overlapped speakers (LipCoordNet) | 128 x 64 | 0.6% | 1.7% |
+ | Unseen speakers (LipReading-final-year-project) | 128 x 64 | 6.7% | 13.3% |
+ | Overlapped speakers ( LipReading-final-year-project) | 128 x 64 | 1.9% | 4.6% |
+ | Overlapped speakers (LipReading) | 128 x 64 | 0.6% | 1.7% |
 
  ## Getting Started
 