Update README.md
README.md
CHANGED
@@ -12,14 +12,14 @@ Lipreading is an advanced neural network model designed for accurate lip reading
 
 - **Dual Input System**: Utilizes both raw image sequences and corresponding lip landmark coordinates for improved context.
 - **Enhanced Spatial Resolution**: Improved spatial analysis of lip movements through detailed landmark tracking.
-- **State-of-the-Art Performance**: Outperforms the original
+- **State-of-the-Art Performance**: Outperforms the original Lipreading implementation as well as the [LipReading-final-year-project implementation](https://github.com/wissemkarous/Lip-reading-Final-Year-Project).
 | Scenario | Image Size (W x H) | CER | WER |
 | :-------------------------------: | :----------------: | :--: | :---: |
 | Unseen speakers (Original) | 100 x 50 | 6.7% | 13.6% |
 | Overlapped speakers (Original) | 100 x 50 | 2.0% | 5.6% |
-| Unseen speakers (
-| Overlapped speakers (
-| Overlapped speakers (
+| Unseen speakers (LipReading-final-year-project) | 128 x 64 | 6.7% | 13.3% |
+| Overlapped speakers (LipReading-final-year-project) | 128 x 64 | 1.9% | 4.6% |
+| Overlapped speakers (LipReading) | 128 x 64 | 0.6% | 1.7% |
 
 ## Getting Started
 