chenchen2121/CNVSRC2024Baseline


chenchen2121 committed on May 3, 2024

Commit 3a2497d · verified · 1 Parent(s): 8ce63a7

Create README.md

Files changed (1)

README.md +22 -0

README.md ADDED

	@@ -0,0 +1,22 @@

+---
+metrics:
+- cer
+---
+## Introduction
+This repository provides the baseline model files for CNVSRC2024 (Chinese Continuous Visual Speech Recognition Challenge 2024).
+## Usage
+Please download these model files and use them in the [baseline code](https://github.com/sectum1919/CNVSRC2024Baseline).
+## Performance
+The following table reports each model's character error rate (CER) on its target task; the pre-training checkpoints have no CER entry (`/`).
+|       Training Data       |           Task         |   CER  | File Name                                |
+|:-------------------------:|:----------------------:|:------:|:-----------------------------------------|
+| CN-CVS (<4s)              |      Pre-training      |   /    | model_avg_14_23_cncvs_4s.pth             |
+| CN-CVS (full)             |      Pre-training      |   /    | model_avg_last10_cncvs_4s_30s.pth        |
+| CN-CVS + CNVSRC-Single.Dev| Single-speaker VSR (T1)| 39.66% | model_avg_last5_cncvs_cnvsrc-single.pth  |
+| CN-CVS + CNVSRC-Multi.Dev | Multi-speaker VSR  (T2)| 52.20% | model_avg_last5_cncvs_cnvsrc-multi.pth   |
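
The README reports results as CER (character error rate). As a rough illustration of how such a figure is commonly computed, the sketch below divides the character-level Levenshtein distance between a reference and a hypothesis transcript by the reference length. This is an assumption about the usual definition of the metric, not the baseline's own scoring code, which may normalize text or aggregate over a whole corpus differently. The function names `edit_distance` and `cer` are hypothetical.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Character-level Levenshtein distance (substitutions, insertions, deletions)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference character
                curr[j - 1] + 1,     # insert a hypothesis character
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """CER for one utterance: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference transcript must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Illustrative transcripts only; not taken from CN-CVS or CNVSRC data.
    reference = "今天天气很好"
    hypothesis = "今天天气真好"
    print(f"CER = {cer(reference, hypothesis):.2%}")
```

For a full evaluation set, a common choice is to sum the edit distances over all utterances and divide by the total number of reference characters, rather than averaging per-utterance CERs.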