s3prl/superb
Viewer • Updated • 304k • 1.51k • 33
How to use yujiepan/internal.wav2vec2-base-superb-ks-int8-structured64-quantize-inputs with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("audio-classification", model="yujiepan/internal.wav2vec2-base-superb-ks-int8-structured64-quantize-inputs") # Load model directly
from transformers import AutoProcessor, NNCFNetwork
processor = AutoProcessor.from_pretrained("yujiepan/internal.wav2vec2-base-superb-ks-int8-structured64-quantize-inputs")
model = NNCFNetwork.from_pretrained("yujiepan/internal.wav2vec2-base-superb-ks-int8-structured64-quantize-inputs")This model is a fine-tuned version of anton-l/wav2vec2-base-ft-keyword-spotting on the superb dataset. It achieves the following results on the evaluation set:
This model is quantized. The input is also quantized. Structured Sparsity in transformer block linear layers is 64%.
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 0.4481 | 1.0 | 399 | 0.2105 | 0.9469 |
| 5.6584 | 2.0 | 798 | 5.5480 | 0.9428 |
| 8.7915 | 3.0 | 1197 | 8.6634 | 0.9601 |
| 10.4775 | 4.0 | 1596 | 10.2819 | 0.9553 |
| 10.9142 | 5.0 | 1995 | 10.7770 | 0.9657 |
| 10.9478 | 6.0 | 2394 | 10.7637 | 0.9660 |
| 0.2765 | 7.0 | 2793 | 0.1335 | 0.9678 |
| 0.2532 | 8.0 | 3192 | 0.1075 | 0.9732 |
| 0.2837 | 9.0 | 3591 | 0.1109 | 0.9700 |
| 0.2 | 10.0 | 3990 | 0.1006 | 0.9765 |
| 0.1742 | 11.0 | 4389 | 0.0930 | 0.9776 |
| 0.1718 | 12.0 | 4788 | 0.0933 | 0.9769 |