novateur/WavTokenizer
Text-to-Speech
audio-feature-extraction
speech-language-models
gpt4-o
tokenizer
codec-representation
automatic-speech-recognition
arxiv:2408.16532
arxiv:2402.12208
License: mit
main
WavTokenizer
3.17 GB
1 contributor
History: 18 commits
novateur
Update README.md
917d513
verified
over 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
almost 2 years ago
README.md
Safe
5.99 kB
Update README.md
over 1 year ago
WavTokenizer_small_320_24k_4096.ckpt
pickle
Detected Pickle imports (3): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
1.58 GB
Upload WavTokenizer_small_320_24k_4096.ckpt
almost 2 years ago
WavTokenizer_small_600_24k_4096.ckpt
Safe
pickle
Detected Pickle imports (3): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
1.59 GB
Upload WavTokenizer_small_600_24k_4096.ckpt
almost 2 years ago
result.png
Safe
285 kB
Upload result.png
almost 2 years ago
wavtokenizer_smalldata_frame40_3s_nq1_code4096_dim512_kmeans200_attn.yaml
Safe
2.78 kB
Update wavtokenizer_smalldata_frame40_3s_nq1_code4096_dim512_kmeans200_attn.yaml
almost 2 years ago
wavtokenizer_smalldata_frame75_3s_nq1_code4096_dim512_kmeans200_attn.yaml
Safe
2.86 kB
Update wavtokenizer_smalldata_frame75_3s_nq1_code4096_dim512_kmeans200_attn.yaml
almost 2 years ago
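
The listing reports three pickle imports for each .ckpt file. As a rough illustration of how such a list can be obtained and checked before loading a pickle, the sketch below walks the opcode stream with the standard-library pickletools module and never unpickles anything. The function names list_pickle_imports and unexpected_imports are hypothetical, and the sketch assumes the stream names its globals with the GLOBAL opcode (pickle protocol 3 or lower); protocol 4 STACK_GLOBAL references are not resolved. It is not a description of how the Hub's own scanner works, and a checkpoint stored as a zip archive would need its inner pickle extracted first.

```python
import io
import pickletools

# The three imports the listing reports for each .ckpt file.
EXPECTED_IMPORTS = {
    "collections.OrderedDict",
    "torch._utils._rebuild_tensor_v2",
    "torch.FloatStorage",
}


def list_pickle_imports(data: bytes) -> set:
    """Collect "module.name" globals named by GLOBAL opcodes, without unpickling."""
    found = set()
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name == "GLOBAL":
            # pickletools reports the GLOBAL argument as "module name".
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
    return found


def unexpected_imports(data: bytes, allowed=EXPECTED_IMPORTS) -> set:
    """Return the imports in the stream that are not on the allowlist."""
    return list_pickle_imports(data) - set(allowed)


if __name__ == "__main__":
    # Hand-written streams: GLOBAL, (arguments), REDUCE, STOP. Nothing is executed.
    benign = b"ccollections\nOrderedDict\n)R."
    suspicious = b"cos\nsystem\n(S'echo hi'\ntR."
    print(list_pickle_imports(benign))
    print(unexpected_imports(suspicious))
```

An empty result from unexpected_imports means the stream only names globals already on the allowlist; anything else is worth inspecting before the file is loaded.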