https://arxiv.org/abs/2408.16532 โข 4 items โข Updated โข 11
WavTokenizer
SOTA Discrete Codec Models With Forty Tokens Per Second for Audio Language Modeling
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for novateur/WavTokenizer-large-speech-75token
Quantizations
2 models