
URL: https://huggingface.co/cstr/dac-44khz-GGUF


dac-44khz-GGUF

GGUF conversion of descript/dac_44khz — the Descript Audio Codec (DAC) decoder at 44.1 kHz.

Used as the vocoder backend for Zonos-v0.1-transformer-GGUF and other DAC-based TTS models running in CrispASR.

Files

| File | Size | Description |
|---|---|---|
| dac-44khz-f16.gguf | 104 MB | F16 (half-precision) weights, unquantized |
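
As a rough sanity check on the listed size, the sketch below multiplies the parameter count reported on the model page (54.3M) by two bytes per F16 value. It assumes every tensor is stored as F16 and ignores GGUF header and metadata overhead, so it is only an estimate; the constant names are illustrative.

```python
# Rough size estimate for an all-F16 checkpoint (assumption: no F32 tensors,
# no metadata overhead).
PARAM_COUNT = 54.3e6      # parameter count shown on the model page
BYTES_PER_F16 = 2         # a half-precision float occupies 16 bits

weight_bytes = PARAM_COUNT * BYTES_PER_F16
size_mb = weight_bytes / 1e6      # decimal megabytes
size_mib = weight_bytes / 2**20   # binary mebibytes

print(f"{size_mb:.1f} MB, {size_mib:.1f} MiB")
```

The estimate comes to about 103.6 MiB, which agrees with the listed 104 MB if that figure was measured in binary units.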

Architecture

The DAC decoder converts code sequences from 9 residual VQ codebooks (1024 entries each, ~86 codes/s per codebook) into 44.1 kHz mono PCM:

  • RVQ quantizer: 9 codebooks × 1024 entries × 8-dim → projected to 1024-dim
  • 4-stage convolutional decoder with Snake1d activations and upsampling ratios [8, 8, 4, 2]
  • Total upsampling factor: 8 × 8 × 4 × 2 = 512, so the code frame rate is 44,100 / 512 ≈ 86.13 Hz (rounded to 86 Hz above)
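
The arithmetic behind these figures can be sketched in a few lines of Python. The constants come from this card; the variable and function names are illustrative, and the sample count assumes the decoder emits exactly one hop of audio per code frame, ignoring any edge padding.

```python
from math import log2, prod

SAMPLE_RATE_HZ = 44_100            # output sample rate from this card
UPSAMPLING_RATIOS = [8, 8, 4, 2]   # per-stage decoder ratios from this card
NUM_CODEBOOKS = 9
CODEBOOK_SIZE = 1024

# Each code frame expands to this many PCM samples.
hop_length = prod(UPSAMPLING_RATIOS)

# Code frames per second, per codebook.
frame_rate_hz = SAMPLE_RATE_HZ / hop_length

# A code indexes one of 1024 entries, so it carries log2(1024) = 10 bits.
bits_per_code = int(log2(CODEBOOK_SIZE))
bitrate_bps = NUM_CODEBOOKS * bits_per_code * frame_rate_hz


def samples_for_frames(num_frames: int) -> int:
    """PCM samples produced by decoding `num_frames` code frames (no padding assumed)."""
    return num_frames * hop_length


print(hop_length)                # 512
print(round(frame_rate_hz, 2))   # 86.13
print(round(bitrate_bps))        # 7752
print(samples_for_frames(86))    # 44032
```

At roughly 7.75 kbit/s for all 9 codebooks, one second of audio corresponds to about 86 frames of 9 codes each.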

Usage

Place the file alongside the Zonos transformer GGUF, or pass its path via --codec-model:

# --codec-model is optional: the codec is auto-discovered if it is named dac-44khz-f16.gguf and sits in the same directory as the autoregressive (AR) model
crispasr tts --backend zonos-tts \
 --model /path/to/zonos-v0.1-transformer-q4_k.gguf \
 --codec-model /path/to/dac-44khz-f16.gguf \
 "Hello world."
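
When scripting the command, the same invocation can be assembled in Python. This is a minimal sketch that uses only the flags shown above; `build_tts_command` and `codec_is_alongside` are hypothetical helper names, not part of CrispASR, and the auto-discovery check simply mirrors the naming rule described in the comment above.

```python
from pathlib import Path
from typing import List, Optional

DEFAULT_CODEC_NAME = "dac-44khz-f16.gguf"  # file name from this card


def codec_is_alongside(model: str) -> bool:
    """True if the codec file sits next to the AR model, where auto-discovery looks."""
    return (Path(model).parent / DEFAULT_CODEC_NAME).is_file()


def build_tts_command(model: str, text: str, codec: Optional[str] = None) -> List[str]:
    """Build the crispasr argument list; add --codec-model only when a path is given."""
    cmd = ["crispasr", "tts", "--backend", "zonos-tts", "--model", model]
    if codec is not None:
        cmd += ["--codec-model", codec]
    cmd.append(text)
    return cmd


cmd = build_tts_command(
    "/path/to/zonos-v0.1-transformer-q4_k.gguf",
    "Hello world.",
    codec="/path/to/dac-44khz-f16.gguf",
)
print(" ".join(cmd))
```

The resulting list can be handed to `subprocess.run(cmd, check=True)`. Passing the arguments as a list avoids shell quoting problems with the input text.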

Conversion

Converted from descript/dac_44khz using models/convert-dac-to-gguf.py in CrispASR. The source model (descriptinc/descript-audio-codec) is MIT-licensed.

GGUF

Model size: 54.3M params
Architecture: dac-44khz
Precision: 16-bit
