URL: https://huggingface.co/nthngdy/Llama-3.2-3B-Instruct_qfilt

nthngdy/Llama-3.2-3B-Instruct_qfilt · Hugging Face

This model has been pushed to the Hub using the PyTorchModelHubMixin integration:

Library: [More Information Needed]
Docs: [More Information Needed]

Downloads last month: 4,564

Collection including nthngdy/Llama-3.2-3B-Instruct_qfilt

Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated Mar 3, 2025