Voozh

VOOZH

URL: https://dev.to/t/gguf

⇱ Gguf - DEV Community

👁 creeta profile

NeMo out, GGUF in: how parakeet.cpp ports NVIDIA ASR to C++

#parakeet #ggml #gguf #asr

6 min read

👁 creeta profile

llama-bench skipped FA on capable GPUs — b9437 corrects it

#llamacpp #llm #gguf #flashattention

7 min read

👁 jaychkdsk profile

Local LLM Security Best Practices: Beyond Basic Hashing

#llmsecurity #localai #supplychain #gguf

4 min read

👁 pat9000 profile

How to Pick a GGUF Quant Level for Your VRAM Budget

#localllm #gguf #quantization #gpu

3 min read

👁 lingdas1 profile

GGUF & Modelfile: The Power User's Guide to Local LLMs

#gguf #llm #opensource #tutorial

5 min read

👁 pat9000 profile

GGUF Quantization Explained: Q4_K_M vs Q5_K_M vs Q8 — Which to Pick (2026)

#llamacpp #gguf #quantization #localai

4 min read

👁 rosgluk profile

Llama-Server Router Mode - Dynamic Model Switching Without Restarts

#cheatsheet #gguf #ai #llm

9 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.