URL: https://dev.to/t/gguf
Gguf - DEV Community
NeMo out, GGUF in: how parakeet.cpp ports NVIDIA ASR to C++
Creeta
Jun 18
#parakeet #ggml #gguf #asr
6 min read
llama-bench skipped FA on capable GPUs — b9437 corrects it
Creeta
Jun 18
#llamacpp #llm #gguf #flashattention
7 min read
Local LLM Security Best Practices: Beyond Basic Hashing
Jay Grider
Jun 13
#llmsecurity #localai #supplychain #gguf
4 min read
How to Pick a GGUF Quant Level for Your VRAM Budget
Patrick Hughes
Jun 11
#localllm #gguf #quantization #gpu
3 min read
GGUF & Modelfile: The Power User's Guide to Local LLMs
Lingdas1
May 23
#gguf #llm #opensource #tutorial
5 min read
GGUF Quantization Explained: Q4_K_M vs Q5_K_M vs Q8 — Which to Pick (2026)
Patrick Hughes
May 13
#llamacpp #gguf #quantization #localai
4 min read
Llama-Server Router Mode - Dynamic Model Switching Without Restarts
Rost
Apr 27
#cheatsheet #gguf #ai #llm
9 min read