An evolutionary process for finding the best quantized tensor weights, used to build optimal GGUF variants of an AI model.
CLI tool for efficient, easy conversion between safetensors and GGUF models
Advanced data analysis with causality and reinforcement learning
Convert and quantize LLMs
Unified local AI interface & LLM runtime (supports GGUF, Ollama, OpenAI, Gemini, etc.), in pursuit of building a sovereign AI system ✨
Auto GGUF converter for Hugging Face Hub models with multiple quantization levels
Convert Hugging Face models to GGUF with xet support.
Quantize LLMs automatically.
Ready-to-run Colab notebook to run GLM-4.7-Flash Finetuned on Claude Opus 4.5 xHigh-Reasoning (GGUF) with llama.cpp, featuring GPU/CPU split loading, streaming chat, multi-chat manager, and a Gradio web UI — optimized for free T4 environments.
Deploying LoRAfrica on consumer CPU devices
Create optimized GGUF quantizations by cloning from any GGUF of the same architecture.
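Several of the projects above revolve around GGUF block quantization. As a rough illustration of the idea, here is a minimal, simplified sketch of Q4_0-style quantization in plain Python: weights are split into fixed-size blocks, and each block stores one float scale plus 4-bit integer codes. This is not the actual llama.cpp implementation (which operates on packed C structs); the function names and the exact rounding details here are illustrative assumptions.

```python
# Simplified sketch of GGUF-style Q4_0 block quantization (illustrative,
# NOT the real llama.cpp code): each 32-weight block is stored as one
# float scale plus 32 unsigned 4-bit codes.

BLOCK_SIZE = 32  # Q4_0 in llama.cpp uses 32-element blocks

def quantize_q4_0(weights):
    """Quantize a flat list of floats into a list of (scale, codes) blocks."""
    blocks = []
    for i in range(0, len(weights), BLOCK_SIZE):
        block = weights[i:i + BLOCK_SIZE]
        amax = max(block, key=abs)              # element with largest magnitude
        d = amax / -8.0 if amax != 0 else 0.0   # scale so amax maps to code -8
        inv = 1.0 / d if d else 0.0
        # map each weight to an unsigned 4-bit code in [0, 15]
        codes = [min(15, max(0, int(w * inv + 8.5))) for w in block]
        blocks.append((d, codes))
    return blocks

def dequantize_q4_0(blocks):
    """Reconstruct approximate floats from (scale, codes) blocks."""
    out = []
    for d, codes in blocks:
        out.extend((c - 8) * d for c in codes)
    return out
```

The trade-off the topic's tools explore is exactly here: a coarser code width (4 bits) shrinks the file roughly 4x versus fp16, at the cost of per-block rounding error bounded by half the scale `d`.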