browserground-gguf (llama.cpp / Ollama)
GGUF build of renezander030/browserground
for llama.cpp, Ollama, and downstream wrappers that accept GGUF multimodal models.
Two files, both required:
| File | Purpose | Size |
|---|---|---|
browserground-Q4_K_M.gguf |
text LLM, Q4_K_M quant | 1.11 GB |
browserground-mmproj-f16.gguf |
vision tower (mmproj), f16 | 0.82 GB |
Use via Ollama
A ready-made Modelfile is in the repo. After downloading both .gguf files:
ollama create browserground -f Modelfile
ollama run browserground "Locate the Submit button" /path/to/screenshot.png
Use via llama.cpp directly
llama-mtmd-cli -m browserground-Q4_K_M.gguf --mmproj browserground-mmproj-f16.gguf --image screenshot.png -p "Locate the element described: Submit button"
Or via the npm CLI (auto-routes to MLX on Apple Silicon)
npm install -g browserground
browserground parse screenshot.png --target "Submit button"
Recipe, numbers, full evaluation: https://huggingface.co/renezander030/browserground.
License: Apache 2.0 (inherits from Qwen/Qwen3-VL-2B-Instruct).
- Downloads last month
- 310
GGUF
Model size
2B params
Architecture
qwen3vl
Hardware compatibility
Log In to add your hardware
4-bit
