ZIP
WinGet
llama.cpp
ggml · b9128 · x64
Before deploying, verify this file with VirusTotal ↗
Silent Commands
Distributed as a ZIP archive — extract the contents and run the executable directly. No installer is included.
File Identity
Filename
llama-b9128-bin-win-vulkan-x64.zip
Signature
Status
Upload installer to verify signature
Installer Selection
WinGet Package
Package ID
ggml.llamacpp
Version
b9128
Description
LLM inference in C/C++
License
MIT
↗
Installer URL
https://github.com/ggml-org/llama.cpp/releases/download/b9128/llama-b9128-bin-win-vulkan-x64.zip
Release Notes
hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993)
- hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm usecase
- hmx-mm: optimize per-group scale handling
- hmx-fa: optimize slope load from vtcm
- hmx-fa: use aligned access where possible in hmx-utils
- hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers
Co-authored-by: Max Krasnyansky maxk@qti.qualcomm.com
macOS/iOS:
- macOS Apple Silicon (arm64)
- macOS Apple Silicon (arm64, KleidiAI enabled)…