Silent Install HQ
ZIP WinGet

llama.cpp

ggml · b9128 · x64

Before deploying, verify this file with VirusTotal ↗

Silent Commands

Distributed as a ZIP archive — extract the contents and run the executable directly. No installer is included.

File Identity

SHA256 0962af6a8c3213acbddfce898b09db3172502dc2de501659f38635ca7aaf2ddc VirusTotal ↗
Filename llama-b9128-bin-win-vulkan-x64.zip

Signature

Status Upload installer to verify signature

Installer Selection

WinGet Package

Package ID ggml.llamacpp
Version b9128
Description LLM inference in C/C++
License MIT
Release Notes hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993) - hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm usecase - hmx-mm: optimize per-group scale handling - hmx-fa: optimize slope load from vtcm - hmx-fa: use aligned access where possible in hmx-utils - hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers Co-authored-by: Max Krasnyansky maxk@qti.qualcomm.com macOS/iOS: - macOS Apple Silicon (arm64) - macOS Apple Silicon (arm64, KleidiAI enabled)…