Silent Install HQ
ZIP WinGet

llama.cpp

ggml · b7898 · x64

Before deploying, verify this file with VirusTotal ↗

Silent Commands

Distributed as a ZIP archive — extract the contents and run the executable directly. No installer is included.

File Identity

SHA256 0ab5e72a688bd37e21871c913a3b159851284e24b64f8fa1e6f90bf620905624 VirusTotal ↗
Filename llama-b7898-bin-win-vulkan-x64.zip

Signature

Status Upload installer to verify signature

Installer Selection

WinGet Package

Package ID ggml.llamacpp
Version b7898
Description LLM inference in C/C++
License MIT
Release Notes ggml-hexagon: flash-attention and reduce-sum optimizations (#19141) - wip - ggml-hexagon: add vectorized dot product function for FP32 and FP16 accumulation - ggml-hexagon: optimize dot product functions for FP16 and FP32 with new vectorized implementations - wip - ggml-hexagon: optimize hvx_vec_dump_f32_n and hvx_vec_reduce_sum_qf32x2 functions for improved performance - ggml-hexagon: refactor dot product functions to use a common loading function for improved readability - optimize vector dot…