ZIP
WinGet
llama.cpp
ggml · b7898 · x64
Before deploying, verify this file with VirusTotal ↗
Silent Commands
Distributed as a ZIP archive — extract the contents and run the executable directly. No installer is included.
File Identity
Filename
llama-b7898-bin-win-vulkan-x64.zip
Signature
Status
Upload installer to verify signature
Installer Selection
WinGet Package
Package ID
ggml.llamacpp
Version
b7898
Description
LLM inference in C/C++
License
MIT
↗
Installer URL
https://github.com/ggml-org/llama.cpp/releases/download/b7898/llama-b7898-bin-win-vulkan-x64.zip
Release Notes
ggml-hexagon: flash-attention and reduce-sum optimizations (#19141)
- wip
- ggml-hexagon: add vectorized dot product function for FP32 and FP16 accumulation
- ggml-hexagon: optimize dot product functions for FP16 and FP32 with new vectorized implementations
- wip
- ggml-hexagon: optimize hvx_vec_dump_f32_n and hvx_vec_reduce_sum_qf32x2 functions for improved performance
- ggml-hexagon: refactor dot product functions to use a common loading function for improved readability
- optimize vector dot…