ZIP
WinGet
llama.cpp
ggml · b7549 · x64
Before deploying, verify this file with VirusTotal ↗
Silent Commands
Distributed as a ZIP archive — extract the contents and run the executable directly. No installer is included.
File Identity
Filename
llama-b7549-bin-win-vulkan-x64.zip
Signature
Status
Upload installer to verify signature
Installer Selection
WinGet Package
Package ID
ggml.llamacpp
Version
b7549
Description
LLM inference in C/C++
License
MIT
↗
Installer URL
https://github.com/ggml-org/llama.cpp/releases/download/b7549/llama-b7549-bin-win-vulkan-x64.zip
Release Notes
vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (#18352)
Run a preprocess to count how many times each expert is used, and use this to
quickly discard workgroups that aren't needed.
macOS/iOS:
- macOS Apple Silicon (arm64)
- macOS Intel (x64)
- iOS XCFramework
Linux:
- Ubuntu x64 (CPU)
- Ubuntu x64 (Vulkan)
- Ubuntu s390x (CPU)
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x…