gguf-runner gained vision support, ships GitHub release binaries, and received many usability and performance improvements.
A small Rust CLI to run GGUF models locally: mmap loading, CPU-only inference, and a general-purpose terminal runner that can lean on RAM (and swap) for large models.