Rust

Splitting Rust builds and tests on Codeberg

A heavy Rust integration test suite kept getting killed by Codeberg's per-job time limit. Here is how a targeted dev profile, a build-once job, and cargo-nextest partitioning got it green again.

Created 2026

Shipping CPU-optimized Rust binaries in container images

How to map the OCI platform variant to a rustc target-cpu so a Rust binary actually uses AVX2/AVX-512 for LLM and image work inside a container — and how to merge the variants into one multi-arch tag.

Created 2026

TurboQuant in gguf-runner: roughly half the memory at nearly the same speed

Implementing TurboQuant in gguf-runner cuts KV-cache memory roughly in half while staying close to Q8 throughput.

Created 2026

gguf-runner updates: vision support, releases, and many small improvements

gguf-runner gained vision support, ships GitHub release binaries, and received many usability and performance improvements.

Created 2026

gguf-runner: a minimal GGUF CLI

A small Rust CLI to run GGUF models locally: mmap loading, CPU-only inference, and a general-purpose terminal runner that can lean on RAM (and swap) for large models.

Created 2026

An epaper picture frame

A Special Purpose HTTP Proxy in Rust

A HTTP Server-Timing Header for axum

Write a KEDA external Scaler for Oracle in Rust

How to access Azure Key Vault in Rust

A hello world kubernetes operator in Rust