Talking Tech
  • Home
  • About
  • Contact
  • 🚀 Sealix
Sign in Subscribe

OpenSource

A collection of 1 post
gollama terminal interface managing llama.cpp instances on multi-GPU setup
OpenSource Featured

Building gollama: A Single-Binary llama.cpp Instance Manager

Why I built gollama — a single Go binary that manages llama.cpp instances with full flag control, multi-GPU support, and a web UI. Honest account of the technical decisions behind building an open-source LLM inference manager for self-hosters.
29 May 2026 5 min read
Page 1 of 1
Talking Tech © 2026
  • Privacy Policy
Powered by Ghost