Home / Browse / LLMKube

LLMKube

92 stars

Kubernetes operator for llama.cpp-native LLM inference with GPU scheduling, Apple Silicon Metal support, and OpenAI-compatible API

Categories

Generative Artificial Intelligence (GenAI)

Platforms

Go
Docker
K8S

License

Apache, Version 2.0

Last Updated

May 17, 2026

Current Release

v0.7.8
Released May 14, 2026

Commit Activity

Loading graph...

Related Applications

Ollama

171.6k

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models

Generative Artificial Intelligence (GenAI)

LocalAI

46.3k

Run your AI models locally and generate images and audio

Generative Artificial Intelligence (GenAI)

Chat UI that works with any LLM. It comes loaded with advanced features like agents, web search,...

Generative Artificial Intelligence (GenAI)

Certificate authority and access plane for SSH, Kubernetes, web applications, and databases

Miscellaneous

All-in-one mail server with JMAP, IMAP4, and SMTP support and a wide range of modern features

Communication - Email - Complete Solutions

Flipt

4.8k

Feature flag solution with support for multiple data backends

Software Development - Feature Toggle