LLMKube

40 stars

Kubernetes operator for llama.cpp-native LLM inference with GPU scheduling, Apple Silicon Metal support, and an OpenAI-compatible API

Categories

Generative Artificial Intelligence (GenAI)

Platforms

Go
Docker
K8S

License

Apache License, Version 2.0

Last Updated

April 01, 2026

Current Release

v0.5.3
Released April 01, 2026

Related Applications

Ollama

166.8k

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models

Generative Artificial Intelligence (GenAI)

LocalAI

44.8k

Run your AI models locally and generate images and audio

Generative Artificial Intelligence (GenAI)

Chat UI that works with any LLM. It comes loaded with advanced features like agents, web search,...

Generative Artificial Intelligence (GenAI)

Certificate authority and access plane for SSH, Kubernetes, web applications, and databases

Miscellaneous

All-in-one mail server with JMAP, IMAP4, and SMTP support and a wide range of modern features

Communication - Email - Complete Solutions

Flipt

4.8k

Feature flag solution with support for multiple data backends

Software Development - Feature Toggle