Skip to content

LLM Providers

Kure Monitor uses an LLM to generate contextual fixes for pod failures and security findings. The provider is configured after install from the Admin Panel → AI Configuration — no API keys at install time.

2.4.0 — the LLM is now also used by the new AI Advice tab. Explanations for advice findings are generated lazily on card expand (one LLM call per finding, cached) so a scan with N findings does not cost N requests up front. The advice explainer prompt is constrained to only reference data present in the finding’s evidence dict — no invented replica counts, image names, ports, or labels.

2.3.4 — the AI Configuration panel defaults to Groq for new installations (was Ollama). Existing installs are unaffected; this only changes the initial pre-selection. Model catalogs across all six providers were refreshed to current (May 2026) latest models.

ProviderAliasDefault ModelPricing
Groq (default)groq, groq_cloudmeta-llama/llama-4-scout-17b-16e-instructgroq.com/pricing
OpenAIopenaigpt-5.5-miniopenai.com/pricing
Anthropicanthropic, claudeclaude-sonnet-4-6anthropic.com/pricing
Google Geminigemini, googlegemini-3-flashai.google.dev/pricing
GitHub Copilot (GitHub Models)copilot, github, github_modelsopenai/gpt-5.5-miniGitHub Models
Ollama (local)ollamallama4:scoutFree / self-hosted
ProviderModels in the dropdown (default in bold)
OpenAIgpt-5.5, gpt-5.5-mini, gpt-5.4-mini
Anthropicclaude-opus-4-7, claude-sonnet-4-6, claude-haiku-4-5
Google Geminigemini-3.1-pro, gemini-3-flash, gemini-3.1-flash-lite
Groqopenai/gpt-oss-120b, meta-llama/llama-4-scout-17b-16e-instruct, llama-3.3-70b-versatile, openai/gpt-oss-20b
Ollamallama4:scout, llama3.3, qwen3
GitHub Copilotopenai/gpt-5.5, openai/gpt-5.5-mini, anthropic/claude-sonnet-4-6
  • Display name: GitHub Copilot (GitHub Models)
  • Auth: GitHub Personal Access Token (fine-grained, with the Models permission)
  • Base URL: https://models.github.ai/inference
  • API: OpenAI-compatible
  • Example models: openai/gpt-5.5, openai/gpt-5.5-mini, anthropic/claude-sonnet-4-6

For air-gapped clusters. Run Ollama in your cluster, point Kure at it, and your cluster data never leaves your network. Default model: llama4:scout. Other models in the dropdown: llama3.3, qwen3.

Use caseProviderModel
Best qualityAnthropicclaude-opus-4-7
Best valueOpenAIgpt-5.5-mini
FastestGroqmeta-llama/llama-4-scout-17b-16e-instruct
Free tierGroqmeta-llama/llama-4-scout-17b-16e-instruct
Google ecosystemGooglegemini-3-flash
GitHub ecosystemGitHub Copilotopenai/gpt-5.5-mini
Local / air-gappedOllamallama4:scout
  1. Open the dashboard
  2. Admin Panel → AI Configuration
  3. Pick a provider
  4. Paste your API key (or PAT for GitHub Copilot)
  5. Pick a model from the dropdown
  6. Click Test Connection — verifies the key works
  7. Click Save Configuration

If the LLM call fails at runtime, Kure falls back to rule-based solutions so the dashboard stays useful.

  1. Generate a new key with your provider
  2. Admin Panel → AI Configuration
  3. Replace the existing key
  4. Test ConnectionSave

LLM API keys are encrypted at rest using a Fernet key (security.encryptionKey in Helm values). If left empty at install time the chart auto-generates one.

DELETE /api/admin/llm/config

…or click Delete in the Admin panel. Kure reverts to rule-based solutions.