RubixKube uses the open Agent Skills format pioneered by Anthropic and adopted by Cursor, Claude Code, and GitHub Copilot. A skill written for RubixKube is portable to those tools, and vice versa.
Prerequisites
- A RubixKube workspace you can admin.
- A runbook or playbook you already use, in any format (Google Doc, Notion, markdown).
- About twenty minutes.
What a skill looks like
A skill is a YAML file plus optional prompt instructions. Minimal shape:| Field | Purpose |
|---|---|
name | Unique handle, kebab-case. Used when invoking with /skills. |
scope | system (RubixKube-maintained) or tenant (your workspace). Custom skills are always tenant. |
agent | Which agent can run this. Today only the SRI Agent is supported. |
description | One sentence. Used in the skill catalogue and for intent matching. |
allowed_tools | Explicit tool allowlist. Anything not listed is blocked. |
instructions | Plain-English runbook. Numbered steps work best. |
Step 1: Pick a real runbook
Skills work best when they wrap something your team already does in a specific way. Good candidates:Post-deploy verification
Compare current metrics to the last stable window after a rollout.
On-call handoff
Summarise open incidents, pending actions, and high-risk changes for the incoming rotation.
Cost anomaly review
Weekly pass to spot services drifting over budget.
Pre-change safety check
Before a risky deploy, run a standard readiness check.
Step 2: Write the skill
Open Skills in the console and click New skill. Paste your YAML into the editor. The console validates the schema live.Step 3: Pick the right tools
Theallowed_tools list is the safety boundary. The SRI Agent can only call tools listed here. A few of the most common:
| Tool | What it does |
|---|---|
fetch_infrastructure_snapshot | Reads current state of resources across environments |
fetch_kubernetes_graph_snapshot | Full topology of a connected cluster |
fetch_deployment_history | Recent rollouts, rollbacks, and config changes |
analyze_service_health | Aggregated metric view for one or more services |
query_logs | Read-only log access for a scoped window |
run_linear_issue | Create a Linear issue with the output |
Step 4: Test before enabling
Click Test in the console. The skill runs against your live environment and renders the output. Iterate on the instructions until the answer is what you want.Step 5: Enable and share
Flip Enabled on the skill. Now the SRI Agent can invoke it two ways.- Intent match. A user asks a question the skill description fits, and the agent picks it up automatically.
- Explicit call. A user types
/skills skill-namein Chat.
Step 6: Watch it learn
Every run feeds the Memory Engine. Common follow-up questions on top of a skill’s output get folded into its context, so the next run is tighter. You can inspect a skill’s run history and adjust the instructions with real traffic as signal.Examples to copy
On-call handoff (fully worked example)
On-call handoff (fully worked example)
Cost anomaly review (fully worked example)
Cost anomaly review (fully worked example)
Common questions
What is the difference between a system skill and a custom skill?
What is the difference between a system skill and a custom skill?
System skills ship with RubixKube and are maintained by the team. Custom skills live inside your workspace (
scope: tenant). Both use the same format.Can I version skills?
Can I version skills?
Yes. Every save creates a revision. You can roll back from the skill detail page. Versioning in Git via the RubixKube CLI is on the roadmap.
Do skills cost investigations?
Do skills cost investigations?
A skill run counts as one investigation on your plan. High-frequency skills (like a daily handoff) are usually the best use of the quota.
Can I share skills across workspaces?
Can I share skills across workspaces?
Related guides
Agent Skills concepts
The underlying model: why skills matter, how they are scored, where they run.
Talk to your infra
Writing good skills starts with writing good prompts in Chat.