Documentation Index
Fetch the complete documentation index at: https://docs.selftune.dev/llms.txt
Use this file to discover all available pages before exploring further.
What this guide covers
Use this when you want one path from idea to shipped skill:- Is the skill scoped correctly?
- Does it trigger on the right prompts?
- Does it add value compared with no skill?
- Can I deploy it without guessing?
selftune create. If you want the lighter introductory version first, use Build and Improve Your First Skill.
Step 1: Pick a coherent skill boundary
Start with one unit of work the agent would otherwise get wrong or do inconsistently. Good candidates:- A reusable workflow with domain-specific context
- A task with recurring trigger misses
- A job that mixes judgment with a few deterministic steps
- A vague bucket like “do engineering work”
- A tiny one-line trick that the base agent already handles well
- A grab-bag of unrelated tasks
- Put routing in
description - Put ordered execution in
workflows/ - Put durable context in
references/ - Put deterministic mechanics in
scripts/or tools
Step 2: Create the draft package
Start withselftune create init if you already know the skill you want to build:
SKILL.md is the router. workflows/default.md is the first execution path. references/overview.md is durable background context. selftune.create.json records the package metadata selftune uses for readiness and package replay.
Start with a small router:
Step 3: Put the right detail in the right file
Keep the always-loaded instructions lean. Use the rest of the directory intentionally:workflows/for the main path once the skill is selectedreferences/for checklists, taxonomies, schemas, or examples the agent should load on demandscripts/for exact mechanics the agent should execute instead of reinventingassets/for templates, static examples, or config snippets
SKILL.md on activation, and support files only when needed.
Step 4: Check the package before generating evals
Use the draft-aware status and verify commands first:create status is the fast local view. verify runs the same readiness
contract as create check, then emits the measured package report once the
draft is actually ready.
At this point you want to confirm:
- the package structure is complete enough to validate
- the entry workflow exists
- the description is specific enough to route
- the next missing artifact is clear
- should-trigger prompts
- should-not-trigger near misses
- realistic prompts with file paths, context, and messy phrasing
Step 5: Generate your first eval set
If you already have real usage:needs_evals to needs_unit_tests.
Step 6: Add skill-level unit tests
Generate or run deterministic tests for the workflow itself:selftune verify --skill-path ... again after the suite is generated or
recorded.
Step 7: Prove the package with replay validation
For a new draft package, use the package-aware replay path instead of a generic evolve dry-run:workflows/default.mdreferences/overview.md- other package-local files the skill needs during execution
--mode routing only if you intentionally want to isolate the routing layer.
Step 8: Measure the no-skill baseline
Record whether the skill actually adds value versus doing nothing:selftune verify --skill-path ... should move to the point
where publish is the next lifecycle action.
Step 9: Publish the draft package
Once evals, unit tests, replay validation, and baseline are all in place, ship through the lifecycle surface:- blocks if the draft is not ready
- reuses the same measured package evaluation contract you saw during
verify - starts watch by default unless you pass
--no-watch
create replay or create baseline, inspect the dashboard skill report, and publish only after the draft loop is green.
Step 10: Watch the deployed skill
If you did not use--watch, start monitoring explicitly:
selftune status now expose this flow directly:
- missing package resources
- spec validation not yet run
- missing evals
- missing unit tests
- missing replay validation for the package
- missing baseline
- ready to publish
- already deployed and under watch
Step 11: Publish and share it
When the skill is stable, distribute it through the Agent Skills ecosystem:Deployment checklist
Ship only when all of these are true:- the
descriptionexplains when to use the skill, not how to implement it - nearby negative examples do not trigger
- the package passes
selftune verify workflows/,references/, andscripts/each have a clear purpose- the skill validates against the Agent Skills package rules
- evals exist
- unit tests exist
- replay evidence exists for the package
- the no-skill baseline exists
- the publish command has been reviewed
Read next
| Goal | Read next |
|---|---|
| Learn the introductory version first | Build and Improve Your First Skill |
| Tune the router | Writing Effective Descriptions |
| Test trigger boundaries harder | Testing Skill Triggers |
| Package it for other users | Publishing and Sharing Skills |
| Operate the loop continuously | The Iteration Loop |