Documentation Index
Fetch the complete documentation index at: https://docs.selftune.dev/llms.txt
Use this file to discover all available pages before exploring further.
Why descriptions matter
Your skill description is the single most important piece of text in your entire skill. It carries the entire burden of triggering — at session start, the agent loads only the name and description of each skill (~50-100 tokens). If the description doesn’t match the user’s query, the skill never activates and none of your carefully written instructions matter. The agent skills spec defines a hard limit of 1024 characters for descriptions. That’s your budget for convincing every compatible agent to activate your skill at the right time.The anatomy of a good description
A good description has three parts:- What it does — imperative phrasing, not passive
- When to use it — explicit trigger contexts
- Trigger keywords — the words users actually say
Use imperative phrasing
The spec recommends “Use this skill when…” not “This skill does…” — imperative phrasing gives the agent a clearer activation signal.Focus on user intent, not implementation
Users don’t say “execute web security assessment using OWASP methodology.” They say “check if my site is vulnerable.”Be pushy about trigger contexts
The spec says to err on the side of listing contexts explicitly, including cases where the user doesn’t name the domain directly:The developer-user gap
This is the core problem selftune solves. Developers think in technical terms; users think in task terms:| Developer writes | User says |
|---|---|
| ”Create PowerPoint presentations" | "make me a slide deck" |
| "Execute web security assessment" | "check if my site is vulnerable" |
| "Generate TypeScript CLI tools" | "build me a command-line thing" |
| "Process PDF documents" | "grab the text from this contract” |
Testing and optimization
The spec recommends designing ~20 test queries and running an optimization loop. See the full guide in Testing Skill Triggers. The short version:Avoiding overfitting
Split your test queries into a training set (~60%) and a validation set (~40%). If you optimize only against the full set, you’ll overfit to specific phrasings. Better yet, let selftune handle it — evolution generates multiple candidate descriptions and validates each against eval sets using Pareto multi-candidate selection, so no single trigger dimension is sacrificed for another.Before and after
Here’s a real example of description evolution:Further reading
Agent Skills Spec
The open standard for skill descriptions and discovery.
Testing Triggers
Verify your descriptions actually work.
Evolution
Let selftune optimize descriptions automatically.
Example Skills
Study real skill descriptions from Anthropic’s skill library.