Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.selftune.dev/llms.txt

Use this file to discover all available pages before exploring further.

Usage

selftune grade --skill <name> [options]

Subcommands

grade (default)

Grade a skill’s recent sessions:
selftune grade --skill my-skill

grade auto

Auto-grade with custom expectations:
selftune grade auto --skill my-skill --expectations "should handle implicit triggers well"

grade baseline

Establish a baseline score before evolution:
selftune grade baseline --skill my-skill --skill-path path/to/SKILL.md
FlagDescription
--eval-set PATHUse a specific eval set for baseline
grade baseline persists the no-skill comparison into SQLite so the dashboard, skill report, and selftune status can tell whether a skill has been measured against a real baseline yet. For brand-new draft packages, prefer selftune create baseline so the package replay and publish path stays in one surface. grade baseline is still the baseline command for existing skills you are evolving in place.

Common options

FlagDescription
--skill NAMESkill to grade (required)
--skill-path PATHPath to the skill file (for baseline or explicit targeting)
--expectations "..."Natural language expectations for grading
--agent NAMEAgent name for multi-agent setups

How grading works

See Grading concepts for the full 3-tier grading model.