Documentation Index
Fetch the complete documentation index at: https://docs.selftune.dev/llms.txt
Use this file to discover all available pages before exploring further.
Overview
After an evolution deploys a new skill description, selftune monitors for regressions. If the new description performs worse than the old one, selftune can automatically roll back.How monitoring works
selftune watch uses a sliding window of post-deploy sessions to compare against the pre-deploy baseline:
- Baseline capture — records pass rates before the evolution deploys
- Post-deploy tracking — monitors new sessions after deployment
- Regression detection — compares post-deploy metrics against the baseline
- Auto-rollback — if regression confidence is strong enough, reverts to the backup
Activation rules
selftune includes built-in activation rules that trigger automatically:| Rule | Condition | Action |
|---|---|---|
post-session-diagnostic | More than 2 unmatched queries in a session | Suggests selftune last |
grading-threshold-breach | Session pass rate below 60% | Suggests selftune evolve |
stale-evolution | No evolution in 7+ days with pending false negatives | Suggests evolve |
regression-detected | Monitoring detects regression | Suggests rollback |
Orchestrate loop
For fully autonomous operation,selftune run runs the complete loop:
Dashboard monitoring
The local dashboard shows real-time skill health with SSE live updates:- Per-skill pass rates over time
- Evolution history and outcomes
- Missed queries and false negatives
- Orchestrate run summaries