Detect and propose
It surfaces likely issues and drafts a concrete remediation you can review before acting.
Detect, propose, and on your approval, remediate
Your AIOps autopilot watches services, spots issues early, and proposes a fix you can read before anything happens. On your approval it carries out self-healing actions like restart, rescale, or roll back, each wrapped in cooldowns and approval gates. It is opt-in and guarded, so you stay in control of every action it takes.
The problem
When a service degrades, you either catch it too late or hand remediation over to a tool that acts without asking. In a governed environment neither is acceptable. You need something that spots problems early, drafts a specific fix you can read, and waits for your sign-off before touching anything.
It surfaces likely issues and drafts a concrete remediation you can review before acting.
Restart, rescale, or roll back run only after your approval, with cooldowns preventing repeat firing.
Every action passes through a gate you control, so nothing changes without your sign-off.
Autopilot stays off until you enable it per service, keeping you in charge of scope.
The autopilot reads health signals and identifies a service that needs attention.
It drafts a specific fix and presents it to you for approval.
On your sign-off it runs the action under cooldown and records the outcome.
How it stays governed
Each proposed remediation is evaluated against policy as code before it can execute. The autopilot cannot restart, rescale, or roll back a service outside the bounds you define, and opt-in scope limits which services it can act on at all.
Every detection, proposal, approval decision, and executed action writes once to a tamper-evident audit trail. You can show exactly what the autopilot observed, what it proposed, who approved, and what it did, with the full sequence intact.
Restart, rescale, and roll back are state-changing actions. The autopilot presents a concrete proposal and holds until a person signs off. Nothing runs without explicit human approval.
Works with your stack
The autopilot reads from observability sources to detect signals and acts on deploy and infrastructure targets to carry out approved remediations.
Who it’s for
When a service shows sustained memory growth, the autopilot flags it and proposes a restart. You review the signal and the proposed action, approve it, and the restart runs under a cooldown that prevents repeat firing.
A sudden traffic spike pushes a service past its comfortable operating range. The autopilot proposes a rescale, you approve it, and the action executes without requiring a person to find the right knob under pressure.
A recent deploy degrades service health. The autopilot detects the drop, proposes a rollback to the previous version, and waits for your sign-off before making any change.
No. The autopilot detects issues and proposes a fix, but every state-changing action (restart, rescale, rollback) requires a person to approve it first. It never acts autonomously.
Only the services you opt in. Autopilot is off by default and must be enabled per service, so you control exactly what falls within its scope.
Restart, rescale, and roll back are the defined action types. Each runs under a cooldown after approval, so the same action cannot fire repeatedly on the same service without a new proposal and sign-off.
Every detection, proposal, approval, and executed action is written to a tamper-evident audit trail. You have a permanent record of what the autopilot observed, what it proposed, who approved, and what ran.
Request a demo, or read the docs to see how it fits the tools you already run.