EC2 Simplified Automatic Recovery conflicts with Karpenter's termination behavior

### Description

EC2 [Simplified Automatic Recovery](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-instance-recover.html) is enabled by default for many instance types. When triggered, it can hold a node for much longer than it would take for Karpenter to simply replace it.

Karpenter's [interruption controller](https://github.com/aws/karpenter-provider-aws/blob/fd39c51f/pkg/controllers/interruption/controller.go#L254) handles spot interruptions and scheduled maintenance via EventBridge, but doesn't watch the [system status checks](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/monitoring-system-instance-status-check.html#system-status-checks) that trigger EC2 Auto Recovery.

As a potential design option, we could:

1. Always disable auto recovery in [maintenanceOptions](https://docs.aws.amazon.com/AWSEC2/latest/APIReference/API_LaunchTemplateInstanceMaintenanceOptionsRequest.html) on Karpenter-managed launch templates
2. Watch for status check failures via [DescribeInstanceStatus](https://docs.aws.amazon.com/AWSEC2/latest/APIReference/API_DescribeInstanceStatus.html) and trigger node replacement

---

* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the community and maintainers prioritize this request
* Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
* If you are interested in working on this issue or have submitted a pull request, please leave a comment


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

EC2 Simplified Automatic Recovery conflicts with Karpenter's termination behavior #8821

Description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

EC2 Simplified Automatic Recovery conflicts with Karpenter's termination behavior #8821

Description

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions