Description
I've just implemented this action in several repos, and for the most part it has been a big improvement over the self-hosted runner spiderweb. However, roughly 10% (maybe even 20%) of the time, the ping never succeeds and the action fails. My devs are getting really annoyed about this.
I've set the ping to two different hosts (the most static ones in my tailnet, both subnet routers) that are deployed in AWS EC2 across multiple accounts. These hosts work fine on their own; I'd be very aware of any issues with them. I don't think they're the problem, but maybe they are?
When the ping succeeds, it's usually within ~5 seconds. Sometimes it takes just one ping and it's ready to go.
I'm considering deploying some other static host(s) just to offset this, but then I'd be managing infra just for this GitHub Action, which is more or less what I was trying to get away from.
It definitely feels like I'm doing something wrong. I'd love to be able to tell my devs they don't need to hit "Re-run failed jobs" anymore.
Any suggestions?