AWS applied an automatic security and performance update to ElastiCache this morning at 3:30 AM PST. Our ElastiCache instances lost connections to some of the nodes in the cluster, which led to communication issues between microservices.
No data was lost and all executions continued as expected, but users who were testing workflows on the dashboard or making HTTP requests to dependent services received 5xx status codes.
Enterprise customers were completely unaffected.