Job Description
Heavy on SRE / DevOps (leaning toward the SRE hiring track)
SRE experience with Gremlin or Chaos Toolkit (not a hard requirement, but candidates with it are more likely to get through)
Failure Mode and Effects Analysis (FMEA) experience or observability with OpenTelemetry (OTel) (not a hard requirement, but candidates with it are more likely to get through)
Experience with multi-region, highly available systems and resiliency
Kubernetes experience, but need not be an expert; i.e., able to deploy an application to Kubernetes
Needs IaC experience (Terraform is preferred; doesn't need to be an SME - this will be weighted lowest in grading)
Able to write code from scratch (Python is preferred, otherwise Java)
