Referenz: JL-727825_1657160014

Senior Site Reliability Engineer



Senior Site Reliability Engineer


Roles and Responsibilities:

- Establish metrics for data-driven decisions to help increase availability, reliability, and velocity
- Apply SRE core tenets of measurement (SLI/SLO/SLA), eliminate toil, and reliability modelling
- Build, maintain, improve, scale and secure cloud infrastructure and resources using IaC tool
- Develop solutions to automate manual development & operational task
- Responsible for the availability, performance, change management, telemetry, and capacity management of their services
- Proactively analyze data and test the integrity of systems to ensure production applications and services are operating optimally
- Participate in in Root Cause Analysis and post-mortem to identify and eliminate gaps and improve service
- Analyse and resolve issues in software, systems, tools, and services to minimize down time and interruption to development
- Identify and mitigate risks with both current infrastructure, systems, and technologies as well as potential future risks with scalability and reliability


- Degree in Computer Science or equivalent with 5 years of relevant experience in the below areas:
- Ability to troubleshoot problems, and solve abstract issues during system administration.
- Experience in documentation for manuals, guides, troubleshooting and system design.
- Sound experience on cloud computing platforms such as AWS(e.g. AWS: EC2, RDS, ELB, EFS, ELK, ElasticCache, S3)
- Experienced working on monitoring frameworks, microservices and orchestrators
- Experience in observability stack (e.g. Grafana, Datadog, Prometheus), logging and traceability
- Familiarity with multiple different deployment methods of application performance monitoring
- Experienced in Kubernetes, Terraform, Helm
- Proficient in Linux
- Proficient in scripting using Python, Shell
- AWS certification will be an added advantage

Interviews are ongoing right now! Apply quickly in order to be considered. Send resumes to or call Jaryl Low (R1982007) +65 3158 4457 to learn more about this and the many other positions that are available.

Jefferson Frank is the specialized AWS delivery arm of Frank Recruitment Group - the global niche IT recruitment specialists. We focus on quick delivery to our Key Clients on roles that are traditionally that little bit more difficult to fill. We've established an exceptional reputation for delivering the very best professionals to our customers. By focusing solely on the niche IT field, our consultants are genuine experts, meaning they not only fully understand the market, but have built solid relationships with the widest range of vendors, customers and specialists looking to progress their career. By specializing solely in placing candidates in this market I have built relationships with most of the key employers in APAC and have an unrivalled understanding of where the best opportunities & jobs are.

EA License Number: 11C3017 EA Personnel Number: R1982007