Ref: JFIJA45_1633938492

Site Reliability Engineer

England, London

Job description

Site Reliability Engineer

JFIJA45_1633938492

The Role

We are looking for an engineer to bring Site Reliability Engineering to Submissions Systems, working alongside our various product squads made up of developers, QEs and systems specialists.

Whilst knowing about SLIs, SLOs and error budgets are essential, we are aware this is a narrow field so also require a mix of software engineering and operations background plus a relentless drive to improve product reliability through engineering efforts.



Responsibilities will include:

*

Advocating for Site Reliability Engineering (SRE) in a product squad and across the organization
*

Agreeing Service Level Objectives (SLOs) with everyone in the squad including product management
*

Defining and implementing suitable Service Level Indicators (SLIs)
*

Putting in place processes to track error budget and SLOs
*

Improving our products through infrastructure and software refinement
*

Ensure optimal construction and implementation of CI/CD pipelines.
*

Build automation supporting the effective management and consumption of cloud resources
*

Design and author soft infrastructure assets using IaC best practices Operate and maintain software systems across the full breadth of the SDLC
*

Actively participate in product refinement session to ensure a clear understanding of requirements
*

Work closely with Product Manager to ensure delivery according to business roadmap and priorities Deliver reliable working software into a continuous delivery pipeline to achieve sprint goals
*

Ensure team engineering processes (coding best practice, reviews etc.) and Agile processes are being followed and evolved
*

Coach and mentor junior team members



Key Skills and Experience



Must Haves:

*

Strong experience in both software and infrastructure engineering in large, multi-product environments
*

Good knowledge of authoring CI/CD pipelines, automation elements related to composition, deployment orchestration, and monitoring
*

Experience using modern scripting and OO programming languages as a contributing member within an agile dev squad, specifically, Node.js & Typescript
*

Knowledge and practical experience of Agile software development methodologies (Scrum and/or Kanban) including supporting backlog refinement and estimations (time and/or story points)
*

Strong knowledge of software engineering design principle, architect techniques to solve complex problems using best industry practices and processes
*

Performance or reliability improvement experience
*

Knowledge of the security controls required to host applications and services in the cloud



Nice to Haves:

*

Experience of working with the following or similar technologies: GraphQL, React.js
*

Experience in AWS Architecture ideally with an accreditation
*

Experience using Terraform as an AWS automation technology
*

Implementing service-based monitoring solutions
*

Containers (e.g. Docker) and any container platform technology (e.g. Kubernetes, ECS etc.)
*

Load balancing and service discovery across micro-services

If you like the sound of the role or know someone who would, please don't hesitate to get in touch with me! email r.durban@jeffersonfrank.com or click apply on the advert and I will get back to you ASAP.