Site Reliability Engineer
£45,000 - £55,000
London or Remote contract
My client are the largest travel operator/airline in Europe and are leading the new age of the travel industry by migrating their services onto a new state-of-the-art AWS platform. This is a very exciting opportunity to be part of a relatively greenfield cloud platform within a huge enterprise, with fantastic scope for career growth alongside it.
As a Site Reliability Engineer, you will be part of a cross-functional team or a practice team that enables site reliability engineering skills and capabilities across a whole domain. As an SRE enthusiast with a strong DevSecOps mindset and excellent collaboration skills, you will work with your team to deliver the best answers to our customers' needs and to take full responsibility for its applications, from design to operation. You care diligently about the quality of your work, including proper documentation and security aspects.
You will use your deep technical skills to enable your team to deliver operational excellence and to ensure and improve the reliability, performance and maintainability of systems and services. You work closely with your team to understand the operational processes and the technical and business needs of the products and services your team is responsible for. You ensure observability of systems and services, and support change and configuration management. You will be involved in raising operational readiness requirements as part of the development life cycle and in validating that software development and delivery are consistent and meet the specified requirements. You hunt for performance optimisations and recognise upcoming problems before our customers are impacted. You will continuously improve CI/CD and automation maturity and efficiency. You will support your team with efficient incident handling and quick reaction to production problems. For this you can expect to take part in an on-call rota. You can work hands-on, tackling the whole design, build, test and deploy cycle, and taking proactive corrective action where required.
You are able to verbalise your thoughts and ideas and take the initiative to translate ideas into outcomes. Together with the teams in the Domains, the relevant Practice teams and the Group Enabler teams, you will also research, evaluate and test new approaches, processes and tools, and help teams to use them effectively. You actively contribute to Communities of Practice, including collaboration in shared initiatives.
Skills & Qualifications
* Strong hands-on experience with Amazon Web Services (AWS) and in managing scaled cloud systems with a focus on solution architecture, the various tools and services, infrastructure-as-code, and DevSecOps practices
* Experience working with highly available, distributed systems and services in a cloud environment, and with defining, developing and rolling out technical operations processes and new services across teams and markets
* Strong experience with monitoring/observability solutions, preferably Datadog, as well as with incident response solutions such as PagerDuty
* Willingness to take part in an on-call rota
* Deep automation expertise, hands-on with some programming languages, e.g. NodeJS, Python, or Bash scripting
Full details of the benefits are available on request. Benefits include pension, healthcare, and discounted travel and holidays for employees and their families.
If this sounds of interest, please get in touch by applying below, or contact Steven at Jefferson Frank directly at: email@example.com