We are seeking a Site Reliability Engineer. As a member of our team you will have the opportunity to work on a variety of different projects including our cloud marketplace offerings, hosted applications, and internal infrastructure, which will greatly impact our end-users and employees. You will be given a large amount of autonomy to determine the right tool for the right job. This is a cross functional role and you will partner closely with the engineering, support and solutions teams.
What you may work on:
You will design operational processes and solutions to proactively address issues before they become customer facing
Identify persistent or recurring problems and recommend creative solutions
Build tools and solutions for bridging software development teams with system infrastructure
4+ years experience with a high-level scripting language such as Python or Ruby
3+ years advanced-level experience with Linux
2+ years experience with Amazon Web Services
You have experience designing and implementing elastic solutions while ensuring no single point of failure
Familiarity with system scalability, monitoring, and performance with the ability to troubleshoot systems, network, and storage
Proficiency with automation tools, such as Chef or Puppet, in a production environment
Solid understanding of the challenges with creating, scaling, and managing distributed applications and service
Educate, train, and coach the engineering and solutions engineering teams in best practices
Effectively use tools and techniques to maximize impact on scaling services and systems
Ability to work independently on a number of projects with employees across teams
Experience with Kubernetes is a plus
If interested in applying for this position, please e-mail your CV/Resume directly to firstname.lastname@example.org
Subject: "Candidate Submission: Job Title"
No 3rd party recruiters or agency submissions are accepted.