Senior/Lead Site Reliability Engineer
PrimaryResponsibilities include but are not limited to the following:
o Toolsdevelopment and automation using Ruby, Rails, Capistrano, etc. toincrease availability and performance
o Administrationof Linux machines, Web servers, Application servers, Databases
o Applicationand infrastructure support for customer environments
o Collaboratewith Product and Support teams to plan and deploy product releases
o Participatein 24x7 on-call rotation for after-hours emergencies
o Engagein and improve the whole lifecycle of services—from inception and design,through deployment, operation and refinement
o Abilityto operate in the high-pressure environment and troubleshoot complex issuesquickly, while successfully handling multiple priorities
Requirements:
o Understandingand experience in cloud infrastructure and platforms, such as AWS and Azure
o 7+years of professional experience
o 5+years of production system administration and web operations experience
o 5+years of experience with Linux operating systems internals and administration(e.g., filesystems, inodes, system calls)
o 3+years of experience with programming using Java, Perl, PHP, Python, Ruby orequivalent
o 3+years of experience with configuration management tools like Chef, Puppet, Saltor equivalent
o Design,implement, manage and orchestrate container clusters
o Experiencein massive-scale web operations
o Excellentwritten and verbal communication