Azure Site Reliability Engineer (US/ Pacific Time Zone)

Retrieved 16 days ago from Remotive

Overview

Stack Overflow is growing fast, and our infrastructure needs just keep getting bigger. We’re looking for a Site Reliability Engineer to join our existing team of SREs and devs and help us grow the Microsoft Azure side of our infrastructure. As an SRE, you’ll bring a DevOps mindset to system administration, always looking for ways to automate manual work and create repeatable, scalable systems and processes.

We’re looking for someone with .NET ecosystem experience in an Azure environment (or general Windows Server / IIS experience) (3+ years), but we don’t expect you to know every other part of our stack coming in, so we’ll pair you with other members of the team to learn and develop your skills across our entire infrastructure (including our non-cloud Stackoverflow.com infrastructure). We are a mixed Windows and Linux environment and expect this role to be strong in Windows but learn Linux as we move more infrastructure to it.

What you’ll work on:

  • Help one of our newest products, hosted Stack Overflow Enterprise, grow to its first 1,000 customers and million users
  • Automate the manual steps remaining in deploying and upgrading Stack Overflow Enterprise customers on Azure
  • Work to improve our monitoring and alerting strategy for cloud solutions
  • Work to improve our security patching and compliance strategy for cloud solutions
  • Participate in creating an VM appliance version of our product
  • Participate in our on-call rotation

Our ecosystem includes:

  • Microsoft Azure (Azure SQL, Microsoft SQL Server, Azure Automation, Azure AD)
  • Windows Server 2016 and IIS and .NET Core
  • Linux (we use CentOS)
  • PowerShell / DSC
  • Terraform / Go
  • Our toolchain includes: Git, GitHub Enterprise, TeamCity (CI), CentOS Linux, Puppet, .NET/C#, ElasticSearch, Redis, OctopusDeploy
  • In the future: Containers and Kubernetes

Skills & Requirements

We’re looking for:

  • 3+ years of Windows Server experience (we run 2012R2 and 2016)
  • 3+ years of Azure experience or equivalent Amazon AWS, Google Cloud, Digital Ocean, etc.
  • PowerShell experience, and a developer’s mindset towards system administration (always looking to automate manual tasks)
  • Strong written communication skills and a strong inclination to “document as you go”
  • Linux experience in a mixed environment (we use mainly CentOS)
  • Some Microsoft SQL Server experience (Azure SQL a plus) or other SQL experience
  • Basic familiarity with: Networking, DNS, SSL certificates

We like to see:

  • Deep experience with Azure administration, debugging, and API use
  • Knowledge of programming beyond scripting (we use mainly C# And Go)
  • Experience working both on a team and on independent projects
  • Good communication and people skills

What you’ll get in return:

  • Flexible hours
  • 20 days paid vacation + holidays
  • Completely free health insurance - no copay, no premiums (US residents)
  • Generous parental leave (10-16 weeks at 100% pay), family care leave, and unlimited sick days
  • Employees will never be poked with a sharp stick

This is a remote position… (US/Pacific time zone) While we are a remote-first team with team members all over the world, this position requires collaborating with people in Sydney and NYC, therefore you must be located in the US/Pacific or compatible time zone.

Apply Instructions