Site Reliability Engineer

Posted Apr 9

MaxMind (www.maxmind.com) is looking for a talented, experienced, and highly motivated Site Reliability Engineer (SRE) to join us. We help protect thousands of companies worldwide from fraud, screening over a billion online transactions each year, and we provide IP intelligence data to thousands more. As an SRE, you will play an important role in the continuous improvement and maintenance of MaxMind’s products, services and internal systems to deliver a performant and secure solution.

This is a full time remote position, and we welcome candidates to apply from anywhere in Canada.

For Canadian candidates, you must be eligible/authorized to work in Canada.

Our salary for Site Reliability Engineering roles is $160k CAD for new hires (in Canada). See more about benefits and compensation below.

The Position Overview

As a MaxMind SRE, you will make a difference in defining broader architectural, design, and technical objectives of MaxMind, and achieving customer satisfaction by:

  • Building performant and scalable SaaS solutions and the tools to maintain them
  • Ensuring the security of our infrastructure
  • Offering ideas and suggestions to the improvement of the development tool set, technical direction, and software architectureIdentifying, triaging, and resolving system issuesResearching changes in technologies, development environments, and tools
  • Enabling and extending complex system monitoring
  • Updating configuration management and deployments
  • Collaborating with, mentoring, and advising software engineers and the product team
  • Supporting on call after hours in rotation with other members of the team

Minimum Qualifications

  • Experience as a Site Reliability Engineer and/or System Administrator and/or DevOps Engineer for Highly Available SaaS solutions processing web traffic
  • Knowledge of TCP/IP, HTTP, DNS, TLS, and SMTP
  • Experience building complex monitoring solutions to support identification of issues with high availability web services
  • Able to investigate and resolve issues with Linux performance and network latency/reachability
  • Experience with Ansible, Terraform, or other configuration management and infrastructure as code software
  • Programming experience in Go or another language. Our SRE code is mostly Ansible and Terraform, but we also have a small amount of Go and Perl. We're happy to hear from you if more familiar with other programming languages or configuration management software too
  • Significant experience with Linux systems
  • Experience with version control, preferably Git
  • Strong analytical and problem-solving skills, with logical and repeatable debugging and problem solving approaches
  • Ready to learn new things
  • Excellent written and verbal communication skills with ability to communicate clearly with partners and end users
  • Able to work with a geographically distributed team

Desired, but not required

  • Experience managing PostgreSQL, including streaming replication and backups
  • Experience with Google Cloud or another major cloud provider
  • Experience doing security audits, security compliance, or penetration testing
  • Experience with HAProxy configuration, Docker, Kubernetes, or other container tools ELK/Elastic Stack, Cloudflare, Systemd configuration, Open source technologies
  • Experience with emerging cloud platforms and infrastructure tools

Our Site Reliability Engineering Practices

Our Site Reliability Engineers are members of our Engineering team, working together to deliver to our customers’ success. At MaxMind, we are committed to security and the contributions of our SRE team are integral to our work. To learn more about our commitment to security, visit https://www.maxmind.com/en/company/commitment-to-security. We have built a culture of peers, with highly developed practices and processes to work together remotely. To learn more about working at MaxMind, visit https://www.maxmind.com/en/company/working-at-maxmind.  

We use Linux, PostgreSQL, Ansible, and Terraform to deliver our solution. We use a wide variety of tools to manage and monitor our systems, including Prometheus, Grafana, and the Elastic/ELK stack. All work goes through internal code review on GitHub Enterprise.

Our goal is to automate as much as possible. Our tools are written in Go and Perl. We also want to improve our coding practices for the SRE code we write, writing libraries and tests wherever possible instead of one-off scripts.

Working at MaxMind

Our culture is very important to us. We’re friendly, collaborative, and work-focused. We don’t like office politics and unnecessary stress. We like to have productive workdays and don’t like work to chase us when we’re done for the day. We maintain a set of core, overlapping hours, but are flexible with specific start and end times and are understanding about appointments and life events. We care about helping each other succeed.

In a recent survey, employees listed having a supportive work culture, good co-workers, autonomy, and feeling trusted, valued, and respected as some of the things they liked most about working here.

MaxMind has a social mission. MaxMind donates over 60% of profits to charities. 

MaxMind’s compensation strives to reward getting stuff done, quality of work, and working well with others.

We are a fully remote company, so communication centers around video chat, and direct and group messaging tools.

Benefits

In addition to competitive compensation, our Canadian benefits include medical, dental, vision, life, accidental death and dismemberment, critical Illness, short and long term disability insurance, Employee and Family Assistance Program, and paid parental leave. You also have access to a group Retirement Savings Plan. In lieu of a Canadian RRSP contribution we provide a bonus payout at the end of each year that employees may decide to use toward retirement savings.

Everyone participates in a company performance-based bonus plan. MaxMind offers a $2,000 USD professional development budget and five days for professional development annually. 

Diversity and Inclusion

We're committed to diversity and inclusion and are mindful of incorporating them into all aspects of our company. 

We encourage and sincerely welcome applications from candidates of color, women, queer candidates, candidates with family caregiving responsibilities, transgender candidates, and from other communities not well represented in the tech world.

See our complete diversity and inclusion statement -  https://www.maxmind.com/en/company/working-at-maxmind.

Our Interview Process

One of the first steps in our interviewing process is a homework assignment, and we will ask you for a submission so we can gain insight into your work. The next step of our interview process would be a pairing session combined with a first round technical interview with our hiring manager and another one of our Site Reliability Engineers. This interview will cover questions specific to the role. Next, there would be a final interview with our COO and our HR Manager. That interview would be a more general / behavioral interview. The final step of our hiring process is reference checking.

We want to know about you, please share any projects or accomplishments and include a link so we can learn more.