Software Development Mgmt 3
The WEX Site Reliability Engineering (SRE) team is a newly formed team and we are looking for an experienced Manager to help build the overall SRE team and program. The team will be part of the Enterprise Platform Services organization which is building out common tools and services for all of WEX. The team is progressively responsible for full-stack observability, level two alerting, reliability engineering and incident response for products and services that are on-boarded into the team. We aim to ensure that our production environment remains available to our customers and we aim to proactively address reliability issues before they affect performance or availability.
Key Responsibilities:
- Act as 'gatekeeper' for the on-boarding of new services into the SRE team to ensure that services accepted into the SRE program have been adequately documented, level 1 response Run-books have been developed, sufficient monitoring has been put in place and escalation channels for various service owner teams have been agreed
- Own end-to-end availability (SLO/SLA), reliability, and performance of our platform by developing processes, metrics and engineering projects that ensure maximum reliability and uptime for our customers
- Hire exceptional SRE talent and help build, and grow a team that is based in the US and Brazil.
- Collaborate with the existing Engineering teams to understand deployment practices and processes and work towards iteratively improving the release pipeline to ensure a highly resilient deployment strategy, ideally with zero downtime
- Establish an on-call cadence with the team and ensure adequate coverage of key areas
- Participate in 24x7 Site Reliability rotations and escalation workflows
Preferred Qualifications
- 5 years+ managing an SRE (or related) team
- A passion for SRE/DevOps principles and a clear drive to remove toil and run highly resilient/automated systems
- Ability to encourage and foster a culture of visibility and transparency across teams
- Passion for automation and a reluctance for manual implementation
- Strong philosophy of continuous improvement
- Experience with Cloud Computing platforms (particularly AWS) a plus
- Strong technical fundamentals including Linux, TCP/IP, Docker, CI/CD
- Working knowledge of databases including SQL, indexing, and schema design
- Bias towards data-driven decisions and ensuring key metrics are agreed on, visible and actionable
- Experience in software development in one or more of the following: Java, Python, Go
- Experience managing an engineering team on projects with technical deep-dives into code, networking, operating systems and/or storage
- BA/BS degree in Computer Science or related technical field, or equivalent practical experience.
The base pay range represents the anticipated low and high end of the pay range for this position. Actual pay rates will vary and will be based on various factors, such as your qualifications, skills, competencies, and proficiency for the role. Base pay is one component of WEX's total compensation package. Most sales positions are eligible for commission under the terms of an applicable plan. Non-sales roles are typically eligible for a quarterly or annual bonus based on their role and applicable plan. WEX's comprehensive and market competitive benefits are designed to support your personal and professional well-being. Benefits include health, dental and vision insurances, retirement savings plan, paid time off, health savings account, flexible spending accounts, life insurance, disability insurance, tuition reimbursement, and more. For more information, check out the "About Us" section.Salary Pay Range: $126,000.00 - $169,000.00
- Health insurance