Principal DevOps Engineer (AWS)
Description
The Principal AWS Cloud Engineer leads a team of cloud engineers and DevOps professionals, ensuring the delivery of complex cloud solutions. This role involves coordinating with stakeholders to plan, design, develop, test, and maintain cloud applications and infrastructure. The engineer provides technical leadership in AWS architecture, ensuring systems are scalable, secure, cost-efficient, and of high quality.
Your responsibilities include:
- Leading and managing a team of AWS cloud engineers, to deliver complex cloud solutions.
- Coordinating with scrum masters, project managers, and other engineers to plan, design, develop, test, and maintain cloud-based applications and infrastructure.
- Providing technical leadership in all aspects of AWS cloud architecture and solutions, ensuring best practices and efficient use of AWS services.
- Overseeing the design, development, and implementation of new cloud systems and enhancements to existing systems, ensuring scalability, security, and cost-efficiency.
- Ensuring timely delivery of cloud engineering projects that meet quality standards and performance metrics
- Continually updating technical knowledge and skills, particularly in AWS services and cloud technologies, and sharing them with the team.
- Maintaining high standards of cloud solution quality within the team by establishing and promoting good practices and habits.
- Providing regular performance feedback, coaching, and career development opportunities to team members
- Facilitating root cause analysis of cloud system issues, implementing resolution strategies, and ensuring proactive measures are taken to prevent future occurrences.
- Developing and maintaining comprehensive documentation and diagrams
- Managing the release process and system architecture for smooth deployment and integration of new features
- Performing system maintenance procedures even during non-business hours and over the weekends
- Participating in an on-call coverage for support of systems outside of normal business hours
- Ensuring high availability of internal and production systems
- Implementing and maintaining standard server technologies (Linux and Windows) for performance, optimization, and capacity
Requirements
To qualify for the role, you should have experience in most of the following:
- AWS Cloud Computing Solutions (Required)
- Terraform and other laC tools
- Experience with Jenkins Pipeline Scripting
- Docker, Apache, TomCat, Wordpress, and Testlink
- Networking, IDS/IPS, and SAN
- Server computing (Linux and Windows administration)
- Virtualization (HyperV, VirtualBox, and VMWare)
- Encryption
- VPN (SonicWall/OpenVPN)
- ITIL Standards
- DB Administration (MySQL)
- Command line scripting: BASH PowerShell, DOS Batch, etc.
- Source code management: GIT and GIT services like Bitbucket
In addition to the above requirements, the ideal candidate should have the following development skills:
- Ability to use AWS through code scripts, not just navigate the AWS website dashboard
- Ability to create and modify Docker YAML files, not just use Docker's click-and-drag interface
- Ability to create scripts to automate or tweak IP tables, not just manually adjust them