Lead SRE/DevOps Engineer

Posted Jun 19

Description

Details

Your work ensures that approximately nine million customers can buy their banking products at any time through Client. You can make this difference by leading a team of SRE/DevOps Engineers and overseeing the development and implementation of robust systems and infrastructure. As a Lead SRE/DevOps Engineer at Belmont Lavan Ltd, you will drive the continuous improvement of the software development lifecycle and ensure the availability, scalability, and reliability of our systems.

Job Info

As a Lead SRE/DevOps Engineer, you will be responsible for:

• Leading a team of SRE/DevOps Engineers in implementing and maintaining CI/CD pipelines, infrastructure automation, and monitoring systems.

• Collaborating with software development teams to optimize the application architecture for performance, scalability, and reliability.

• Designing and implementing robust disaster recovery and business continuity plans.

• Defining and implementing best practices and standards for software development, deployment, and infrastructure management.

• Conducting regular performance and security audits to identify areas for improvement and implementing necessary enhancements.

Requirements

Skills

customer-oriented, driven, a natural leader, and deeply passionate about DevOps and Site Reliability Engineering. For this position of Lead SRE/DevOps Engineer at Belmont Lavan Ltd, you should have:

• A minimum of 5 years of experience in a similar role, preferably in a high-scale, cloud-native environment.

• Strong hands-on experience with cloud platforms (e.g., AWS, Azure, GCP) and infrastructure automation tools (e.g., Terraform, Ansible).

• Proficiency in at least one programming language (e.g., Python, Go, Java) and experience with containerization technologies (e.g., Docker, Kubernetes).

• Deep understanding of DevOps methodologies and practices, including CI/CD, configuration management, and version control.

• Experience with monitoring and observability tools (e.g., Prometheus, ELK stack) and implementing best practices for system monitoring and alerting.

• Strong problem-solving skills and the ability to troubleshoot complex issues in a distributed systems environment.

• Excellent communication and leadership skills, with the ability to effectively collaborate with cross-functional teams.