Cloud Site Reliability Engineer
Job description
As a Cloud Site Reliability Engineer, you’ll be at the forefront of innovation, working on our cloud products platform to ensure stability and optimal performance.
Where you’ll do it: This role is 100% remote as long as you’re in the EET or Central European time zone (+/- 2 hours).
The Interview Process: The 4-week process has 4 stages: a 45-minute HR chat ➡ a 45-minute Values & Technical chat ➡ a Home Task ➡ a 45-minute Technical interview
Technologies: Linux, Kubernetes, CI/CD, Prometheus, Helm, Bash
Reporting to: Cloud SRE Team Leader
Your team: You’ll be part of a team of 7 (Cloud SRE Lead, 5 Cloud SRE Engineers, and YOU!)
What will make your journey with us amazing?
A supportive manager who cares about your well-being and is invested in your professional growth.
A culture of continuous learning, with clear targets and feedback.
A global company with over 2600 employees located in more than 26 countries around the world, including offices in 3 countries: Ukraine, Portugal, and India.
What will you do?
The Cloud SRE team supports our cloud system, takes care of monitoring platforms, and provides 24x7 "Always On" support through on-call rotations. We automate manual processes, enhance monitoring tools, maintain documentation, and collaborate with other teams to ensure effective service delivery to customers.
What will you bring?
- A kind, empathetic, and collaborative personality, willing to learn and share knowledge openly.
- Proficiency in command-line interfaces, *nix systems (Linux, e.g. Ubuntu), and Git.
- Experience working with Kubernetes clusters, both Docker- and CRI-O-based, and familiarity with Helm charts.
- Understanding of monitoring tools such as Prometheus, Grafana, and Alertmanager.
- Demonstrated expertise in scripting (Bash).
- A proactive approach to taking ownership, supporting new ideas, and following through from ideation to post-release support.
- An autonomous and flexible working style, able to contribute independently and collaboratively, with strong research and analytical skills for informed decision-making.
- And as a bonus, we value a good sense of humor!
Will be a plus:
- Knowledge of AWS
- Experience with CI/CD tools like ArgoCD and FluxCD
- Experience with Ansible, Terraform, and New Relic
- Knowledge of programming languages like Python, Go, and PHP
What’s in it for you?
Embrace a 100% remote lifestyle with this opportunity!
Work with flexibility in a supportive environment where you have the autonomy to manage your time, while also staying connected with the team through daily check-ins and shared office hours. We value collaboration and commitment to team goals, balancing independence with structured support to ensure we all succeed together.
- Invest in your growth with dedicated learning resources and support.
- Thrive in a culture rooted in truth, trust, and transparency.
- Unleash your creativity and explore new ideas with 2 dedicated R&D days each month!
- Stay ahead of the curve with weekly team knowledge-sharing sessions.
- Escape the meeting marathon with 3 meeting-free days per week.
- Enjoy generous vacation policies to recharge when you need it.
- Be part of a unique team, not just another "cloud-shop": we run our own infrastructure!
#NamecheapCareers
#HackYourCareer
#equalopportunity