About Our Client:
Our client is a leader in the digital experience industry, enabling the creation of unforgettable digital moments for a broad spectrum of users, from emerging artists to global brands. Renowned for empowering individuals to create stunning and powerful images, videos, and applications, they transform how companies interact with customers across every screen.
Responsibilities:
- Enhancing the reliability, scalability, security, and efficiency of offered services.
- Maintaining monitoring and alerting systems, and managing incident responses to ensure maximum uptime and Quality of Service.
- Collaborating with various teams to identify and resolve reliability issues.
- Managing incident responses and conducting postmortem analyses.
- Developing and maintaining automated deployment processes, and managing scaling and configuration using cutting-edge technologies.
- Optimizing operational processes through Infrastructure as Code (IaC) practices.
- Forecasting capacity needs and adjusting scalability to meet growing user demands.
Requirements:
- Strong communication and collaboration skills in an international environment.
- Experience in designing and scaling distributed systems and working with containerization and orchestration technologies.
- Proficiency with monitoring and observability tools such as Cortex, Prometheus, and Grafana.
- Programming experience, preferably in Python, TypeScript, Java, or Golang.
- Familiarity with the latest technological trends, especially AWS and Kubernetes.
- Higher education in computer science or a related field.
If you are ready to join a team that is changing the world through digital innovations, apply now! Prove that you can deliver better software faster than ever before!