Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailNot Disclosed
Salary Not Disclosed
1 Vacancy
Site Reliability Engineer (SRE)
Remote Job
Position Type: Fulltime permanent role
Salary: $96K/Annum Benefits in USD
Responsibilities
Handling oncall responsibilities with minimum supervision
Monitor and continually improve the capacity of our production environment
Design and implement scalable reliable and efficient infrastructure using Kubernetes Terraform AWS resources.
Partner with development teams to improve services through rigorous testing and release procedures with CI pipelines (Github Actions Dockerfiles)
Gain a deeper understanding of RudderStack infrastructure and help debug incidents
Proactively build software to help operations and support teams
Identify opportunities for process improvements automation and cost savings
Identifies parts of the system that do not scale provides immediate palliative measures and drives long term resolution of these incidents.
Acts as a champion for reliability in the company
Requirements
A Bachelor or Master degree in Computer Science or equivalent experience is required
8 years of experience as a Site Reliability Engineer Internal Platform Developer or similar role
Strong understanding of cloud computing containers and DevOps practices
Excellent debugging skills
Experience with high availability administration of data stores like Postgres and Redis
Experience with Scripting and infrastructure automation
Demonstrated Linux experience
Familiarity with distributed systems design patterns using tools such as Kubernetes
Familiarity with AWS Azure or Google Cloud Compute
Excellent verbal and written communication skills
Familiarity with Networking concepts like VPCs proxies and CDNs
Kubernetes,Python
Full Time