Titre du poste ou emplacement

Site Reliability Engineer for Cloud Systems

Yelp, Inc

Toronto, ON

Détails de l'emploi :

Télétravail
Temps plein
Expérimenté

Elevate the stability of cloud services as a Site Reliability Engineer. Tackle extensive system challenges while ensuring platform reliability and performance in a fully remote capacity. In this position, you will work with engineers across multiple functions to develop and maintain scalable, self-healing infrastructure. Your role will emphasize creating automated solutions and monitoring tools, ensuring the reliability of critical data stores while maintaining service level objectives. Key Responsibilities: • Monitor platform stability using advanced tools • Help integrate services and support new features • Automate processes and infrastructure tasks • Design new systems and tests to enhance performance • Collaborate in light on-call rotations for team support Requirements: • Strong Linux knowledge, debugging OS behaviors • Proficient in programming languages such as Typescript or Ruby • Familiarity with AWS and GCP public cloud platforms • Experience with container orchestration technologies • Self-motivated with strong teamwork abilities Enhance system performance and user experience through your expertise, working in a collaborative remote environment that fosters innovation.