Principal Site Reliability Engineer for AWS EKS Infrastructure

Parallel Domain

Vancouver, BC

Job Details:

Full-time
Executive

Join as a Principal Site Reliability Engineer focused on the reliability and scalability of AWS cloud infrastructure. You will apply your expertise to managing simulation workloads for autonomous vehicle development. In this hands-on role, you will be the primary architect of a multi-region AWS/EKS platform. You will steer ongoing improvements and security measures while collaborating across engineering teams to ensure high service availability and customer satisfaction.

Key Responsibilities:
• Oversee AWS infrastructure and optimize performance
• Manage EKS operations, including autoscaling and health checks
• Drive GitOps practices for effective application deployment
• Handle complex networking challenges, including VPC design
• Implement proactive SLO measurement and incident management

Requirements:
• 5+ years in SRE, DevOps, or similar technical roles
• Experience with infrastructure-as-code tools; proficient in Terraform
• Extensive AWS expertise, including EKS and network services
• In-depth Kubernetes management capabilities required
• Knowledge of monitoring tools; experience with Grafana desirable

Elevate cloud reliability and security as you drive infrastructure projects, ensuring seamless service delivery for high-performance autonomous vehicle solutions.