Essential Skills:
• Build, scale, and support high-availability Ubuntu Linux production and development systems in public cloud environments.
• Work with tools like Jenkins, Ansible, Argo CD, Terraform, CloudFormation, Resource Manager to implement Infrastructure as Code (IaC).
• Manage and improve security/availability monitoring, ensuring policies are consistently enforced.
• Deploy workloads to AWS, Azure, or GCP; hands-on experience with core cloud services (instance management, IAM, databases, caching, troubleshooting).
• Deep understanding of Kubernetes (ability to build clusters from scratch).
• Expertise in load balancing, service mesh, and optimizing uptime/availability.
• Maintain quality documentation for infrastructure systems.
• Use Prometheus and monitoring tools to proactively resolve issues.
• Troubleshoot failures/performance issues; participate in on-call rotations.
• Code ownership in Go, Python, Rust, or Bash to build tools and improve integrations.
• Strong grasp of networking fundamentals (DNS, DHCP, routing).
________________________________________
Desirable Skills:
• Excellent spoken/written English and collaboration skills.
• BS in Computer Science or equivalent experience.
• 3+ years' experience with Linux/UNIX systems (troubleshooting, memory management, security).
• Experience with Ansible/Chef/Terraform for provisioning.
• Proficiency in CI/CD tools (Jenkins or similar).
• Programming skills in Go, Python, Bash.
• Working knowledge of MySQL/PostgreSQL databases.
• Experience with containers (Kubernetes, Mesos, Docker Swarm).
• Hands-on experience with AWS, Azure, or GCP.