Role: Site Reliability Engineer
Location: Montreal, Canada (3 days onsite and 2 days remote)
Duration: 12+ months
Interview: Phone/Skype
The hiring manager has told us that he can teach someone the IAM skill set; what he really needs is strong SRE, Unix/Linux, command-line, and Python automation experience. Candidates who do not have everything listed in the job description below are fine as long as they have SRE, Unix/Linux, command-line, and Python automation experience.
Position Overview:
- Support Identity Management applications for human and system account management built on the PingIdentity / ForgeRock Identity Management products.
- Maintain LDAP Directory Services products such as PingDirectory and PingDataSync
- Provide first-line support for Identity Management during large-scale outages, including postmortems, pre-mortems, and problem management, with data-driven strategies and a code-first approach to problem solving.
- Prepare and execute change management activities, often automating and creating tools where necessary.
- Collaborate with partner enterprise technology teams and provide support to stakeholders and our Level 2 operations team
- Contribute to performance and training assessments of team members
- Improve the stability of Identity Management platforms by identifying and implementing alert automation and self-healing functions where possible.
- Performance and scalability: ensure systems can scale seamlessly to handle increased load, and monitor the performance of our applications and infrastructure using our service level objectives (SLOs) and service level indicators (SLIs).
- Participate in on-call rotations (weekday and weekend cycles)
- Share responsibilities and knowledge across the team, engaging with our community and stakeholders to gather feedback to improve our systems.
- Address security and compliance issues, ensuring that our systems meet industry standards and implement best practices for security and data protection.
Preferred Qualifications
• Experience in Identity and Access Management (IAM)
• Previous experience in IT Operations, Reliability Engineering, or DevOps
• Exposure to Agile / DevOps environments
• Knowledge of Site Reliability Engineering (SRE) principles and methodology
Technical Skills
• Basic understanding of the LDAP protocol and its functionalities
• Experience supporting PingDirectory and PingDataSync
• Familiarity with IAM products from providers such as ForgeRock and Ping Identity
• Knowledge of enterprise security standards and concepts
• Understanding of general enterprise infrastructure concepts and troubleshooting, including network, storage, web infrastructure, middleware, etc.
• Basic knowledge of operating system administration on Windows (Active Directory) and Red Hat Linux platforms (RHEL 7 or later)
• Proficiency in at least one scripting language such as PowerShell, Python, or shell (Bash)
• Functional knowledge of C/C++ and Java can be advantageous for some tooling.
• Experience working within large enterprise architectures
• Familiarity with the Software Development Life Cycle (SDLC) and development environment tooling (GitHub, Jenkins, Visual Studio Code, etc.)
• Familiarity with visualization, planning, and incident management tools such as Splunk, Grafana, ServiceNow, Jira, Bitbucket, PagerDuty, and Power BI
• Recommended: Foundational knowledge of authentication protocols in the broader IAM domain, such as OpenID Connect, SAML, Kerberos, and RADIUS, and multifactor authentication solutions like RSA SecurID, Cisco Duo Security, FIDO, etc.
Soft Skills
• Strong interest in account management systems, directory products, and information security
• Excellent written and oral English communication skills; capable of writing documentation, making presentations, and positively interacting with colleagues and customers
• Independent problem-solving attitude, highly motivated, and self-directed
• Comfortable working within an operations and support team with end-user interaction and periodic on-call responsibilities
• Advocate of SRE principles
• Good organizational skills