Site Reliability Engineer Observability

Experis
North York, ON
Posted today
Job Details:
Remote
Full-time
Experienced

Contract Term: 4 months, renewable
Work Location: downtown Toronto, hybrid, 3 days a week onsite but flexible to be more remote as time goes on

Our client, a leading global IT company, is looking for a Site Reliability Engineer Observability.

Required skills:
Hands-on technical SRE experience: automating operations, reducing toil, and providing L2/L3 engineering support
Advanced knowledge of SRE practices and the Dynatrace tool, including:
o DQL
o Gen3 dashboards
o Traces on Grail
o ActiveGate / plugins
o Dynatrace application development
o Site Reliability Guardian (SRG) / Workflow development
o BizEvents
Expert-level UI/UX design skills for dashboards
Design and implement creative ways to monitor and observe systems, such as IBM DataPower, that lack sufficient observability
Background in or knowledge of OpenTelemetry (OTel)
Experience using APM tools such as Grafana, Prometheus, Splunk, AppDynamics, or similar
Advanced knowledge of at least one programming language (Node.js/Python/Java)
Experience with AWS technologies such as EC2, Lambda, ELB, S3, CloudWatch, IAM, KMS, VPC, DynamoDB, and RDS

Responsibilities:
• Responsible for the design, development, and implementation of Site Reliability Engineering (SRE) solutions
• Embed with application teams to improve SRE practices, participate in hands-on SRE Squad activities, and navigate multi-team SRE and IT Operations to drive results
• Drive an Infrastructure as Code mentality within teams to adopt better observability planning
• Demonstrate proofs of concept for new technologies to teams to rapidly increase SRE adoption
• Define service availability requirements; develop, monitor, and alert on SLIs/SLOs
• Document and maintain runbooks and procedures; automate as much as possible to reduce toil
• Drive continuous improvements in productivity, monitoring, tooling, and best practices

Nice to have:
• OSS knowledge is a bonus
• Experience in financial services or equivalent environments (i.e., very complex end-to-end transaction processing where 50+ systems work together to fulfil one customer request)

Contact Information: Lam Guan
