Site Reliability Engineering (SRE)
Contract length: June 2025 - June 2026 (1-year contract)
Location: Hybrid, 3 days onsite and 2 days remote (onsite at either SCC, Victoria Park and McNicoll, or BMO Place, Yonge and Dundas)
Rate: $65-85/hour
Keywords:
- 3+ years SRE experience, Dynatrace, APM, AWS
- Experienced with Node.js, Python, or Java.
Your New Company
Join a global leader in technology consulting, delivering transformative solutions to major financial institutions. Our client is a world-renowned IT services provider, and you will be working on a high-impact engagement with a leading North American bank. This is a fantastic opportunity to work on a cutting-edge Site Reliability Engineering (SRE) project within a collaborative and innovative environment.
Your New Role
As a Site Reliability Engineering (SRE) Observability SME, you will play a critical role in enhancing the monitoring, reliability, and performance of complex applications and infrastructure. You will be responsible for designing and implementing creative observability solutions, leveraging advanced SRE practices and tools. This position is a 1-year contract, based in the Greater Toronto Area (GTA), requiring onsite presence three days per week (either at Victoria Park & McNicoll or Yonge & Dundas locations), with the flexibility to work remotely for two days.
Key Responsibilities:
- Automate operations and reduce manual toil, providing L2/L3 engineering support.
- Lead the design and development of advanced dashboards (UI/UX) and observability solutions.
- Use Dynatrace (DQL, Gen3 dashboards, Grail, ActiveGate/Plugins, SRG/Workflow, BizEvents) to monitor and analyze system performance.
- Innovate monitoring strategies for systems with limited observability, such as IBM DataPower.
- Integrate and optimize observability using APM and monitoring tools (Grafana, Prometheus, Splunk, AppDynamics, or similar).
- Develop and maintain solutions using at least one programming language (Node.js, Python, or Java).
- Leverage AWS services (EC2, Lambda, ELB, S3, CloudWatch, IAM, KMS, VPC, DynamoDB, RDS) to support cloud infrastructure.
What You'll Need to Succeed
- Minimum 3 years of hands-on SRE experience, with a proven track record of automating operations and supporting L2/L3 engineering.
- Advanced expertise in SRE practices and Dynatrace observability tools.
- Master-level proficiency in dashboard UI/UX design and implementation.
- Strong background in monitoring systems with limited native observability.
- Familiarity with OpenTelemetry (OTel) and experience with leading APM tools.
- Advanced programming skills in Node.js, Python, or Java.
- Solid experience with AWS technologies, including compute, storage, networking, and security services.
- Excellent problem-solving and communication skills.
- Ability to work both independently and as part of a cross-functional team.
What You'll Get in Return
- Competitive hourly rate.
- Opportunity to work with a top-tier IT services provider on a prestigious banking client project.
- Hybrid work model (3 days onsite, 2 days remote) in prime GTA locations.
- Exposure to the latest cloud, observability, and SRE technologies.
- Chance to make a significant impact on mission-critical financial systems.
What You Need to Do Now
If you are ready to take your SRE expertise to the next level and thrive in a dynamic, high-visibility project, we want to hear from you! Apply today with your updated resume and a brief cover letter outlining your relevant experience. Our recruitment team will be in touch to discuss your fit for this exciting opportunity.
Note: Only candidates legally eligible to work in Canada will be considered. This is a contract position from June 2025 to June 2026.