Jobs /

Lead Site Reliability Engineer

RBC

Apply Now

Job Details

Location: Toronto, Golden Horseshoe, Ontario, Canada Posted: Sep 14, 2021

Job Description

What is the opportunity?

We are looking to add a Lead Site Reliability Engineer (SRE) with a strong problem solving and engineering background to the Retail Banking & Payments Technology SRE organization. The team will work collaboratively with the application development arm of the organization and other IT partners required to succeed in its mandate. As the Lead Site Reliability Engineer, you will provide the leadership to an SRE team that will be responsible for successfully executing the strategy in transforming IT Operations. From Monitoring to incident response, SREs are focused on building and monitoring anything in production that improves service resiliency and reducing repetitive manual tasks.

What will you do?

  • Lead a squad in implementing SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing) while striving to reduce toil using automation tools
  • Perform code and non-functional (performance, security, maintainability) reviews of all production bound SRE solutions
  • Help drive transformation by continuously looking for ways to automate existing processes
  • Maintain technology currency (perform server patching, certificate renewal, etc.) with keen eye on automating opportunities
  • Run engineering mindset meetups accelerating breadth and depth of knowledge in community
  • Perform a production support role: proactive monitoring of environments, troubleshooting all systems and applications in scope, including off-hours support
  • Drive incident response: facilitate communication channels, develop and execute playbooks, meet SLOs, and coordinate within own squad and other application stakeholders to get to resolution
  • Assist in incident management and problem management for applications in scope

What do you need to succeed?

Must-have

  • Production support experience in effectively guiding a team through the incident response process
  • Experienced people manager that prioritizes engineering while maintaining production resiliency and compliance standards
  • Hands-on experience in a variety of SRE languages and tools including Ansible, Dynatrace Managed, Moog, PagerDuty, ServiceNow, GitHub, Slack, Elastic, Logstash, Kibana, Blue Prism, Catchpoint
  • Software engineer experience with production class delivery, strong analytical mindset, communication skills, and sense of ownership / drive (SRE, DevOps, Cloud, Data)
  • Intermediate experience in a variety of environments including Cloud, distributed and mainframe, business workflows and services/APIs, databases
  • Experience with Agile (SCRUM) methodology
  • Experience supporting applications in cloud environments (CloudFoundry), containerized (OpenShift)
  • Programming experience with Java, SpringBoot, Groovy, and/or shell scripting
  • Knowledge of vendor platforms build and deploy process: IIB 11, IBM ODM, Apache Nifi

Nice-to-have

  • Experience with Docker, OpenShift
  • Knowledge of networking and security
  • Familiarity with performance engineering concepts
  • Experience with Site Reliability Engineering (SRE)

What’s in it for you?

We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.

  • A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable
  • Leaders who support your development through coaching and managing opportunities
  • Work in a dynamic, collaborative, progressive, and high-performing team
  • Opportunities to do challenging work
  • Opportunities to take on progressively greater accountabilities
  • Access to a variety of job opportunities across business and geographies

Learn more about RBC Tech Jobs
Join our Talent Community
Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.
Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at rbc.com/careers.
JOB SUMMARY
City: Toronto
Address: 88 Queens Quay West
Work Hours/Week: 37.5
Work Environment: Office
Employment Type: Permanent
Career Level: Experienced Hire/Professional
Pay Type: Salary + Variable Bonus
Required Travel(%): 0
Exempt/Non-Exempt: N/A
People Manager: Yes
Application Deadline: 11/14/2021
Platform: Technology and Operations
Req ID: 404479
Ad Code(s):

About RBC

RBC is a Canadian multinational financial services company and the largest bank in Canada by market capitalization.

View Website

Get More Interviews for This and Many Other Jobs

Huntr helps you instantly craft tailored resumes and cover letters, fill out application forms with a single click, effortlessly keep your job hunt organized, and much more.

Sign Up for Free