Jobs /

Site Reliability Engineer

Red Hat

Apply Now

Job Details

Location: Posted: Mar 28, 2020

Job Description

Company Description

At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

Job summary

The Red Hat Software Engineering team is looking for a Site Reliability Engineer to join our Hosted Service Delivery team in Ireland. In this role, you will help build the platform to run all user-facing Software-as-a-Service (SaaS) offerings on top of Red Hat OpenShift using site reliability engineering (SRE) industry best practices. You'll focus more on coding than operations, keeping the Red Hat OpenShift platform available and secure by interacting with site reliability engineers (SREs) and engineering teams to manage provisioning, upgrades, and problem detection in clusters. As a Site Reliability Engineer, you will also be responsible for delivering automation for issue remediation, incident management, and fault resolution to ensure that service-level agreements (SLAs) are met.

Primary job responsibilities

  • Create software delivery pipelines to increase service team velocity and confidence
  • Establish and enforce SRE best practices through platform constraints and interface requirements
  • Help service teams develop software operators against Red Hat OpenShift Kubernetes APIs to manage service life cycle events automatically
  • Serve as a point of contact with other SRE and Product Engineering teams

Required skills

  • Bachelor's or master's degree in computer science, engineering, math, or an equivalent degree or experience
  • Experience developing software systems for running other software or applications
  • Understanding of distributed systems and common distributed system failure domains
  • Experience testing in a distributed environment
  • Understanding of monitoring and alerting best practices
  • Ability to effectively work in a globally distributed team

The following experience will be considered a plus:

  • Managing a production service with Red Hat OpenShift or Kubernetes
  • Developing a Kubernetes controller, operator, or platform component
  • Operations experience with a production user-facing application
  • Writing a continuous delivery (CD) pipeline for highly available applications
  • Experience with Golang

Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, uniformed services, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.

Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

About Red Hat

Red Hat is the world’s leading provider of open source solutions, using a community-powered approach to provide reliable and high-performing cloud, virtualization, storage, Linux, and middleware technologies. Red Hat also offers award-winning support, t...

View Website

Get More Interviews for This and Many Other Jobs

Huntr helps you instantly craft tailored resumes and cover letters, fill out application forms with a single click, effortlessly keep your job hunt organized, and much more.

Sign Up for Free