Jobs /

DevOps Manager

OpenTable

Apply Now

Job Details

Location: Austin, Travis County, Texas, USA Posted: Dec 02, 2021

Job Description

Manager of Site Reliability Engineering

The Team

The Serving Platforms team, within Infrastructure Engineering, is responsible for building and maintaining the container stack and its lifecycle using the latest automation and configuration management tooling. We're skilled in many different engineering disciplines and work with a DevOps mindset. If the tools don’t exist that meet our needs, we write or script them ourselves. The team has a high impact and works closely with all of the engineering organizations at OpenTable. We strive for efficiency, reliability, and velocity while also exploring new technologies that can bring business value and help us achieve company goals.

We provide the following services across the company

  • Building and maintaining the container platform (Kubernetes)
  • Config management infrastructure administration (Puppet, Ansible)
  • CDN & Message Bus (Akamai, Kafka)
  • Cloud service operation (AWS)
  • Supporting Security efforts

About The Role

The Manager of Site Reliability Engineer leads the Serving Platforms team that supports OpenTable’s development and production container infrastructure. In this role, you will work with multiple engineering teams across the globe as a technical leader on Kubernetes and other technologies owned by the Serving Platforms team. You will lead high impact projects, mentor team members technically, support team priorities, and help develop good communication with stakeholders. You can expect to build greenfield projects, mitigate some amount of legacy infrastructure, and participate in on-call rotation. We're looking for someone exceptional to join and lead our team.

About You

You are a subject matter expert on all things Kubernetes.

You love working in a small, agile environment. You enjoy building automation and self service tools. You like learning new languages or skills and sharing your findings with others. You’re detail oriented, enjoy writing code, and implementing DevOps principles.

You are able to effectively communicate, collaborate, and influence team members and engineering technical leads across many time zones. You are able to organize and plan projects from start to finish with other technical leads as needed.

At its core, this role requires excellent problem solving skills. You will mentor, teach, and guide others as we strive to develop successful independent teams that tackle problems in an efficient and cost effective manner.

Required experience:

  • 5+ years of experience leading and managing a team of site reliability engineers on multi-team projects
  • Able to develop long terms plans based on goals set by the team and business
  • Demonstrated ability to, plan, prioritize the work of others, drive, and contribute to multi-dimensional projects across multiple teams
  • Able to develop new concepts, methods, techniques and innovate
  • Able to represent team in areas such as incidents, technical direction, and planning
  • Able to communicate effectively with team members of diverse technical backgrounds
  • Strong expertise with Kubernetes and docker in a hybrid environment
  • Proven hands on Linux experience (Ubuntu, CentOS, Etc.)
  • Solid understanding of systems administration concepts
  • Demonstrated experience with scripting languages such as GoLang, Python, Ruby, Perl, or Bash
  • Solid understanding of cloud computing - AWS, GCE, Azure
  • Experience in incident response and root cause analysis service disruptions
  • Solid experience with config management tools such as Puppet or Ansible

Nice to have

  • Familiarity with CI/CD Pipelines using tools such as Github, Artifactory, Jenkins, TeamCity, Docker registry, etc.
  • Experience working with K/V stores such as zookeeper, redis, or consul in production
  • In depth experience with virtualization technologies such as VMware, ESX, xen, openstack
  • Experience working with monitoring and alerting systems such as Sensu, Graphite, Logstash, and Nagios
  • Applied knowledge of working and communicating with a globally distributed team
  • Experience with Windows Server OSs

About OpenTable

OpenTable, part of Booking Holdings Inc. (NASDAQ: BKNG), is the world's leading provider of online restaurant reservations, seating more than 24 million diners per month via online bookings across approximately 40,000+ restaurants.

Since its inception in 1998, OpenTable has seated more than 1 Billion diners around the world. The Company is headquartered in San Francisco, California, and the OpenTable service is available throughout the United States, as well as in Canada, Germany, Japan, Mexico, Australia and the UK.

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

#LI-JL1

#LI-Remote

About OpenTable

We create innovative technology to connect people and restaurants.

View Website

Get More Interviews for This and Many Other Jobs

Huntr helps you instantly craft tailored resumes and cover letters, fill out application forms with a single click, effortlessly keep your job hunt organized, and much more.

Sign Up for Free