Site Reliability Engineer, AWS Capacity Management

Airbnb

Job Details

Location: 888 Brannan St, San Francisco, CA 94103, USA

Posted: May 22, 2019

Job Description

Site Reliability Engineering at Airbnb:

Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of Airbnb's infrastructure and products. SREs design and implement the tools that automate building reliable and performant systems.

In addition to driving performance and reliability, SRE at Airbnb runs our AWS Capacity Management Program out of Airbnb’s San Jose office. In this function, you will partner with AWS on efficiency initiatives that help Airbnb make better use of our cloud resources and right-size the largest portions of Airbnb’s fleet. This will involve developing tooling to ensure that microservices are launched with efficient resource utilization, as well as tooling to alert us when capacity growth outpaces projections. Long term, this position will also involve building tooling for long-term capacity planning models.

What makes Site Reliability Engineering different at Airbnb?

  • We emphasize building tools over manual processes. We create, not operate. Things should go from repeatable to automated quickly
  • We're rooted in open source ( http://airbnb.io/ ) and give as much back to the community as possible, both through new projects and through contributions to existing ones
  • Our job is to focus on building reliable infrastructure and tools for our product teams so that they can focus on solving user problems and building new features, not reinventing platforms
  • SREs don't sit on the other side of the tossing fence -- we're first-class engineering citizens and help lead our infrastructure focus

What are some examples of Site Reliability Engineering work at Airbnb?

  • Develop self service tooling to allow product engineering teams to efficiently utilize resources when launching applications in our Kubernetes clusters
  • Develop an AWS Cost Attribution framework that allows teams to understand
  • Work with product engineering teams on design and implementation choices of large scale distributed systems
  • Automate as much as humanly possible and always configure as code
  • Bring ideas to life (i.e. production) to help make the lives of engineers better
  • Predict our future failures and work proactively to mitigate them
  • Advocate and implement reliable design patterns (circuit breakers, graceful degradation, etc.)

Some examples of SRE projects are:

  • Working on our next generation internal platform for efficiently automating our AWS infrastructure for ease of use by our product engineers
  • Automating our alerts configuration tool for Datadog to work with dynamic thresholds
  • Optica, a tool for keeping track of nodes in an infrastructure
  • Building automation around determining the causation and correlation of events in our infrastructure

The following experience is relevant to us:

  • 2+ years of industry experience
  • Knowledge of AWS services
  • Experience bringing software to production at high scale
  • The knack for writing clean, readable, maintainable code
  • An eye for automation and instrumentation
  • The ability to decompose complex systems and find failure scenarios
  • Great communication skills
  • Contributions to open source software

Benefits:

  • Stock
  • Competitive salaries
  • Quarterly employee travel coupon
  • Paid time off
  • Medical, dental, & vision insurance
  • Life insurance and disability benefits
  • Fitness Discounts
  • 401K
  • Flexible Spending Accounts
  • Apple equipment
  • Commuter Subsidies
  • Community Involvement (4 hours per month to give back to the community)
  • Company sponsored tech talks and happy hours

About Airbnb

Create a world where anyone can belong anywhere. It’s an audacious, incredibly rewarding mission that our increasingly diverse team is dedicated to achieving. Airbnb is built around the idea that everyone should be able to take the perfect trip, including where they stay, what they do, and who they meet. To that end, we empower millions of people around the world to use their spaces, passions, and talents to become entrepreneurs. Exciting challenges lie ahead: new regions, technologies, and businesses. Guided by our four core values, we’ll meet these challenges creatively and with the support of our global community. Join us!
