Jobs /

Site Reliability Engineer

Acquia

Apply Now

Job Details

Location: Pune, Maharashtra, 411001, India Portland, Multnomah County, Oregon, USA Posted: Jun 23, 2021

Job Description

Acquia empowers the world’s most ambitious brands to create digital customer experiences that matter. With open source Drupal at its core, the Acquia Digital Experience Platform (DXP) enables marketers, developers, and IT operations teams at thousands of global organizations to rapidly compose and deploy digital products and services that engage customers, enhance conversions, and help businesses stand out.

Headquartered in the U.S., Acquia has been named a top software company by The Software Report, rated a leader by the analyst community, and named a top place to work by the Boston Globe and the Boston Business Journal. We are Acquia. We are building for the future and we want you to be a part of it!

Site Reliability Engineering (SRE) is what you get when you treat operations as if it’s a software problem. Our mission is to improve, maintain, and provide for the software and systems behind all of Acquia’s services - with an ever-watchful eye on their availability, latency, performance, and capacity.

As an SRE, you will be working on monitoring Kubernetes, coding in Go, Yaml and Python and implementing reliable continuous deployments. You will also be given the opportunity to help refactor and integrate existing architecture for greater automation.

As a Site Reliability Engineer, you will...

  • Work in an Agile team designing, writing and delivering software to improve the availability, scalability, latency, and efficiency of Acquia’s services.
  • Maintain an understanding of system functionality and architecture, with a strong focus on the operational aspects of the service (availability, performance, change management, emergency response, capacity planning, etc).
  • Collaborate with your team members to review their work and have your work reviewed in turn.
  • Work in a collaborative environment where teams own and operate the services they build.
  • Influence and create new designs, architectures, standards and methods for large-scale distributed systems.

You’ll enjoy this role if you...

  • Know how to code.
  • Are curious and like solving complex challenges for scalable, low latency systems.
  • Enjoy creating software solutions for a Cloud native environment.
  • Enjoy collaborating with multiple stakeholders.
  • Have a passion for SRE, DevOps and related automation.

What you’ll need to be successful...

  • BS degree in Computer Science or related technical field, or equivalent practical experience.
  • Experience writing automation using Python/Go, Terraform and Unix Shell.
  • Have been involved in designing, analyzing and troubleshooting large-scale distributed systems like Kubernetes.
  • 1+ years of SRE/DevOps and experience in delivery software into production.
  • 1-2 years managing monitor, logging and report systems, and building observability dashboards on application and server performance and scalability issues (examples: SignalFX, Sumologic, New Relic, or other observability tools).
  • Understanding of security best practices.
  • Experience with automation/configuration management using Ansible, Chef or Puppet
  • Experience on large scale administration of Linux servers.
  • Knowledge of AWS or GCP products like EC2 or EKS/GKA/ECS
  • Ability to provide after-hours support as needed for emergency or urgent situations.

Extra credit if you...

  • Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
  • Familiarity with running web services at scale; understanding of Unix systems internals and networking.
  • Understanding of Unix/Linux systems from kernel to shell and beyond, taking in system libraries, file systems, and client-server protocols along the way.
  • Networking: knowledge and understanding of network theory, such as different protocols (TCP/IP, UDP, ICMP, etc), MAC addresses, IP packets, DNS, OSI layers, and load balancing).
  • Systematic problem-solving approach, coupled with a strong sense of ownership and drive.
  • Familiarity with other languages a part Python or Go, like Ruby or PHP.
Acquia is proud to provide best-in-class benefits offerings to our employees and their families in maintaining both a healthy body and a healthy mind. Core Benefits include: competitive healthcare coverage, wellness programs, take it when you need it time off, parental leave, recognition programs, and much more!

Acquia is an equal opportunity (EEO) employer. We hire without regard to age, color, disability, gender (including gender identity), marital status, national origin, race, religion, sex, sexual orientation, veteran status, or any other status protected by applicable law.

Interested residents of Colorado may contact [email protected] as it relates to regulation C.R.S. § 8-5-201. Information regarding benefits are linked here .

About Acquia

Global organizations turn to Acquia to provide technical expertise and strategic insight to build, deliver, and optimize the Drupal WCMS.

View Website

Get More Interviews for This and Many Other Jobs

Huntr helps you instantly craft tailored resumes and cover letters, fill out application forms with a single click, effortlessly keep your job hunt organized, and much more.

Sign Up for Free