Job Description
Our success is reliant on building teams that include people from different backgrounds and experiences who can elevate assumptions and ideas with fresh perspectives. We're dedicated to hiring the whole human, not just a resume. To that end, we look for a diverse pool of applicants-including those from historically marginalized groups. We would like to invite you to apply even if you don't think you meet all of the requirements listed below. We don't want a few lines in a job description to get between us and the opportunity to meet you. About the Team The platform team is composed of folks with disparate skills and backgrounds. Our unifying attribute is our desire to work together to find creative, scalable solutions to the problems we run into. The SRE COR (Compute, Observability and Reliability) team at Outreach is responsible for the underlying infrastructure on which Outreach runs, providing operational insights into it, and maximizing the reliability of that infrastructure. We are heavily integrated into
AWS . The COR team is responsible for building the tools that build the infrastructure that allows us to continue to grow. In the ideal world, the software we build creates kubernetes clusters, validates performance, and can automatically migrate workloads to account for the new clusters safely.
The Role We are looking for an engineering manager to lead a Site Reliability Engineering team in Prague. The Core Infrastructure team at Outreach is responsible for the foundation on which all the other software that Outreach engineering teams build runs. That means we need to be empathetic to the needs of our co-workers in the performance of their jobs. It also means that we must be pretty focused on how our systems are performing according to our SLOs and SLIs. We have transitioned much of our production infrastructure to run on top of Kubernetes. We are looking for someone to come help us mature that new platform. We also need someone to help us reshape other portions of our underlying production infrastructure as we continue to rapidly grow and scale. That means we need someone to help us identify the constraints in our system, and prioritize which ones we address. We are looking for someone that can be analytically minded. In addition, the right person isn't necessarily interested in building new and exciting infrastructure technology. Instead, you are focused on using and augmenting existing tools to serve our needs.
Tech stack In addition to Kubernetes, we use jsonnet, Chef, Concourse, Elasticsearch, Terraform, Ruby, and Go. It's awesome if you have experience with any of those things, but we are happy to help you learn. Other things that you may have experience in that are potentially relevant, but not required: building highly available services, an understanding of distributed systems and their commonly associated problems, MySQL administration, cloud computing fundamentals (preferably in AWS), REST, cloud-based networking, Unix fundamentals, performance profiling (especially in Ruby). We encourage you to apply, even if you think the position sounds a bit outside your wheelhouse. We want to find folks who are interested in learning and developing on the job, even if you can walk in the door and do amazing things.
Location While we are remote-friendly, we remain an "office-centric" company. The workplace is based in
Prague, Czechia .
Your Daily Adventures Will Include
- You will be managing and coaching a team of software engineers who are usually iterating on our planned projects on a day to day basis. However, we are occasionally disrupted by exigent circumstances (read: alerts). The aim is to ensure that we spend more time than not working on software to make our platform more performant and scalable, and make it easier for the other software engineers to do their jobs. We are also occasionally called to assist other teams. When confronted with disruptive events, we strive to codify what we’ve learned and feed that information back into how we plan and prioritize our work
Our Vision of You
- We are ideally looking for an SRE Manager with 5+ years of dedicated experience
- Experience with complex architecture
- Experience with Kubernetes in a production environment
- Cloud computing fundamentals and cloud-based networking, preferably in AWS
- Our tech stack: In addition to Kubernetes, Docker and AWS, we use Terraform/Atlantis, Helm, CircleCI, Concourse, Prometheus/Grafana, Vault and Git, with services written in Go. It's awesome if you have experience with any of those things, but we are happy to help you learn
- Experience building highly available services - monitoring, alerting and logging with an emphasis on both the business and technical aspects of the application
- Understanding of distributed systems and their commonly associated problems
- Good Unix/Linux fundamentals
- Excellent communication skills. English is a must
Why You’ll Love It Here ● Highly competitive salary ● Amazing working space with a running track on its roof ● Flexible time off, 5 weeks of vacation, and 5 annual sick days ● 4% employer supplemental pension monthly contribution ● Private medical care for employee and spouse with Program Health Plus ● Life insurance at 2x annual salary ● 5.000 CZK monthly allowance for meal vouchers, flexipasses and other personal expenses ● 16 weeks of annual top up maternity leave pay or 8 weeks of fully paid paternity leave ● Opportunity to be part of company success via the RSU program ● Company-organized and personal paid volunteer days to support the community that supports us ● Relocation package - support offered for foreign future employees and their families relocating to Prague ● Diversity and inclusion programs that promote employee resource groups like OWN (Outreach Women's Network), AAPI, Rainbow (LGBTQIA+), Gender+, LatinX, Black Excellence, Disability Community, and Veterans ● Employee referral bonuses to encourage the addition of great new people to the team ● Fun company and team outings because we play just as hard as we work
Apply for this job