Sr. Director, Site Reliability Engineer

Information Technology Chicago, Illinois


Company: Oak Street Health 

Title: Sr.Director, Site Reliability Engineer 

Location: Chicago

Company Description

Oak Street Health believes everyone deserves great healthcare – the kind you’d want for your own family. We specialize in providing exceptional care for older adults on Medicare, often in underserved communities where there is little or no quality healthcare.

Oak Street Health is on a mission to rebuild healthcare as it should be with an innovative care model that centers on wellness-based, positive health outcomes for patients, not the volume of services provided. Our unwavering commitment to keeping patients healthy is highlighted by dedicated care teams that take time to get to know each individual, providing the personalized care they need to stay healthy and live life more fully. 

We’re an organization on the move! Our rapidly growing national network of primary care centers is staffed by a diverse team of outstanding care providers, service team members, technologists, community outreach experts, business professionals, and more. For more information, visit

Role Description:

As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our stellar software engineering teams in a fast-paced, agile environment to transform ideas into a reality.  Utilizing modern methodologies and open source tools, you will be empowered to set the engineering excellence standards as we seek to deliver applications that will directly and immediately impact the experience of our teams and our patients. 

Core Responsibilities:

  • Review systems to identify and implement the necessary telemetry, monitoring and alerting for proactive and reactive management. 

  • Partner with Product and AD to define/review Service Level Objectives and Service Level Agreements. 

  • Participate in design reviews to ensure solutions can meet SLO’s / SLA’s.

  • Design and automate performance and resiliency test cases in partnership with application development and infra teams.

  • Identify and eliminate manual repeatable tasks with automation or application enhancements partnering with development. 

  • Other duties, as assigned.

What are we looking for?

  • Bachelors or Relevant industry experience

  • Minimum of 5 years of development experience in consumer facing products leveraging cloud native technologies 

  • Experience automating pipelines using continuous delivery tools.

  • Experience with system monitoring, alerting and observability platform tools and best practices. 

  • Experience with capacity planning and management.

  • Experience with resilient systems, resiliency testing and design best practices.

  • Experience with nonfunctional requirements along with SLO’s/ SLA’s.

  • Preferred: Our Tech Stack – Istio, Grafana Labs,.NET Core, Confluent Kafka, Mongo, gRPC, AKS, Docker, Azure

  • Preferred: Experience managing Kubernetes clusters in a production environment

  • Preferred: Experience monitoring applications at scale using Microservices

  • US Work Authorization

  • Someone who embodies being “Oaky”

What does being “Oaky” look like?

  • Radiating positive energy

  • Assuming good intentions

  • Creating an unmatched patient experience

  • Driving clinical excellence

  • Taking ownership and delivering results

  • Relentlessly determined

Why Oak Street?

Oak Street Health offers our coworkers the opportunity to be at the forefront of a revolution in healthcare, as well as:

  • Collaborative and energetic culture

  • Fast-paced and innovative environment

  • Competitive benefits including paid vacation and sick time, generous 401K match with immediate vesting, and health benefits

Oak Street Health is an equal opportunity employer. We embrace diversity and encourage all interested readers to apply to