Site Reliability Engineer

R&D - Research and Development Ottawa, ON


Description

What makes us Qlik 

Qlik helps enterprises around the world move faster, work smarter, and lead the way forward with an end-to-end solution for getting value out of data. A Gartner Magic Quadrant Leader for 11 years in a row! Our platform is the only one on the market that allows for open-ended, curiosity-driven exploration, giving everyone – at any skill level – the ability to make real discoveries that lead to real outcomes and transformative changes. We are a Values-Driven organization, operating over 100 countries with 38,000 customers around the world. If you think we are interesting, please read on – we may be looking for you!  

 

Site Reliability Engineer

The Qlik Site Reliability Engineering team (SRE) is committed to ensuring that our SaaS environments are scalable, observable, and reliable while providing an optimal user experience. Bring your passion and software engineering best practices to scale the SaaS architecture and improve reliability, increase automation, and remove toil.

 

 

Responsibilities include:

  • Be curious about new technology, infrastructure, and practices to scale our architecture and prepare for future growth.
  • Provision secure, reliable, and scalable SaaS infrastructure via code (Infrastructure as Code).
  • Through collaboration, advocate for reliability and scalability best practices throughout the development lifecycle. Lead by example.
  • Increase automation and tooling to reduce toil and manual intervention.
  • Leverage metrics and traces to effectively observe systems and provide data and insights.
  • Automate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability.
  • Apply software engineering best practices to develop SRE-managed services and tools.

 

 

Qualifications include:

  • 3+ years of experience with deploying and supporting a SaaS offering.
  • 3+ years of experience deploying and supporting Kubernetes clusters in a public, scalable SaaS offering.
  • 2+ years of experience as a Site Reliability Engineer or 3+ years in a DevOps environment.
  • 3+ years development experience (Golang / NodeJs / bash / etc.).
  • 2+ years of experience with each of the following:
    • Cloud Infrastructure (Amazon Web Services (AWS) / Google Cloud Platform (GCP) / Azure / etc.)
    • Bug tracking (JIRA / Bugzilla / YouTrack / etc.)
    • Continuous Deployment (concourse / GitHub Actions / Spinnaker / etc.)
    • Source control (github / gitlab / etc.)
    • Metrics & Tracing (Grafana / Prometheus / Jaeger / OpenTelemetry / OpenTracing / etc.)
  • Familiar and comfortable with agile development techniques – Scrum certified preferred.
  • Self-starter with the ability to work independently on projects.
  • Proactive and strong ability to learn new things with limited guidance.
  • Demonstrated ability to work effectively within a team and with cross-functional technical and business teams.
  • A curious attitude that is interested in knowing why things work the way they do and using that information to improve and enhance.

 

Location
This position is based in Ottawa, ON Canada.

 

About Qlik 

 

Qlik is an Equal Opportunity Employer and does not discriminateon the basis ofany protected category or characteristic.  We value the diversity of our workforce. If you need assistance due to disability during the application and/or recruiting process, please contact us via theAccessibility Request Form 

 

AGENCIES: Qlikis not accepting unsolicited assistance from search firms for this employment opportunity. Please, no phone calls or emails. All resumes submitted by search firms to any employee atQlikvia-email, the Internet or in any form and/or method without a valid written search agreement in place for this position will be deemed the sole property ofQlik. No fee will be paid in the event the candidate is hired byQlikas a result ofthe referral or through other means. 

 

#LI-AMER