Senior Reliability Engineer, Infrastructure

Research, Development & Cloud Operations United States

Senior Reliability Engineer, Infrastructure - 100% REMOTE


Who We Are:

CrashPlan® provides peace of mind through easy-to-use, automatic endpoint data backup. We help organizations recover from any worst-case scenario, whether it is a disaster, simple human error, a stolen laptop, ransomware, or an as-yet-undiscovered calamity. We continue to innovate as the landscape of work evolves, which makes CrashPlan foundational to organizations’ data security. What starts as endpoint backup and recovery becomes a solution for ransomware recovery, breaches, migrations, and legal holds.

Position Summary:

We are recruiting a Senior Reliability Engineer, Infrastructure to join our growing team. As a key member of the Product Development organization, you will build and maintain Development and QA test infrastructure for the CrashPlan application.

Key Responsibilities:

  • Replicate the CrashPlan production cloud infrastructure by instantiating and configuring resource instances in AWS, vSphere, and physical systems
  • Create and modify orchestration, provisioning, deployment, and monitoring tooling
  • Perform VMware configuration management activities, including VM template creation, configuration, and updates
  • AWS and vSphere consumption management
  • Linux/Windows/Mac systems administration tasks
  • Test Environment Application Database administration tasks
  • Application configuration management tasks
  • Design and develop test environment strategies and plans to support new product and product testing capabilities
  • Troubleshoot and resolve environment-related issues quickly
  • Ensure compliance with FIPS, PCI, and SOC 2 in accordance with standards, policies, and procedures
  • Propose and develop new technology standards and best practices as appropriate
  • Collaborate as part of a team of engineers dedicated to putting the customer first

Required Qualifications:

  • Bachelor’s Degree in Computer Science or related discipline and/or equivalent experience
  • 5+ years of experience in a similar role, including experience with computer networking (Internet routing, load balancing, DNS, etc.), working with Ansible and Docker, and/or experience with HashiCorp tools (Terraform, Vault, Packer, Vagrant, etc.)
  • Demonstrated proficiency with programming languages and tools (e.g., Java, Python, Go, Git)
  • AWS Systems Operations experience, or equivalent public cloud experience (Google Cloud, Azure, etc.)
  • Strong written and verbal communication skills with the ability to communicate with both internal and external stakeholders and senior leadership

Preferred Qualifications:

  • Advanced level degree
  • Experience with Continuous Integration and Continuous Delivery tools (e.g., Jenkins, Concourse, or similar)


CrashPlan values workplace diversity and is committed to ensuring an environment of mutual respect. Employment opportunities are available to all applicants without regard to race, color, creed, religion, sex, national origin, age, marital status, veteran status, sexual orientation, gender identity or expression, disability, genetic information, or any other category protected by law. We believe that diversity and inclusion are critical to our success, and we seek to recruit, develop, and retain the most talented people from a diverse candidate pool. We are proud to be an equal opportunity employer.