Manager - DevOps and Platform Engineering

Engineering Remote - United States


Description

Position at J2 Cloud Services

The Manager of DevOps and Platform Engineering reports to the VP of Engineering and will drive the Dev/Sec/Ops culture within the Engineering org.. This position will have a direct impact on production operations for a large multi-site Linux infrastructure providing internet security solutions for domestic and international markets. 

As the Manager of DevOps and Platform Engineering you will work closely with our platform product managers, as well as members of the Software Engineering and Network Engineering teams to build, manage and maintain a suite of online security products.

This is a high-profile opportunity within a highly profitable and dynamic high-tech company. We've established a strong foundation to make us the undisputed leader in our space. Now we need to add an experienced, collaborative, and passionate DevOps professional to help us grow, support, and maintain our services for our customers.

 

Job Duties:

  • Lead, mentor and grow a team of DevOps engineers
  • Execute project work required for new business initiatives such as; environment setup and configuration; infrastructure upgrades; product enhancements and feature rollouts.
  • Maintain and enhance our platform of internet security solutions through tooling and automation.
  • Develop new platform functionality and introduce new technology into the existing environment in a way that is supportable, scalable and capable of being handed off to Network Operations for production operations and support.
  • Interface with Support, Software Engineering and Network Engineering teams to maintain SLA, uptime, KPIs, and security requirements.
  • Maintain and improve existing system monitoring (including; instrumentation, visibility, alerting).
  • Identify and monitor platform health and Key Performance Indicators and develop automated tooling to alert appropriate teams.
  • Lead the way and accelerate adoption and  rollout of modern devops techniques (including: terraform, kubernetes, apm, unified logging, distributed tracing, and anomaly detection)
  • Participate in On-call escalations to help troubleshoot problems and solutions for multiple environments.

Job or Project Requirements and Experience:

Requirements

  • At least seven years of experience working in a high traffic, managed, scalable and fault-tolerant Linux environment.
  • At least five years of experience leading staff in either a Lead or Manager capacity.
  • Experience with AWS as both a generic compute platform and some of its hosted services, such as Route53, Lamda, RDS, API Gateway, Cognito, EKS, Amazon Elasticsearch; and supporting infrastructure as code tools like Terraform,.
  • Proficient with scripting languages such as Bash and Perl or Python.
  • Proficient with containers, including Docker, docker-compose, kubernetes, helm.
  • Experience with systems automation and configuration management tools.
  • In depth experience with RDBMS systems such as Postgres, MySQL, including replication and high availability.
  • Hands-on experience with CI/CD systems such as Gitlab or Github Actions  
  • Experience with change management policies and procedures.
  • Excellent written and verbal communication skills in English.
  • You are service-oriented and enjoy working with engineers to manage, deploy and tune high volume applications in a cloud environment.
  • You are motivated and keep on top of current technical trends along with deep knowledge of your tools of choice.
  • You are reliable, and we can count on you in times of need.
  • Candidates should be able to demonstrate strong problem-solving methodology and the ability to work with individuals at all levels of the organization as well as external vendors.

Desirables

  • Experience with metrics and monitoring tools such as Prometheus, Grafana, InfluxDB, and Nagios.
  • Experience with administration of or usage of Elasticsearch, Logstash and Kibana.
  • Experience integrating and maintaining SaaS metrics and analytics platforms (such as Newrelic or Datadog).
#J2CloudServices
#LI-Remote
#LI-MW1