Site Reliability Engineer [Poland]

Operations - Poland Poznań, Poland or Remote, Poland


Description

Egnyte is a product-focused company. We build and scale our flagship product: a secure content platform called Egnyte used by companies like Red Bull, IKEA, and Yamaha. It’s a large-scale system with 16,000+ customers. Our customers can access and manage their data through different devices and interfaces like mobile, desktop apps, or WebUI.

The opportunity:
As an SRE you will be ensuring reliability for a large-scale environment. Our engineers are part of the whole process: from design through coding and testing to the deployment and back again for further iterations. You will touch every level of the infrastructure depending on the day and the project you are working on. This role requires you to take on complex problems and execute end-to-end solutions. 

Your day-to-day at Egnyte:

  • Drive focused initiatives that improve operational efficiencies, reliability, and scalability of the platform and its applications
  • Participate in big projects like migrating solutions to Kubernetes, from monolith to microservices
  • Proactively propose and implement automation and observability solutions focusing on improving our core business
  • Address performance challenges, optimize and fine-tune production environments
  • Maintain and monitor our environments - you can expect different shifts but also elastic working hours to work on projects
  • Implement best SRE practices in making and documenting improvements to the infrastructure

About you:

  • 2+ years of experience in an SRE/SysAdmin/DevOps/NOC, software development, or equivalent role
  • Coding skills in Python or Golang
  • Good understanding of the Linux Operating System on the administration level
  • Experience with public cloud services (GCP/AWS/Azure)
  • Knowledge of metric-based monitoring solutions
  • Experience handling large numbers of diverse systems with configuration management systems like Puppet, Ansible, Terraform
  • Practical knowledge of CI/CD solutions
  • Troubleshooting skills to hunt down the root causes of issues and persistence in preventing them from happening again
  • Incident management skills - must be able to own, cooperate  and resolve large scale incidents under time pressure
  • Good English skills to effectively communicate about technical matters

Bonus points:

  • Practical knowledge of container orchestration (Kubernetes, Docker)
  • Experience with Linux HA solutions such as HAProxy
  • Experience with message brokers (RabbitMQ, Kafka or others) and databases (MySQL or others)
  • Operational knowledge of the ELK stack

What we can offer you:

  • Attractive salary based on skills and experience
  • Stock options
  • Your own Egnyte account with lifetime access to 1 TB of cloud storage
  • 4000 PLN gross conference budget per person and additional 4 training days off each year
  • MyBenefit: you can choose a MultiSport card or gift cards every month
  • Private medical health care
  • In-house English classes