Site Reliability Engineering Group Leader
Description
Summary
Data has never been more valuable and vulnerable. As cybercriminals become more sophisticated and regulations more strict, organizations struggle to answer one key question: “Is my data safe?
At Varonis, we see the world of cybersecurity differently. Instead of chasing threats, we believe the most practical approach is protecting data from the inside out. We’ve built the industry’s first fully autonomous Data Security Platform to help our customers dramatically reduce risk with minimal human effort.
At Varonis, we move fast. We’re an ultra-collaborative company with brilliant people who care deeply about the details. Together, we’re solving interesting and complex puzzles to keep the world’s data safe.
We work in a flexible, hybrid model, so you can choose the home-office balance that works best for you.
Responsibilities:
- Managing global SRE teams with Production service reliability and increase productivity and efficiency.
- Monitor, manage and operate our cloud services including incident management.
- Scale our service with required monitoring and alerting capabilities.
- Develop tools and automations based on C# .Net and Python to support our operation and growth.
- Work closely with R&D to make sure new features are reliable, easily deployable, and support the requirements of the service in terms of scale and security.
- Establish a regular operational feedback cycle into our engineering teams.
- Manage the Service Operations team to operate with a culture of business and customer-centricity by maintaining Varonis SLA for each service, including incident response, problem management, and service upgrades.
- Develop and drive, as the primary owner, the communication strategy for internal and external stakeholders (including customers) to convey service health, tracking against SLAs, current and historical incidents, upcoming events, or upgrades.
- Ensure all technical procedures are documented, reviewed, and updated and actively contribute to the maintenance of operational standards & policies.
- Collaborate with the Varonis Support team to understand and improve user experience, performance, incident response, and the serviceability of our offerings.
- Collaborate with the internal R&D team to automate infrastructure services and system administration tasks wherever possible and implement a monitoring strategy to provide rapid feedback and diagnostics in the event of a service disruption.
- Create relationships with other departments, including Marketing, Product Management, Engineering, and Customer Success, to make sure we provide services with high availability and superior performance for all our customers.
Requirements:
- At least a bachelor's degree (computer science or related fields) or equivalent experience in building scalable solutions to improve high-availability Production service reliability and/or increase productivity and efficiency.
- At least 5 years of experience in managing multiple operation teams globally.
- At least 2 years’ experience in developing C# or Python or Java applications.
- In-depth understanding of the entire web development process (design, development, and deployment)
- Strong organizational and analytical skills.
- Substantial experience in operating a high-availability cloud infrastructure.
- Quick technology adaptation
- Good interpersonal skills
- Experience with Microsoft Azure or other cloud platforms (GCP, AWS)
We invite you to check out our Instagram Page to gain further insight into the Varonis culture!
@VaronisLife
Varonis is an equal opportunity employer. We evaluate qualified applicants without regard to race, color, religion, sex, national origin, disability, veteran status, and other legally protected characteristics.
#LI-Hybrid