Lead Site Reliability Engineer (Sitecore XMC)

Engineering & Technology Athens, Greece


Description

About Us:
Sitecore’s mission is to provide cutting-edge DXP solutions that enable the world’s greatest brands to craft truly unforgettable experiences for their customers. A highly decorated industry leader, Sitecore brings content, commerce, and data into one connected platform that delivers millions of digital experiences every day. Thousands of blue-chip companies, including American Express, Porsche, Starbucks, L’Oréal, and Volvo Cars, rely on Sitecore to provide more engaging, personalized customer experiences. Under the leadership of our new product-centric CEO, we are poised to continue to push the boundaries of marketing technology and shape the future of customer engagement. Learn more at Sitecore.com.   
 
Sitecore XMC team:
Sitecore XMC SRE team is committed to delivering a reliable, observable, and supportable Sitecore XMC product for our customers. We collaborate closely with engineering, operations, and support teams in order to research and solve complex problems with the product.
 
About the Role:
This role is part of a team that ensures a seamless and robust experience for our customers. As a Lead SRE, you’ll be at the forefront of this exciting journey: you’ll be the architect of resilience, the troubleshooter extraordinaire, and the automation wizard - all rolled into one. 
 
What You’ll Do:
  • Lead the design and the implementation of reliability projects to develop and maintain scalable and reliable software systems
  • Develop technical design documents for systems and processes
  • Define monitoring concepts and dashboards that will appropriately measure system SLAs/SLOs/ SLIs, performance and detect issues and will drive their implementation - in collaboration with the engineering teams
  • Drive the automation efforts to resolve major toils for engineering teams
  • Drive initiatives to reduce complexity on the architecture of the systems
  • Collaborate and coach other SREs in the team to share knowledge, best practices, and expertise 
What You Need to Succeed:
  • Deep understanding of operating systems, networking, and cloud infrastructure
  • Extensive experience in automating routine tasks, using tools like Ansible, Python, Bash, or Ruby
  • Deep knowledge on cloud platforms (e.g., AWS, GCP, Azure).
  • Large experience to work with containerization technologies (e.g., Kubernetes, Docker)
  • Experience in a leading role, inspiring others, and thrive in chaos
  • Tactics to introduce best practices and simplification in all aspects of your work (systems, day to day work, processes)
Additional Skills That Could Set You Apart:
  • Pro-active personality that will help other team members and other teams to understand issues
  • Mentorship and Leadership experience with junior SREs and engineers, sharing knowledge and best practices
  • Leadership and coordination of teams during complex incidents or large-scale projects
How we hire:
Sitecore is proud to be an equal-opportunity workplace. We are committed to equal employment opportunity without unlawful regard to race, color, ancestry, religion, gender, national origin, sexual orientation, age, citizenship, marital status, disability, veteran status or any other local legally protected characteristic

Share this job