Manager, Site Reliability Engineering

Ahmedabad, Gujarat


Description

Key Responsibilities:
• Act as the primary point of contact for all cloud infrastructure projects
• Provide technical support for and monitor the day-to-day operations of our production
environments
• Monitor infrastructure using a variety of tools, respond to and resolve any alerts that arise,
and ensure system uptime meets service level agreements
• Escalate any incidents that cannot be resolved within specified time frames to the relevant
team members and work with them until the incident is resolved
• Document the steps taken to resolve each incident and add them to the knowledge base for
future reference
• Plan and coordinate updates, installations, and deployments
• Identify ways to improve existing systems, determine cost benefits, and recommend solutions
• Be the driving force behind our automation and observability initiatives
• Support a variety of Litera customers with differing technical environments, SLAs, and
technical needs
• Build strong and effective working relationships with the Engineering and Management
organizations
• Operate in a 24x7 Network Operations Center; this includes shift work and weekends


Qualifications:
• 8+ years of hands-on experience managing SREs
• 5+ years of experience deploying and supporting SaaS and Web applications
• Experience with CI/CD tools such as Bamboo, Jenkins, Docker, and Maven
• Experience with scripting using PowerShell and/or Python
• Experience managing Linux (Ubuntu) and Windows systems
• Experience managing and supporting cloud platforms (e.g., Azure, AWS)
• Good understanding of standard networking protocols and components such as HTTP, DNS,
TCP/IP, VPN, and load balancing
• Knowledge of monitoring and logging software (e.g., ELK, Prometheus, Grafana, AppDynamics,
New Relic)
• Familiarity with configuration-as-code tools such as Ansible and Terraform
• Previous NOC experience
• Strong communication and problem-solving skills with the ability to communicate clearly and
calmly with customers and technical personnel in high-stress situations
Wh