Lead ,Site Reliability Engineer

Computers/Software Full-Time Bangalore, India ReqID:3888


Team                         : Site Reliability Engineering

Position                          : Lead ,Site Reliability Engineer

Reports to                      : Senior Manager

Experience                : 8 to 12 years



Roles and Responsibilities:

  • Works on incidents, Analyzes application issues/logs , does impact analysis and post-incident reviews,
  • Participates and provides feedback during design discussions
  • Handles/manages projects
  • Automates repeated jobs , develops tools to improve efficiency , enhances the knowledge base
  • Ready to work in night and weekend shifts

Preferred Skills:

  • Strong experience in Unix/Linux based Operating systems
  • Experience in debugging java based applications (Both API and UI) – Memory utilization , Analysis of thread dumps, heap dumps
  • Understanding of oracle/postgreSQL database, Analyzing issues due to db load/failures , AWR report analysis , understanding of concepts like blocking locks , query execution plans
  • Troubleshooting skills in AWS – S3, SNS, lambda , EC2,VPC etc
  • Good troubleshooting skills and knowledge of Kubernetes and Docker based systems
  • Shell Scripting , Perl/Python scripting/Java
  • Knowledge about different networking concepts and components, knowledge on “how Load balancing works” is a must
  • Knowledge on JBoss , Tomcat , microservices etc
  • Good knowledge in using and writing queries in splunk
  • Knowledge of Incident, problem, Change management
  • Knowledge in APM tools(New Relic) , monitoring tools
  • Knowledge on  IBM MQ/Kafka/Redis/Elastic Search


Good to have:

  • Exposure to CI/CD
  • Exposure to UI technologies
  • Excellent Analytical and problem solving skills
  • ITIL certification
  • Worked in a dynamic environment and ability to adapt quickly to changes