Site Reliability Engineer

Other Bangalore, India


Description

Key responsibilities:

  • Continuously improve system and application monitoring and automation
  • Monitor the availability, performance and capacity of infrastructure, systems and applications
  • Lead engagement with software developers and infrastructure engineers to integrate software development and delivery from inception to full operation, ensuring that released software and systems are robust
  • Identify and automate manual workarounds and process improvements
  • Monitor the availability, latency, scalability and efficiency of all services
  • Perform periodic on-call duty as part of the SRE team

Skills and experience:

  • Experience managing and troubleshooting large AWS infrastructures
  • Familiarity with Docker, Ansible, and CloudFormation
  • Background in system administration scripting (shell, Bash, Python, etc.)
  • Experience with Linux operating systems and system administration is a plus
  • Basic network debugging skills
  • Knowledge of Amazon S3, EC2, RDS, EFS, ELB and Route 53 is a plus

Required qualifications:

  • BE/MCA in Computer Science or Engineering
  • Experience with Linux and Unix-like operating systems
  • Must be self-directed, flexible, and able to prioritize and handle multiple projects simultaneously
  • Outstanding problem-solving, troubleshooting, and decision-making skills