PS RMT (Release Management Team) - Member Technical Staff
Description
Skills:
- Adept on Linux platform.
- Min 3+ years of experience with Docker, Kubernetes, AWS implementation and a working knowledge of core AWS products like VPC,S3, RDS,ELB,IAM,SQS, Lambda, EC2, EKS, etc and expert understanding of best practices.
- Strong Cloud infrastructure as code (IaC) automation experience through Terraform
- Ability to research, architect and drive complex technical solutions, consisting of multiple technologies and cloud services
- Experience with containerization and container orchestration through Docker and Kubernetes.
- Experience in debugging backend (like java) and frontend (like node) applications – Memory utilization , Analysis of thread dumps, heap dumps
- Exposure on Networking, load balancers, Messaging Queue and strong database fundamentals (preferably PostgreSQL and knowledge of Oracle and NoSQL DBs like Mongo DB).
- Experience working in programming languages like python, shell, Perl, java, etc.
- Working knowledge of network protocols, load balancing technologies, transport protocols and Linux/Unix system internals
- Expert in writing detailed solution specifications, diagrams, best practices/standards documentation, operating procedures, test plans/test reports, etc.
- Solid understanding on SDLC and agile methodology – Scrum and Kanban
- Excellent communication, interpersonal skills, and effective team player - must be capable of effectively communicating and engaging with cross functional technical and business teams and varying levels of management
- Ability to multitask, work well under pressure and prioritize work against competing deadlines and changing business priorities
- Experience working with a globally distributed workforce
- Experience in handling issues across the entire stack - hardware, software, application and network.
- Deep knowledge and experience in Incident and problem management.
- Knowledge on JBoss, Tomcat, Spring boot, etc
- Experience in monitoring tools like Splunk, New Relic, Sensu, Foglight, etc.
- Knowledge of Incident, problem, Change management
- Knowledge in APM tools like New Relic
- Knowledge on IBM MQ/Kafka/Redis/Elastic Search.
- Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
- Take up current monitoring two notches higher and ensure operations team to be able to detect all critical issues before customer.
- Have a systematic problem-solving approach, coupled with strong communication and analytical skills and a sense of ownership, initiative, grit, and drive.
- Design patterns (microservices/aws architecture patterns/enterprise application design patterns)
- Exposure to CI/CD, GitLab, JIRA, Service Now
- Exposure to UI technologies
- Good to have certifications like ITIL, AWS