Infrastructure Operations Engineer (DevOps) [ 4 - 7 years ]
Hungry, Humble, Honest, with Heart.
The Opportunity
We are seeking a proactive and detail-oriented Infrastructure Operations Engineer to join our team. In this role, you will be responsible for the day-to-day reliability, maintenance, and enhancement of our build and delivery infrastructure. You will play a vital part in ensuring our CI/CD pipelines are fast, stable, and scalable, directly impacting the velocity of our software development teams.
The ideal candidate has a solid foundation in DevOps practices and a "reliability-first" mindset.
- Ownership: Take real ownership of the tools that power our engineering organization.
- Skill Development: Gain exposure to enterprise-scale distributed systems and modern cloud-native technologies.
- Impact: Work in a role where your automation directly saves hundreds of hours for your fellow engineers.
- Culture: A supportive environment that values continuous learning and operational excellence.
About the Team
At Nutanix, the Developer Productivity team is a global team spanning multiple countries that supports developer productivity across the Engineering organization. Our team culture thrives on determination and mutual respect, fostering an environment where innovative ideas are welcomed and collaboration is key. Each member is dedicated not just to their own growth but also to empowering one another, creating a cohesive team atmosphere that drives our success.
You will report to the Senior Network Operations Manager, who is committed to supporting your professional development and ensuring that you have the resources needed to excel in your role.
Your Role
- System Maintenance: Maintain and troubleshoot CI/CD infrastructure, including GitHub, Jenkins, Artifactory, and Gerrit.
- Pipeline Optimization: Identify bottlenecks in build and deployment workflows and implement technical solutions to improve performance.
- Automation: Develop and maintain scripts and tools to automate repetitive operational tasks and improve system resilience.
- Monitoring & Alerting: Configure and manage observability tools to ensure the health of the infrastructure, maintaining a target of 99.9%+ availability.
- Incident Response: Participate in on-call rotations, responding to system outages, performing root cause analysis (RCA), and implementing permanent fixes.
- Collaboration: Partner with development teams to provide "Developer Enablement," helping them troubleshoot build failures and platform-related issues.
- Documentation: Create and maintain clear technical documentation for infrastructure configurations, runbooks, and operational processes.
What You Will Bring
- 4–5 years of professional experience in Infrastructure Operations, DevOps, or Site Reliability Engineering (SRE).
- CI/CD Expertise: Hands-on experience configuring and managing enterprise-grade build tools and repository services.
- Scripting Skills: Proficiency in Python, Bash, or similar languages for automation and system integration.
- Infrastructure as Code (IaC): Working knowledge of Terraform, Ansible, or similar configuration management tools.
- Troubleshooting: Strong ability to diagnose issues across the stack (network, OS, application, and cloud layers).
- Availability: Willingness to participate in an on-call rotation and provide after-hours support for critical incidents.
Technical Preferences (A Plus)
- Cloud Platforms: Experience operating within AWS, Azure, or GCP environments.
- Virtualization: Familiarity with Nutanix or VMware environments.
- Containers: Basic experience with Docker and Kubernetes orchestration.
- Observability: Experience with tools like Prometheus, Grafana, ELK, or Splunk.
- Security: Understanding of basic security protocols (SSL/TLS, SSH, IAM roles).
Professional & Soft Skills
- Self-Sufficiency: Ability to take a high-level task and drive it to completion with minimal guidance.
- Communication: Ability to clearly explain technical issues to both technical and non-technical stakeholders.
- Attention to Detail: A disciplined approach to testing changes before they hit production environments.
- Team Player: Collaborative attitude with a desire to learn from senior engineers and help mentor junior staff.
Work Arrangement
Hybrid: This role operates in a hybrid capacity, blending the benefits of remote work with the advantages of in-person collaboration. In locations where our workplace policy applies (i.e., San Jose, Durham, Mexico City, Bangalore, Pune, Hoofddorp, Belgrade, Barcelona, Singapore, Sydney, and Tokyo), employees are expected to work onsite a minimum of 3 days per week to foster collaboration, team alignment, and access to in-office resources. Workplace type may vary based on location and team requirements. Please speak with your recruiter for details. Additional team-specific guidance and norms will be provided by your manager.
--
Nutanix is an equal opportunity employer.
Nutanix is an Equal Employment Opportunity and (in the U.S.) an Affirmative Action employer. Qualified applicants are considered for employment opportunities without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, marital status, protected veteran status, disability status or any other category protected by applicable law. We hire and promote individuals solely on the basis of qualifications for the job to be filled. We strive to foster an inclusive working environment that enables all our Nutants to be themselves and to do great work in a safe and welcoming environment, free of unlawful discrimination, intimidation or harassment. As part of this commitment, we will ensure that persons with disabilities are provided reasonable accommodations. If you need a reasonable accommodation, please let us know by contacting [email protected].