Observability Engineer, Corporate Infrastructure

Information TechnologyHybrid Remote, San Jose, Costa Rica


Splunk is here to build a safer and more resilient digital world. The world's leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. While customers love our technology, it's our people that make Splunk stand out as an amazing career destination and why we've won so many awards as a best place to work. If you become a Splunker, we want your whole, authentic self, what we call your "million data points". So bring your work experience, problem-solving skills and talent, of course, but also bring your joy, your passion and all the things that make you, you.

Role Summary

We are actively seeking an Observability Engineer with a real passion for automation to help build scalable tools to run our distributed systems. You will be responsible for expanding and supporting the infrastructure platform services we provide to Splunk, as well as engaging with other teams to help improve efficiency and optimize our infrastructure. You're also an individual who’s motivated by technology and enjoys automation and problem-solving. We work hard, we like to challenge the status quo, and we enjoy having fun!

Your Continual Impact

  • Manage and maintain datacenter and cloud Observability infrastructure to ensure SLAs are maintained with application and service owners using IAC tooling such as AWS config, CloudFormation, Ansible, or Terraform.
  • Identify areas for improvement and develop automation to increase system reliability with scripting languages such as Python or Golang.
  • Work closely with partners to craft and develop a variety of features and integrations to increase application and system dependability.
  • Identify and resolve application, system, and integration issues to maintain efficient performance in our cloud based environments.
  • Contribute to defining a full SLA supported matrix to improve MTTD/MTTR for customer services.
  • Join the on-call rotation to respond to high-priority incidents with the goal of minimal downtime (minimal MTTR).
  • Document and maintain up-to-date system procedures, configurations, resolution guides, and standard processes as reference material for the team.

Must-have Qualifications

  • Demonstrated ability in DevOps / SRE focused environments.
  • Foundational understanding of a scripting language (Golang, Python).
  • Validated experience managing AWS or other Public Cloud platforms.
  • Practical knowledge of a number of Operating Systems, including Linux (Ubuntu/RHEL) and Windows.
  • Proficiency using Configuration Management tools like Ansible/Chef/Puppet.
  • Familiarity with using CI/CD pipeline tool experience (e.g. Jenkins, GitLab, GitHub).
  • An understanding of networking concepts and Internet protocols.
  • Ability to communicate technical concepts clearly to customers and upper management.
  • A strong desire to automate and solution issues with code.

Nice-to-have Qualifications

  • Understanding of the pillars of Observability and OpenTelemetry (OTel).
  • Experience with F5 Network Load Balancing.
  • Familiarity with cloud-native serverless architectures and container orchestration.
Splunk is an Equal Opportunity Employer
At Splunk, we believe creating a culture of belonging isn’t just the right thing to do; it’s also the smart thing. We prioritize diversity, equity, inclusion, and belonging to ensure our employees are supported to bring their best, most authentic selves to work where they can thrive. Qualified applicants receive consideration for employment without regard to race, religion, color, national origin, ancestry, sex, gender, gender identity, gender expression, sexual orientation, marital status, age, physical or mental disability or medical condition, genetic information, veteran status, or any other consideration made unlawful by federal, state, or local laws. We consider qualified applicants with criminal histories, consistent with legal requirements.



Base Pay Range

Costa Rica

Base Pay: CRC 24,000,000.00 - 33,000,000.00 per year

Splunk provides flexibility and choice in the working arrangement for most roles, including remote and/or in-office roles. We have a market-based pay structure which varies by location. Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location as set out above, as well as the knowledge, skills and experience of the candidate. In addition to base pay, this role is eligible for incentive compensation and may be eligible for equity or long-term cash awards.

Benefits are an important part of Splunk's Total Rewards package. This role is eligible for a comprehensive, competitive benefits package which may include healthcare and retirement plans, paid time off, wellbeing expense reimbursement, and much more! Learn more about our comprehensive benefits and wellbeing offering at https://splunkbenefits.com.

Thank you for your interest in Splunk!