Cloud Operations Incident Commander

Engineering Krakow, Poland


Want to work in a dynamic environment working with the latest cloud technologies? Want to learn tools like Splunk from the inside and grow your career in exciting ways? Splunk Cloud is looking for self-starting engineers to be a part of the Cloud Network Operations Center (CNOC). The responsibility of the Splunk CNOC to supervise and resolve issues that affect the availability and performance of Splunk for our cloud customers. As the authority on our customer’s experience, the CNOC is the frontline of defense in making sure each of our customers has an outstanding experience.

We’re looking for an experienced Incident Manager to join our team in supporting and supervising our ever-expanding Cloud platform.

Responsibilities:

  • Provide 24/7 support & incident management.
  • Use the Splunk Incident Management System (SIMS) to restore normal service operations as quickly as possible to minimize the impact to business operations.
  • Assemble the response team which includes the incident owner, problem owner and other professionals in the specified area of expertise.
  • Establish accurate expectations from response procedures to ensure customer satisfaction throughout the process.
  • Supervise and manage incidents fully to ensure accurate information is captured.
  • Assemble and lead conference calls for diagnosis and remediation of customer impacting outages.
  • Provide Incident commander responsibilities, contribute to post incident review, and follow through with action plans
  • Develop positive, strong, and collaborative relationships with multiple cross-functional partners across Splunk to improve the Team's efficiency and ability to deliver on sophisticated tasks that have broad impact
  • Make suggestions for process improvements and improved operational efficiencies.

Qualifications:

  • You have 3+ years in Major incident response and management experience.
  • You are able to think out of the box and work on multiple tasks simultaneously while dynamically prioritizing based on changing conditions.
  • You enjoy problem solving and analyzing global scale distributed systems.
  • You have outstanding interpersonal and communication skills.
  • Remain calm and collected in stressful situations, such as a major service outage
  • You have excellent problem-solving skills with a strong attention to detail.
  • You are willing to work weekends, overnight shifts and holidays with flexibility to work additional shifts on short notice.

Thank you for your interest in Splunk!