Site Reliability Engineer

DomainHybrid Remote, Timisoara, Romania


Description

Position at Ness Romania SRL

Why Ness

We know that people are our greatest asset. Our staff’s professionalism, innovation, teamwork, and dedication to excellence have helped us become one of the world’s leading technology companies. It is these qualities that are vital to our continued success. As a Ness employee, you will be working on products and platforms for some of the most innovative software companies in the world.

You’ll gain knowledge working alongside other highly skilled professionals that will help accelerate your career progression.

You’ll also benefit from an array of advantages like access to trainings and certifications, bonuses, and aids, socializing activities, and attractive compensation. 

Requirements and responsibilities

The Site Reliability Engineer works as an software developer in reliability for a specific software application or suite of applications and accompanying infrastructure. This includes implementation of new systems as well as providing mid-level and escalation support for other groups and working to resolve production issues in conjunction with development, operational, and architectural resources.

What you’ll do

  • Develop production environment monitoring for availability by taking a holistic view of system health and uptime;
  • Build software and systems to manage platform infrastructure and applications;
  • Improve reliability, quality, and time-to-market of our suite of software solutions;
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve;
  • Provide primary operational support and engineering for multiple large, distributed software applications;
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding;
  • Partner with development teams to improve services through rigorous testing and release procedures;
  • Participate in system design consulting, platform management, and capacity planning;
  • Create sustainable systems and services through automation and uplifts;
  • Balance feature development speed and reliability with well-defined service level objectives;
  • Strategic thinker and capable of learning new technologies quickly. Understand the use to define and refine the monitoring requirements;
  • Ability to work under high pressure and out of the box thinker.

What you’ll bring

  • 2+ years of experience in programming (structured and OO) with one or more high-level languages, such as PowerShell, Shell/Bash, Python/kickstart/puppet, and Docker containers;
  • 1+ experience in Software Development with Java/.NET is preferred;
  • Work hand in hand with R&D, DevOps, and support teams to improve processes and ensure systems are always operational;
  • 1+ years of working with GIT-based source control systems, preferably with GitHub;
  • Good understanding of code promotion techniques, building automation, and branching strategies;
  • 1+ years of working with Amazon AWS services, such as EC2, IAM, Dynamo, RDS, S3, EBS, CloudWatch, Lambda, API Gateway or Azure native services;
  • Experience with load balancers, layer 4/7 load balancing, HAProxy, tcpdump a plus;
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks;
  • English proficiency.

Not checking every single requirement?

If this role sounds good to you, even if you don’t meet every single bullet point in the job description, we encourage you to apply anyway. For most of the candidates that applied, we found a role that was a very good fit with their skills.

Let’s meet and you may just be the right candidate for one of our roles.

At Ness Digital Engineering we are willing to build a work culture that is based on diversification, inclusion, and authenticity.