Big Data Engineer

Software EngineeringHybrid Remote, Limassol, Cyprus


Description

Position at Parimatch Tech

Parimatch Tech is an international product company that develops and provides tech and marketing solutions for the Gaming & Entertainment industry. Headquartered in Cyprus and operating globally, the company has R&D centers located in four countries, the largest of which is in Ukraine.

Hi-tech solutions and innovative approaches are what drive Parimatch Tech forward in the gaming market and remain its basis for success and development.

We invite those who fired up to:

  • Working with big enough datasets (200+ TB) which must be updated at least with hourly frequency;
  • DWH components: AWS (S3, Athena, Redshift), GCP(Cloud Storage, BigQuery), PostgreSQL;
  • Data collection: Kafka, Google Analytics, Firebase, Appsflyer, Cloudflare, other 3rd party apps;
  • Data modelling: building centralised data catalog with well validated and documented data marts;
  • Data quality / integrity testing automation;
  • Designing and implementing REST based APIs;
  • Development and support of ETL / ELT processes;
  • Creation and support of project documentation. 

Essential professional experience:

  • 3+ years experience Python / Data Engineer;
  • Hands-on experience with following technologies:
  • Designing, implementing RESTful API’s (Aiohttp, Flask, FastAPI);
  • Relational databases (PostgreSQL, Microsoft SQL Server);
  • Job scheduling, task queues;
  • Cloud providers: AWS (S3, Athena, Redshift), Google Cloud Platform (Cloud Storage, BigQuery), etc.;
  • Linux, Docker;
  • Incorporating and utilising BDD / TDD / Unit testing;
  • Exceptional problem solving, technical and data analysis skills;
  • Extensive knowledge of best practices in software design and design patterns;
  • Strong Computer Science fundamentals;
  • Knowledge in database theory: types, their pros and cons;
  • Knowledge in performance tuning of ETL Jobs, SQL’s, Partitioning, Indexing;
  • Hands-on experience with ETL, Data Warehousing tasks;
  • Solid understanding of git flow best practices.

Desirable skills and personal features:

  • Hands-on experience with following technologies:
  • Kubernetes;
  • Apache Airflow;
  • NoSQL databases (Elasticsearch, Redis, MongoDB);
  • Kafka, Kafka Connect, Kafka Streams;
  • NoSQL databases: MongoDB, ElasticSearch, Redis;
  • IaC: Terraform, Ansible;
  • Data visualisation tools (Tableau, PowerBI, Superset, Grafana, Kibana etc.).