Lead Data Engineer

Posted: 05/25/2021
Information Technology Dublin, Ohio


Description

Job Title: Lead Data Engineer

Job Summary

The Lead Data Engineer is responsible for the design and construction of data flows that assemble and refine complex data sets into usable information supporting organization initiatives.  This individual is also responsible for some oversight of people and team activities to ensure the successful delivery of the team’s work streams.

They will work with data architects, developers, analysts, and other stakeholders to design/build pipelines and cloud-based technical processes consistent and in support of the architecture direction to satisfy functional/non-functional requirements.  The scope of their work will support and drive capabilities of Business Intelligence, Operational Reporting, Enterprise Data Warehouse, Data Lake, Big Data, and Enterprise Application Integrations. 

Essential Roles and Responsibilities 

% of Time Spent

Essential Tasks/Duties/Responsibilities

30

Create and maintain pipeline and cloud-based services

30

Work with data and design teams to define solutions and support data requirements

20

Oversight and support of peers and associated activities that enhance the overall productivity and capabilities of the department and team.

20

Project coordination and planning

 Skills and Qualifications

  • BS / Graduate Degree in Computer Science, Engineering, Mathematics, Statistics, or related field
  • 7+ years of experience in similar technical roles (ETL, Application Development, Data Science, Big Data, Reporting)
  • Extensive SQL knowledge and experience working with relational databases
  • Experience building and optimizing data pipelines and data sets
  • Ability to analyze data, find patterns, identify issues, and enhance and improve the integrity and quality of data and associated technical processes
  • Build processes supporting data transformation, data structures, metadata, dependency and workload management
  • Familiarity and experience with cloud technologies and associated purpose. AWS preferred, such S3, EC2, EMR, DynamoDB, Aurora, Athena, Glue, Lambda.
  • Working knowledge of message queuing, stream processing, and scalable data stores.
    • Experience with data warehouse and associated modeling / design (data mart, dimensions, facts)
    • Exposure to big data tools: Hadoop, Spark, Kafka, EMR etc.
    • Exposure and familiarity with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc
  • Experience with Enterprise integration and ETL platforms/IPaaS (SnapLogic, Informatica, SSIS)
  • Experience supporting and working with cross-functional teams in a dynamic environment.
    • Interest in and evidence of increasing leadership on projects and complex deliverables
    • Interest in and evidence of increasing skills in mentorship, management, and leading people