Cloud Data Lake Architect

Posted: 03/09/2020
Business Intelligence Dublin, Ohio


Job Title:               Cloud Data Lake Architect

Department:        IT            

Job Summary

The Cloud Data Lake Architect supports business process through technology solutions. This role is responsible for working in partnership with other Architects, developers and business partners in designing and building a secure and reliable data lake in cloud that meets the growing needs of the organization

Essential Roles and Responsibilities 

% of Time Spent

Essential Tasks/Duties/Responsibilities


Support Enterprise Data Analytics function by working closely with business partners and other members of IT in building in designing and building an effective data lake in cloud

Create governance rules for the data lake for easier and efficient consumption

Design and Implement security of the data in the data lake

Design and Deploy EMR clusters in AWS for large scale processing of varied data sets

Monitor data lake performance and find ways to continuously improve query performance

Monitor data lake costs /storage and build rules around data lifecycle management

Enable master data across different domains through innovative ways to provide the single version of truth to downstream applications

Collaborate with stakeholders, create architectural artifacts necessary for implementation and   walk through solution design with other technical teams to ensure alignment


Enable innovation through technology, conduct POCs, make fact based decisions to modernize technology stack


Work with IT and business partners to train and share knowledge on data lake, master data and data governance and enable adoption of data lake/data warehouse across the organization

Skills and Qualifications

  • 12+ Years of experience in Data Modeling, Data Warehousing, Dimensional Modeling, Data Modeling for Big data and Master data Management
  • 7+ Years hands-on experience with Enterprise and Reporting modeling including Logical/ Canonical Modeling, Dimensional Modeling
  • 3+ Years of Hands-on Experience with Cloud based Storage, Data warehouses and Big data Frameworks is a Must (AWS S3, RedShift, Dynamo DB, Athena, Elastic Search and other cloud native databases and big data processing frameworks using EMR, Spark, Python, Scala)
  • 3+ Years of experience designing Data Lake to accommodate Structured and Unstructured Data for Batch and Real-Time Ingestion
  • 1+ Years of Experience using AI/ML tools to architect solutions to translate Voice to Text in Real-Time
  • AWS Solutions Architect and/or Big Data Certification highly preferred
  • Strong Communication, Analytical and Presentation skills