Tech Lead
Description
We are looking for a Google Cloud Lead Data Engineer, They will be responsible for building a solid Data Integration pipelines by integration various source data which will enable data delivery in near real-time using next-gen technologies.
Title: Google Cloud Lead Data Engineer
Roles and Responsibilities:
Design, develop, and maintain scalable data pipelines on Google Cloud Platform (GCP).
Implement data processing solutions using GCP services such as BigQuery, Python, Dataproc, Dataflow, Pub/Sub, and Cloud Storage, Cloud Function and other GCP services.
Optimize data processing and storage for performance, cost, and scalability in GCP BQ.
Ensure data quality and integrity by implementing best practices for data governance and monitoring.
Develop and maintain documentation for data pipelines, and ETL Pipeline processes.
Stay up-to-date with the latest advancements in data engineering and GCP technologies.
Proven experience as a Data Engineer with a focus on Google Cloud Platform (GCP).
Strong programming skills in Pyspark and Python and additional similar languages.
Experience with SQL and relational databases.
Familiarity with data modeling, ETL processes, and data warehousing concepts.
Excellent problem-solving skills and attention to detail.
Strong communication and collaboration skills.
Preferred Skills :
BigQuery, Python, Dataproc, Dataflow, Pub/Sub, Cloud composer/Airflow, Github
Google Cloud Data Engineer certification is a good to have.
A solid grip over programming languages, like Python, Scala, C++, etc.
In-depth knowledge about SQL databases and ability to execute queries quickly.
Knowledge of data warehousing and data modeling.
Fundamentals of Google Kubernetes engine to design and develop Microservices for real-time data feeds.
Conducting end-to-end analyses, including data collection, processing, and analysis.