Senior Data Engineer

Job ID 2023-5869

Technology Navi Mumbai, Maharashtra


Description

Position at WebMD

About the Company:
Headquartered in El Segundo, Calif., Internet Brands® is a fully integrated online media and software services organization focused on four high-value vertical categories: Health, Automotive, Legal, and Home/Travel. The company's award-winning consumer websites lead their categories and serve more than 250 million monthly visitors, while a full range of web presence offerings has established deep, long-term relationships with SMB and enterprise clients. Internet Brands' powerful, proprietary operating platform provides the flexibility and scalability to fuel the company's continued growth. Internet Brands is a portfolio company of KKR and Temasek.
WebMD Health Corp., an Internet Brands Company, is the leading provider of health information services, serving patients, physicians, health care professionals, employers, and health plans through our public and private online portals, mobile platforms, and health-focused publications. The WebMD Health Network includes WebMD Health, Medscape, Jobson Healthcare Information, prIME Oncology, MediQuality, Frontline, QxMD, Vitals Consumer Services, MedicineNet, eMedicineHealth, RxList, OnHealth, Medscape Education, and other owned WebMD sites. WebMD®, Medscape®, CME Circle®, Medpulse®, eMedicine®, MedicineNet®, theheart.org®, and RxList® are among the trademarks of WebMD Health Corp. or its subsidiaries.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.

For company details visit our website: www.webmd.com / www.internetbrands.com

Education: B.E. Computer Science/IT degree (or any other engineering discipline)
Experience: 4+ years
Work timings: 2 PM to 11 PM IST

Position Requirements:

  • 4+ years of experience with RDBMS databases such as Oracle, MSSQL or PostgreSQL
  • 2+ years of experience with Pentaho Data Integration or any ETL tools such as Talend, Informatica, DataStage or HOP.
  • Working knowledge of orchestration tools such Oozie and Airflow
  • Experience working in both OLAP and OLTP environments
  • Experience working on-prem, not just cloud environments
  • Experience working with teams outside of IT (i.e. Application Developers, Business Intelligence, Finance, Marketing, Sales)
  • Experience managing or developing in the Hadoop ecosystem is preferred
  • Programming background with either Python, Scala, Java or C/C++ is a plus
  • Experience with Spark. PySpark, SparkSQL, Spark Streaming, etc
  • Strong in any of the Linux distributions, RHEL, CentOS or Fedora
  • Experience using reporting and Data Visualization platforms (Tableau, Pentaho BI) is good to have
  • Web analytics or Business Intelligence a plus
  • Understanding of Ad stack and data (Ad Servers, DSM, Programmatic, DMP, etc)



Role & Responsibilities:

  • Work within our on-prem Hadoop ecosystem to develop and maintain ETL jobs
  • Design and develop data projects against RDBMS such as PostgreSQL 
  • Implement ETL/ELT processes using various tools (Pentaho) or programming languages (Java, Python) at our disposal 
  • Analyze business requirements, design and implement required data models
  • Lead data architecture and engineering decision making/planning.
  • Translate complex technical subjects into terms that can be understood by both technical and non-technical audiences.