Data Engineer, Enterprise Data, Analytics and Innovation, Digital Engagement

Digital Engagement United States


Description

Position at Vaniam Group

 

Data Engineer, Enterprise Data, Analytics and Innovation, Digital Innovation

 

What You'll Do

Are you passionate about building robust data infrastructure and enabling innovation through engineering excellence? As our Data Engineer, your goal is to own and evolve the foundation of our data infrastructure. You will be central in ensuring data reliability, scalability, and accessibility across our lakehouse and transactional systems. This role is ideal for someone who thrives at the intersection of engineering and innovation, ensuring our data platforms are robust today while enabling the products of tomorrow.

 

A Day in the Life

Lakehouse and Pipelines

  • Design, build, and operate reliable ETL and ELT pipelines in Python and SQL
  • Manage ingestion into Bronze, standardization and quality in Silver, and curated serving in Gold layers of our Medallion architecture
  • Maintain ingestion from transactional MySQL systems into Vaniam Core to keep production data flows seamless
  • Implement observability, data quality checks, and lineage tracking to ensure trust in all downstream datasets

Data Modeling and Governance

  • Develop schemas, tables, and views optimized for analytics, APIs, and product use cases
  • Apply and enforce best practices for security, privacy, compliance, and access control, ensuring data integrity across sensitive healthcare domains
  • Maintain clear and consistent documentation for datasets, pipelines, and operating procedures

Integration of New Data Sources

  • Lead the integration of third-party datasets, client-provided sources, and new product-generated data into Vaniam Core
  • Partner with product and innovation teams to build repeatable processes for onboarding new data streams
  • Ensure harmonization, normalization, and governance across varied data types (scientific, engagement, operational)

Analytics and Predictive Tools

  • Collaborate with the innovation team to prototype and productionize analytics, predictive features, and decision-support tools
  • Support dashboards, APIs, and services that activate insights for internal stakeholders and clients
  • Work closely with Data Science and AI colleagues to ensure engineered pipelines meet modeling and deployment requirements

Reliability and Optimization

  • Monitor job execution, storage, and cluster performance, ensuring cost efficiency and uptime
  • Troubleshoot and resolve data issues, proactively addressing bottlenecks
  • Conduct code reviews, enforce standards, and contribute to CI/CD practices for data pipelines

 

What You Must Have

Education and Experience

  • 5+ years of professional experience in data engineering, ETL, or related roles
  • Strong proficiency in Python and SQL for data engineering
  • Hands-on experience building and maintaining pipelines in a lakehouse or modern data platform
  • Practical understanding of Medallion architectures and layered data design

 

Skills and Competencies

  • Familiarity with modern data stack tools, including:
    • Spark or PySpark
    • Workflow orchestration (Airflow, dbt, or similar)
    • Testing and observability frameworks
    • Containers (Docker) and Git-based version control
  • Excellent communication skills, problem-solving mindset, and a collaborative approach

 

What You Might Have, but Isn't Required

  • Experience with Databricks and the Microsoft Azure ecosystem
  • Expertise with Delta Lake formats, metadata management, and data catalogs
  • Familiarity with healthcare, scientific, or engagement data domains
  • Experience exposing analytics through APIs or lightweight microservices

 

The Team You'll Work Closest With

You will collaborate closely with the innovation team to prototype and productionize analytics solutions. Your main contacts will be Data Science and AI colleagues, product and innovation leaders, and internal stakeholders who rely on data-driven insights. You will work remotely with flexibility, growth opportunities, and the ability to influence how data shapes the future of medical communications, helping to turn raw data into client-ready insights that enable measurable healthcare impact.

Top of Form

 

Why You’ll Love Us:   
  • 100% remote environment with opportunities for local meet-ups   
  • Positive, diverse, and supportive culture   
  • Passionate about serving clients focused on Cancer and Blood diseases
  • Investment in you with opportunities for professional growth and personal development through Vaniam Group University
  • Health benefits – medical, dental, vision
  • Generous parental leave benefit 
  • Focused on your financial future with a 401(k) Plan and company match   
  • Work-Life Balance and Flexibility   
  • Flexible Time Off policy for rest and relaxation   
  • Volunteer Time Off for community involvement   
  • Emphasis on Personal Wellness 
  • Virtual workout classes 
  • Discounts on tickets, events, hotels, child care, groceries, etc.   
  • Employee Assistance Programs
 
Salary offers are based upon several factors including experience, education, skills, training, demonstrated qualifications, location, and organizational need. The range for this role is $110,000 - $125,000. Salary is one component of the total earnings and rewards package offered.    
      
About Us:   
Vaniam Group is a people-first, purpose-driven, independent network of healthcare and scientific communications agencies committed to helping biopharmaceutical companies realize the full potential of their compounds in the oncology and hematology marketplace. Founded in 2007 as a virtual-by-design organization, Vaniam Group harnesses the talents and expertise of team members around the world. For more information, visit www.VaniamGroup.com.   
 
Applicants have rights under Federal Employment Laws to the following resources: 
Bottom of Form