Sr. Data Engineer, Life Sciences

Technology & Analytics New York, New York

Position at Medidata Solutions

Medidata: Conquering Diseases Together


Medidata is leading the digital transformation of life sciences with the world's most-used platform for clinical development, commercial and real-world data. Powered by artificial intelligence and delivered by #1-ranked industry experts, the Intelligent Platform for Life Sciences helps pharmaceutical, biotech and medical device companies, as well as academic researchers, accelerate value, minimize risk and optimize outcomes. Medidata serves more than 1,000 customers and partners worldwide and empowers more than 100,000 certified users every day to create hope for millions of patients. Discover the future of life sciences:

We know that diverse teams win, and we are fully committed to selecting leaders and employees who represent the markets in which we operate. We are still led by our co-founders, Tarek Sherif and Glen de Vries, and have global operations in the US, Europe and Asia with over 2,000 employees.


Medidata’s Data and Analytics Group is seeking an engineer who is passionate about working with data. We’re looking for someone who is excited by data and the value that can be found in it, and who has the skills and drive to deliver that value. You will be a key contributor in growing and expanding our data platform by making our data assets usable and accessible to platform consumers.

We work with a variety of cutting-edge as well as industry-standard techniques and tools. We are looking for candidates who can both support existing technologies and apply new techniques to solve the next generation of data issues that we will encounter as we extend and expand our data platform.


In this role, you will be responsible for data acquisition from source systems, transformation, standardization, and delivery to enterprise repositories and systems. You will work with the product team to understand acquisition, transformation and delivery needs, lead efforts to understand and design for the workloads and shape of production data, and work with our architecture team to deliver solutions that are aligned with enterprise architecture plans.

You will deliver solutions that work within a DevOps delivery pipeline using infrastructure designs that are scalable, resilient and highly performant. You will develop mapping and testing artifacts that enable data movement solutions to be verified as meeting functional, non-functional and business requirements. Your mission will be to deliver efficient and error-free data movement systems and processes.


Required Skills:

  • You hold at least a bachelor’s degree in Computer Science or a related discipline
  • You have at least 3 years’ experience working with large and complex data sets
  • You are a proficient Java developer familiar with build tools such as Maven, Gradle or Ant
  • You are a proficient SQL developer
  • You have experience working with AWS tools
  • You have experience writing ETL/ELT code
  • You have experience with data profiling tools and concepts

Nice to Have:

  • Prior experience working with electronic medical/health or clinical trial data
  • Agile experience
  • Experience working with Message Bus technologies
  • Any combination of these AWS Technologies:
    • EC2 w/ EBS
    • Step Functions
    • Aurora
    • Redshift
    • Kinesis
    • EMR
    • ECS/ECR
  • Prior experience working with ETL systems (e.g. Pentaho, Talend, Informatica)
  • Oracle PL/SQL experience

Medidata is making a real difference in the lives of patients everywhere by accelerating critical drug and medical device development, enabling life-saving drugs and medical devices to get to market faster. Our products sit at the convergence of the Technology and Life Sciences industries, one of the most exciting areas for global innovation. Nine of the top 10 best-selling drugs in 2017 were developed on the Medidata platform.

Medidata’s solutions have powered over 14,000 clinical trials, giving us the largest collection of clinical trial data in the world. With this asset, we pioneer innovative, advanced applications and intelligent data analytics, bringing an unmatched level of quality and efficiency to clinical trials and enabling treatments to reach waiting patients sooner.

Medidata Solutions, Inc. is an Equal Opportunity Employer. Medidata Solutions provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, national origin, age, disability, or status as a veteran. Medidata Solutions complies with applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities.