Senior Data Scientist II

Acorn AI New York, New York Boston, Massachusetts

Position at Medidata Solutions

Medidata: Conquering Diseases Together

Medidata is leading the digital transformation of life sciences, creating hope for millions of patients. Medidata helps generate the evidence and insights to help pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers accelerate value, minimize risk, and optimize outcomes. More than one million registered users across 1,400 customers and partners access the world's most-used platform for clinical development, commercial, and real-world data. Medidata, a Dassault Systèmes company, is headquartered in New York City and has offices around the world to meet the needs of its customers. Discover more at

Acorn AI is one of the largest AI companies exclusively dedicated to life sciences. It’s built on Medidata’s platform that includes the industry’s largest structured, standardized and growing clinical trial data repository consisting of 20,000+ trials and 6M patients. Our team is composed of over 40 PhD/Masters statisticians, data scientists, analytical product leads, former FDA biostatisticians and computational genomicists.

Your Mission:

Acorn AI is looking for data scientists who will help us tackle some of the most complex questions facing the industry today using our proprietary Acorn platform and advanced analytics. In this role, you will research and develop statistical models. At Acorn AI, we never work alone. This role will partner heavily with all of the key stakeholder functions including Product, Data Science, Engineering, Partnerships and Biostatistics. Successful candidates will be skilled in analytical/quantitative thinking, structured communication and excited about building the next horizon of Medidata’s journey of powering smarter treatments and healthier people. 

  • Lead analytical projects and apply strategic thinking to solve some of the most complex problems in healthcare, translating complex data into meaningful insights

  • Design, develop and validate statistical and machine learning models for novel medical applications. Areas of team focus include design, optimization and meta-analysis of clinical trials 

  • Define successful metrics and demonstrate impact and value to both technical and non-technical audiences

  • Provide support functions around model-building, including data cleaning, code review, & reporting

  • Productionalize developed methods and code for integration with existing/new products

  • Work directly with our team comprised of the brightest minds in technology, research and mathematics as well as senior interfaces from leading life sciences companies across the globe

Your Competencies:

  • Fluency in programming languages (Python, R, SQL) that allow you to be self-sufficient in analyzing data

  • Proficiency with machine learning techniques (e.g., classification, regression, feature selection, etc.) and ability to independently execute analytical experiments, predictive model builds and to support deployments

  • Demonstrated ability to define the problem and solution in an ambiguous setting, and ability to identify and integrate new datasets to solve complex and unique issues 

  • Demonstrated ability to exercise independent judgment in methods, techniques and evaluation criteria for obtaining results

  • Demonstrated ability to collaborate with all levels of data science, technology personnel and senior leadership

  • Excellent presentation and communication skills to all levels of technical & non-technical audiences

  • Entrepreneurial spirit and commitment to creating rigorous, high-quality insights from data at scale 

Your Education & Experience:

  • Masters or Ph.D. in Math, Statistics, Data Science, Computer Science, Physics, Bioinformatics or another quantitative field with strong foundation in statistics

  • 5+ years of experience with statistical analysis & predictive modeling

  • Experience in leading analytical projects from ideation stage to delivery and/or productization, as well as reviewing others’ work

  • Experience using Git version control

  • Experience with clinical trial data and/or large healthcare datasets is a plus

  • Experience with machine learning engineering, AWS ML suite and/or deep learning is a plus  

Note: The requirements should reflect your minimum requirements for the role in general

Medidata is making a real difference in the lives of patients everywhere by accelerating critical drug and medical device development, enabling life-saving drugs and medical devices to get to market faster. Our products sit at the convergence of the Technology and Life Sciences industries, one of most exciting areas for global innovation. Nine of the top 10 best-selling drugs in 2017 were developed on the Medidata platform. 

Medidata Solutions have powered over 20,000+ clinical trials giving us the largest collection of clinical trial data in the world. With this asset, we pioneer innovative, advanced applications and intelligent data analytics, bringing an unmatched level of quality and efficiency to clinical trials enabling treatments to reach waiting patients sooner.

Medidata Solutions, Inc. is an Equal Opportunity Employer. Medidata Solutions provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, age, disability status, protected veteran status, or any other characteristic protected by the law. Medidata Solutions complies with applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities.