Principal Data Architect

Technology and R&D New York, New York Remote, United States

Position at Medidata Solutions

Your Mission

This position will be responsible for the design, implementation and development of data aggregation, integration and migration solutions for the cloud based Medidata data platform. You will be comfortable working with multiple stakeholders to turn great ideas into detailed requirements while driving the overall data platform strategy. You will also design how migrations should be implemented and help determine the best architecture to use in each use case. The perfect candidate will have strong data infrastructure, data architecture and software design skills, as well as a proven track record of implementing streaming patterns (preferably Kafka/Flink, cloud based data warehouses (preferably Snowflake) as well as creating and managing data governance processes.

Candidates should be able to

  • Provide leadership, vision and direction to the Data Engineering organization to reach the business goals and continuously evolve the data fabric.
  • Develop productive relationships with business & technology partner teams across the organization to influence how a common data platform can enable new sources of value.
  • Define and enhance methodologies and practices for the application lifecycle management in line with best practice and practical experience of continuous improvement.
  • Manage relationships with internal and external stakeholders to drive outcomes in a matrixed environment.
  • Create a collaborative culture across teams that drives successful delivery and adoption of technology and business process solutions. 

Your Competencies

  • Strong communication skills with a proven ability to understand key concepts and communicate effectively with technical staff, business stakeholders and senior management.
  • Proven ability to communicate technical concepts to nontechnical people to enhance understanding and drive decisions that lead to positive outcomes.
  • Proven ability to collaborate, build relationships and influence individuals at all levels in a matrix-management environment (as well as external vendors and service providers) to ensure that segregation and overlapping roles are identified and coordinated.
  • Strong organizational skills, the ability to perform under pressure and management of multiple priorities with competing demands for resources.
  • Expertise in one or more common data engineering programming languages such as Java, Scala, Python and prior development experience creating data pipelines.
  • Strong API development and integration expertise ideally as a Full Stack developer. GraphQL experience is a plus.
  • Proficiency in working with polyglot data stores including relational databases (eg Oracle, SQL, MySQL), NoSQL (eg MongoDB, Cassandra, Elastic), Data Warehouses (eg Redshift, Snowflake) and Graph Databases (eg Neo4j, Neptune).
  • Expert working knowledge of streaming / stream processing technologies like Kafka, Kinesis, Spark, Flink.
  • Hands-on cloud expertise (preferably AWS) with strong understanding of SaaS models and containerized workloads (using Kubernetes).
  • Experience operating in a secure networking environment, leveraging production support and SRE teams is a p.lus
  • You have a bias towards automation, an Agile/Lean mindset and embrace the Devops culture.

Your Education and Experience

  • 10+ years of experience in progressively responsible roles within the technology function.
  • Proven senior engineering leadership, identifying, designing, developing, implementing and supporting data technology solutions.
  • Experience and understanding of a variety of data analysis processes including but not limited to modeling and algorithms, machine learning, data governance, data quality and lineage.
  • Life Science/Clinical Trial Data background preferred but not required.
  •  An undergraduate or postgraduate degree in computer science / engineering or a related field. 

Medidata is making a real difference in the lives of patients everywhere by accelerating critical drug and medical device development, enabling life-saving drugs and medical devices to get to market faster. Our products sit at the convergence of the Technology and Life Sciences industries, one of most exciting areas for global innovation. Nine of the top 10 best-selling drugs in 2017 were developed on the Medidata platform. 

Medidata Solutions have powered over 17,000+ clinical trials giving us the largest collection of clinical trial data in the world. With this asset, we pioneer innovative, advanced applications and intelligent data analytics, bringing an unmatched level of quality and efficiency to clinical trials enabling treatments to reach waiting patients sooner.

Medidata Solutions, Inc. is an Equal Opportunity Employer. Medidata Solutions provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, age, disability status, protected veteran status, or any other characteristic protected by the law. Medidata Solutions complies with applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities.