Data Scientist

Job ID 2022-3783



Position at WebMD

WebMD is the most recognized and trusted brand of health information and the leading provider of health information services, serving consumers, physicians, healthcare professionals, employers and health plans through our public and private online portals and WebMD the Magazine. The WebMD Health Network includes WebMD, Medscape, MedicineNet, eMedicine, RxList, and Medscape Education. Our consumer portals and mobile health applications provide engaging, relevant and credible health and wellness information, personalized health assessment tools and access to online communities.

WebMD is an Equal Opportunity/Affirmative Action employer and does not discriminate on the basis of race, ancestry, color, religion, sex, gender, age, marital status, sexual orientation, gender identity, national origin, medical condition, disability, veterans status, or any other basis protected by law.

Education: B.Tech/B.E. / MSc Statistics

Experience: 3 + years  

Position Requirements:  

  • 3+ years of hands on experience in Python development, NLP libraries (Spacy / NLTK / Scikit-Learn etc.)  
  • Knowledge about NLP (stemming, lemmatization, TFIDF etc.)  
  • Excellent problem-solving skills with a strong understanding of statistics and machine learning algorithms  
  • Experience in development using a server, Github  
  • Excellent command over English language 
  • Experience with Deep Learning modelization applied to text (e.g.: Bert / BioBert framework) is good to have  
  • Previous experience with the SnowFlake data warehouse will be an added advantage  
  • Knowledge about less classic Machine Learning methodologies of NLP (such as LDA, NMF  etc. is good to have  
  • Experience with large / complex codebase is good to have  
  • Experience in working on SQL  Ability to do clean code in Python (class oriented, PEP8, list comprehensions etc.)  
  • Experience in Docker / CI/CD is good to have  
  • Experience with API development using the Flask framework  
  • Experience with data visualization tools (e.g.: plotly)  
Role & Responsibilities:  

Develop a clinical & non-clinical content tagging solution in order to deliver a 1 to 1 personalized experience to healthcare professionals.  
Build the best affinity score to personalize the 360° user experience.  
Improve churn index predictive model in order to grow the overall platform audience.  
Independently execute and lead analytical projects and assignments  
Help solve challenging Healthcare related NLP issues with state-of-the-art algorithms.