Data Scientist
Role details
Job location
Tech stack
Job description
- Develop and deploy predictive models, including classification, regression, and risk scoring.
- Design, train, and validate machine learning models using structured healthcare datasets.
- Select appropriate algorithms based on problem framing, data characteristics, and business objectives.
- Implement end-to-end modeling workflows from data extraction through deployment-ready artifacts.
- Perform feature engineering and data enrichment within data pipelines.
- Conduct hyperparameter tuning and apply rigorous validation techniques such as cross-validation.
- Evaluate model performance using appropriate statistical metrics and assess model stability.
- Perform quantitative analysis on large-scale datasets to detect patterns and utilization trends.
- Partner with business stakeholders to translate analytical findings into operational insights.
- Ensure compliance with data privacy and regulatory requirements.
Requirements
A position is available for a Data Scientist to develop predictive models and advanced analytics solutions across large and complex healthcare datasets. This role will focus on applying statistical and machine learning methodologies to generate actionable insights that support clinical, operational, and financial decision-making. The ideal candidate brings strong quantitative expertise and hands-on modeling experience with diverse healthcare data sources., Education: Master's Degree in Data Science, Statistics, Applied Mathematics, Computer Science, or a related quantitative field.
Technical Skills:
- Strong Python and SQL skills.
- Deep understanding of statistics, including distributions, variance, and hypothesis testing.
- Experience with machine learning algorithms such as XGBoost, logistic regression, and tree-based models.
- Experience with hyperparameter tuning and model validation techniques.
- Knowledge of algorithms and data structures, including sorting, searching, and graph traversal fundamentals., * PhD in Data Science, Statistics, Applied Mathematics, Computer Science, or a related quantitative field.
- Experience with Databricks or Spark.
- Healthcare analytics experience, such as with claims, population health, or Medicare Advantage.
- Experience building inference pipelines.
- Experience designing and implementing RAG-based pipelines using Azure OpenAI.
- Ability to evaluate and select vector databases based on performance and scalability needs.
- Experience in prompt engineering and parameter tuning.