Data Scientist

The Surgical

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Remote

Tech stack

Artificial Intelligence

Data analysis

Cloud Computing

Collaborative Software

Data Validation

Data Visualization

Factor Analysis

Statistical Hypothesis Testing

Python

PostgreSQL

Machine Learning

MongoDB

NoSQL

NumPy

SciPy

Software Engineering

SQL Databases

GIT

Pandas

Scikit Learn

Information Technology

Data Management

Data Pipelines

Job description

Remote Hiring Remotely in USA Mid level Remote Hiring Remotely in USA Mid level The Data Scientist will lead analytical workstreams, design clinical studies, develop scoring methodologies, build data pipelines, and collaborate with clinical teams to enhance surgical performance metrics., * Study design and execution: Design and run clinical validation studies - correlating AI-derived metrics with surgical outcomes (e.g., complications, resection extent, procedure duration)

Scoring methodology: Develop and refine composite scoring algorithms (PCA-weighted, Bayesian, or other approaches) that summarize multi-dimensional surgical performance into interpretable scores
Statistical modeling: Apply appropriate statistical methods (logistic regression, mixed effects, survival analysis, dimensionality reduction) to clinical datasets with clustered, sparse, and heterogeneous data
Data pipeline development: Build and maintain Python pipelines that extract, transform, and analyze data from MongoDB, PostgreSQL, and S3 at scale (hundreds to thousands of procedures)
Data quality and integrity: Design and implement data validation checks, investigate discrepancies across data sources, and ensure reproducibility of analyses
Clinical collaboration: Work directly with surgeons and clinical researchers to define metrics, interpret results, and refine tools based on clinical feedback
Reporting and communication: Produce analysis reports, methodology documentation, and presentations for internal teams, clinical partners, and external stakeholders

Requirements

This role requires independent judgment about statistical methodology, comfort working with messy real-world clinical data, and the ability to communicate complex findings to both technical and clinical audiences., * Master's degree (or equivalent experience) in statistics, biostatistics, data science, computer science, or a related quantitative field

2+ years of experience in applied data science or quantitative research
Strong Python skills for data analysis and pipeline development (pandas, NumPy, SciPy, scikit-learn)
Solid understanding of statistical methods: regression, hypothesis testing, dimensionality reduction (PCA/factor analysis), bootstrap inference
Experience with SQL databases (PostgreSQL preferred) and NoSQL databases (MongoDB)
Ability to work independently on ambiguous problems - scoping analyses, choosing methods, and communicating trade-offs
Strong written communication - ability to produce clear reports for both technical and non-technical audiences
Experience with Git and collaborative software development practices

Preferred