Data Analyst
Role details
Job location
Tech stack
Job description
We are seeking a highly analytical Data Analyst to support a large enterprise healthcare organization with analytics initiatives spanning Commercial, Medicare, and Medicaid populations. This role will focus on generating actionable datasets, supporting ad hoc business requests, and enabling predictive modeling initiatives such as member-level risk scoring. This is a hands-on data role requiring advanced SQL expertise, strong analytical curiosity, and the ability to explore and validate complex healthcare datasets., * Write highly complex SQL queries across Databricks and legacy Oracle systems, utilizing advanced constructs like multi-table JOINs, nested subqueries, CTEs, and window functions.
- Build member-level analytic datasets by synthesizing claims, eligibility, utilization, and risk data.
- Troubleshoot and refine queries for performance and accuracy in large-scale environments.
- Construct clean, structured datasets to support predictive modeling use cases, including feature engineering from raw data.
- Validate dataset integrity prior to model training through null analysis, duplication checks, and distribution validation.
- Partner with Data Scientists to ensure features align with modeling objectives.
- Perform deep exploratory analysis to uncover trends, patterns, anomalies, and data quality issues in healthcare datasets.
- Translate ambiguous stakeholder requests into structured queries and datasets.
- Develop recurring extracts and analytic views for operational and executive reporting.
- Collaborate with BI Developers to ensure reporting accuracy and consistency.
Requirements
Education: Bachelor's Degree in Computer Science, Mathematics, Information Systems, or a related field.
Technical Skills: Strong SQL skills are required, with the ability to write large, complex queries quickly and accurately. Experience with Python for analysis and visualization (e.g., pandas, matplotlib) is also necessary.
Experience: Experience working with relational databases and understanding table relationships is required. Candidates must demonstrate strong analytical thinking, curiosity, and the ability to interpret healthcare datasets and explain findings clearly., * Master's Degree in Data Analytics, Statistics, Mathematics, Computer Science, Public Health, or a related field.
- PySpark experience.
- Basic data modeling knowledge.
- Experience with Databricks.
- Exposure to healthcare claims or population health data.
- Familiarity with basic data pipeline scheduling and transformations.