Data Engineer (Healthcare Data)
Job description
The Senior Data Engineer will design, build, operate, and improve large-scale data pipelines and foundational data products that power Komodo's Healthcare Map, analytics products, and downstream AI/ML-enabled use cases. This is a hands-on engineering role focused on processing complex healthcare data at scale, improving reliability and performance, and contributing to the technical direction of core data systems.
- Architectural Advancement: Deliver high-impact technical initiatives that improve pipeline performance, scalability, and system efficiency.
- Platform Hardening: Improve the reliability, observability, and cost-efficiency of core Data Foundations systems.
- Healthcare Data Innovation: Develop deep domain expertise and contribute novel approaches to challenges such as patient journey mapping and identity resolution.
- Cross-Functional Delivery: Partner with Data Product and Engineering teams to ship scalable, production-grade data solutions.
- Partner on architecture: Raise the bar across the team through mentorship, design reviews, and engineering best practices.

- Build, operate, and optimize large-scale production data pipelines using Python, SQL, Airflow, cloud infrastructure, and distributed processing frameworks.
- Transform massive healthcare claims, EHR, and reference datasets into trusted, performant Healthcare Map data products and serving-ready data assets.
- Strengthen pipeline reliability through data quality checks, validation, lineage, observability, monitoring, and alerting.
- Debug complex data, system, and performance issues across computationally intensive workflows.
- Partner with Data Product Quality, Product, Platform, and Engineering teams to translate healthcare data needs into scalable technical solutions.
- Contribute to system design, architecture, code quality, testing, documentation, CI/CD, and rotational production support.
- Enable downstream analytics, product, and AI/ML use cases through high-quality, well-modeled, reliable data.
Requirements
- Strong hands-on experience building, operating, and debugging production-grade data pipelines at scale.
- Advanced Python and SQL skills, with experience in Airflow or similar workflow orchestration tools.
- Experience with Spark or comparable distributed data processing frameworks.
- Proven experience designing and operating data solutions in AWS.
- Strong instincts for data quality, reliability, root-cause analysis, and production troubleshooting.
- Ability to communicate technical trade-offs clearly and collaborate with engineering, product, and data partners.
- Comfort using AI-assisted engineering tools for productivity, debugging, documentation, and technical exploration.
AI-Augmented Engineering Expectations:
- You will be expected to leverage AI-augmented engineering tools, such as ChatGPT, Gemini, or Claude, to improve productivity and technical decision-making. This may include using AI to generate and refine code, accelerate documentation, automate test case creation, debug complex issues, explore unfamiliar technical concepts, and assess architectural trade-offs and risks.
Additional skills and experience we'd prioritize (nice to have)...
- Experience delivering external-facing data products through customers, APIs, serving layers, or production access patterns.
- Ability to optimize high-scale data architectures for performance, cost, versioning, and large-volume productization.
- Experience applying AI or agentic workflows to engineering, data quality, delivery, or operations.
- Success in high-growth or ambiguous environments that require balancing architecture, speed, and quality.
Open to US remote, OR SF/NYC hybrid
The pay range for each job posting reflects a minimum and maximum range of annual base pay that we reasonably expect to pay for this position within the US. We carefully consider multiple business-related factors when determining compensation, including job-related skills, work experience, geographic work location, relevant training and certifications, business needs and market demands.
Benefits & conditions
The starting annual base pay for this role is listed below. This position may be eligible for performance-based bonuses as determined in the Company's sole discretion and in accordance with a written agreement or plan. This role may also be eligible for equity awards. In addition, this role is eligible for benefits including, but not limited to, comprehensive health, dental, and vision insurance; flexible time off and holidays; 401(k) with company match; disability insurance and life insurance; and leaves of absence in accordance with applicable state and local laws and regulations and company policy.
San Francisco Bay Area and New York City:
$207,000-$238,000 USD