Data Engineer

LMC Properties, Inc.

New York, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

New York, United States of America

Tech stack

Adaptable Database Systems

Artificial Intelligence

Airflow

Amazon Web Services (AWS)

Business Analytics Applications

Data analysis

Google BigQuery

Cloud Computing

Data Architecture

Data Infrastructure

ETL

Data Structures

Data Warehousing

Relational Databases

Database Queries

Electronic Data Interchange (EDI)

Python

PostgreSQL

Machine Learning

MySQL

Operational Databases

Data Streaming

Technical Data Management Systems

Feature Engineering

Fast Healthcare Interoperability Resources

Snowflake

Health Level Seven International

Operational Systems

Machine Learning Operations

Terraform

Data Pipelines

Legacy Systems

Redshift

Databricks

Job description

We are looking for a Data Engineer to build the data infrastructure powering analytics, business intelligence, operational monitoring, and AI-driven decision-making across a healthcare automation platform. You will design and maintain data pipelines, integrate fragmented healthcare and operational data sources, model complex workflows into clean data structures, and create the analytics foundation that helps the company measure, monitor, and optimize its systems. This role is ideal for someone who enjoys owning the full data stack: ingestion, transformation, modeling, analytics, alerting, and infrastructure decisions. Over time, you will have the opportunity to take end-to-end ownership of the data platform and shape the technical direction of the company's data architecture. What You'll Do Design, build, and maintain reliable data pipelines across internal systems, third-party APIs, and healthcare data sources. Integrate complex data sources such as operational systems, healthcare records, billing workflows, claims-related data, and external feeds. Build clean, scalable data models for revenue cycle workflows, intake operations, clinical processes, and business reporting. Develop analytics-ready schemas that support dashboards, reporting, monitoring, and operational decision-making. Build dashboards, alerts, and analytical tools to identify automation opportunities and operational bottlenecks. Support product, engineering, operations, and leadership teams with high-quality data infrastructure. Contribute to the technical direction of the business intelligence and data platform. Improve data quality, reliability, observability, and performance across the stack. Help create the foundation for AI/ML-driven workflows, analytics, and automation., You will build foundational data infrastructure rather than simply maintain legacy systems. Your work will directly shape how an AI healthcare platform measures and improves operations. You will work with complex, messy, high-value healthcare data. You will partner closely with product, engineering, and operations rather than sit in a silo. You will have the opportunity to define the long-term data architecture for a fast-scaling platform. You will help create analytics and automation systems that reduce administrative waste in healthcare. Work Model Full-time role. Hybrid in New York, with regular in-office collaboration expected. Visa sponsorship is not available for this position.

Requirements

3+ years of experience building and maintaining production data pipelines, warehouses, and analytics infrastructure. Strong SQL skills. Experience with at least one modern data warehouse or lakehouse such as Databricks, Redshift, Snowflake, BigQuery, or similar. Experience with relational databases such as Postgres or MySQL. Experience building ETL or ELT pipelines using tools such as Airflow, dbt, Dagster, or similar orchestration frameworks. Comfort working across the full data lifecycle, from raw ingestion through analytics-ready models. Ability to model complex operational domains into clean, queryable schemas. Comfort with exploratory analysis, statistical thinking, and analytical problem-solving. Experience using Python or R for data analysis. Strong communication skills and ability to explain technical data concepts to non-technical stakeholders. Ability to partner effectively with product, engineering, and operational teams. High ownership, detail orientation, and comfort operating in a fast-paced startup environment. Nice to Have Experience in healthcare technology or healthcare operations. Familiarity with healthcare data standards or data types such as EDI, HL7, FHIR, claims data, clinical documentation, or provider workflows. Experience with real-time or streaming data pipelines. Background in data science, machine learning, or feature engineering for AI/ML systems. Exposure to BI tools such as Hex, Sigma, or similar. Experience working in a high-growth startup or scale-up environment. Familiarity with cloud infrastructure and tooling such as AWS, Terraform, or similar.

About the company

We are representing a high-growth AI startup building infrastructure to automate complex administrative workflows in healthcare. The company is creating an AI-native platform that integrates with provider systems and helps automate areas such as insurance verification, prior authorization, billing, intake, revenue cycle operations, and clinical workflow support. This is a rare opportunity for a data engineer to build core data infrastructure from the ground up in a complex, high-impact domain where clean data, reliable pipelines, and operational visibility directly shape product and business decisions., © 2026 Careerjet All rights reserved