Data Engineer (Python)
Job description
At Datavant, our Life Sciences technology powers clinical research, real-world evidence generation, pharmacovigilance, and data collaboration across the healthcare ecosystem. As a Software Engineer in our Life Sciences Tech Hub, you will design and build systems that enable secure, privacy-preserving access to high-quality healthcare data used by pharmaceutical companies, biotech innovators, and research organizations worldwide. Your work will directly support the development of new therapies, accelerate clinical trials, and improve patient outcomes by ensuring that critical data is interoperable, scalable, and trustworthy.
Careers at Datavant offer the chance to apply your technical expertise and problem-solving skills in support of our mission: enabling every healthcare decision to be powered by the right data, at the right time, in the right format. We're looking for problem-solvers, game-changers, innovators, dreamers, doers - people who are ready to move the needle and build on our success.
Why join Datavant's tech team?
- You'll collaborate with other engineering leaders on all matters that impact the Engineering team, including resourcing and building technology/product vision
- You'll have the opportunity to coach and mentor colleagues, including code reviews, higher-level software design, and direct management
- The team works on a technical stack which includes both cloud and on-premise deployments, big-data ingestion and analytics, distributed systems, and algorithmic complexity.
As a Data Engineer, you will build and scale health data pipelines that enable data discoverability, linkage, privacy, machine learning, and analytics across the organization. You will work with Python, Spark, and Snowflake to design efficient data models and reliable processing frameworks. This role provides the opportunity to shape foundational data infrastructure that supports regulatory-grade health decisions at scale.
What You'll Do
- Design and maintain scalable data pipelines using Python, Spark, and Airflow
- Ensure pipeline reliability, data quality, and observability
- Partner with Science and Product to define and prioritize data requirements
- Implement automated testing and optimize processing performance and cost
- Design and maintain data validation frameworks aligned with regulatory requirements
Requirements
- Bachelor's degree in Computer Science, Engineering, Mathematics, or related field, or equivalent practical experience
- 2+ years of data engineering or software engineering experience for junior candidates; 5+ years for senior candidates
- Professional experience building data pipelines in Python
- Hands-on experience with distributed data processing frameworks such as Apache Spark and with orchestration tools such as Airflow
- Experience working with cloud data warehouses such as Snowflake
- Experience integrating agentic code generation tools such as Claude Code or Codex into day-to-day software engineering practices
- Experience designing dimensional or analytics data models
- Experience working in AWS-based data environments
- Experience implementing data governance and quality frameworks
- Understanding of architectural considerations for on-premises deployments, including networking and security constraints
- Knowledge of application security best practices (vulnerability management, security posture management, and risk assessment) and compliance frameworks (HIPAA, GDPR, HITRUST)
Benefits & conditions
- Competitive and equitable salary
- Hybrid work model, with two days per week based in our Gràcia office
- Daily in-office lunch stipend and access to a fully stocked kitchen
- Comprehensive private health insurance, including out-of-network reimbursement options
- 25 days of annual paid leave
- Six-week "Work from Anywhere" policy
To ensure the safety of patients and staff, many of our clients require post-offer health screenings and proof and/or completion of various vaccinations such as the flu shot, Tdap, COVID-19, etc. Any requests to be exempted from these requirements will be reviewed by Datavant Human Resources and determined on a case-by-case basis. Depending on the state in which you will be working, exemptions may be available on the basis of disability, medical contraindications to the vaccine or any of its components, pregnancy or pregnancy-related medical conditions, and/or religion.