Data Engineer
Role details
Job location
Tech stack
Job description
We live in a world where most information is readily available at our fingertips. You might be surprised to learn the situation in hospitals is rather different. Systems used for electronic healthcare records (EHR) facilitate the clinical workflow, retrieving and storing information for each patient. There is, however, a large need for secondary usage of this data. Think of linking patients to clinical trials, medical research studies, and care quality assessment. For this purpose IQVIA has developed a search engine enabling medical professionals to find patient cohorts and collect data. In all that we do, we have the doctors and patients at the fore of our mind, and we also ensure to strictly adhere to privacy regulations.
Patient Finder is currently used on a daily basis by 40+ hospitals in The Netherlands and Belgium and is expanding into other international markets. The product is a scale-up within a larger corporation, meaning the product is proven while there is still room to make your own mark and propose creative solutions., As a data engineer you will design, implement, and improve data pipelines in hospitals with a wide variety of data sources, mostly on-premise, with plans to become cloud-ready for international customers in the coming years. You will be working in a collaborative environment as part of a six-person data warehousing team. The team is responsible for a wide range of tasks, including data warehouse design, data mapping, ETL pipeline development, and providing support for new user requests and ETL-related issues. You will be actively involved in all of these activities, with the opportunity to specialize in a particular area if you wish.
Additionally, you will be part of a larger, cross-functional and dynamic team consisting of software engineers, designers, medical consultants and NLP engineers., * Collaborate on the design and development of the Patient Finder datawarehouse & dataflow ecosystem using SQL and Python
- Design & develop new testable python data pipelines primarily in a Prefect-driven architecture
- Contribute to & provide support for existing pipelines either by providing operational support, python module maintenance or adjusting SQL layers
- Develop SQL data transformations and connectors for new medical data sources
Your Profile
You are a dedicated professional driven by a passion for transforming healthcare and making a meaningful impact. You thrive in data-centric environments, confidently mapping data and building robust data pipelines. With a strong desire for continuous learning and personal growth, you bring curiosity and clarity to complex, ambiguous challenges. You excel in self-organizing teams and embrace autonomy in your work.
Requirements
Do you have experience in Scalability?, * Deep expertise in Python, with a solid grasp of Object-Oriented Programming and abstraction design.
- Proficiency in modular development practices.
- Hands-on experience designing and maintaining data pipelines using tools such as Prefect, Airflow, Luigi, or Dagster.
- A strong focus on testability and scalability, often applying Test-Driven Development (TDD) principles.
- Confidence in writing SQL for both data definition (DDL) and data manipulation (DML).
- Extensive expertise in at least one relational database system, preferably Microsoft SQL Server or PostgreSQL
- The ability to clearly articulate and translate complex concepts to fellow data engineers and cross-functional teams.
- Excellent communication skills in written and spoken English.
Nice to have:
- Experience or affinity with monitoring system, like DataOps (testing, CI/CD, deployment)
- Eagerness to learn more and stay up to date with data engineering best practices
- Affinity with the medical domain
- Knowledge of medical data standards such as FHIR and/or OMOP