Data Scientist
Role details
Job location
Tech stack
Job description
Join us as our Data Scientist at Dedalus, one of the World's leading healthcare technology companies, in France to do the best work of your career and make a profound impact in providing better care for a healthier planet. What you'll achieve As a Data Scientist, you will play a key role in ensuring the accuracy, reliability, and usability of clinical and operational data within the Analytics data pipeline. Your work will directly support data-driven insights and clinical research initiatives. You will:
- Analyze and assess Electronic Medical Record (EMR) database structures to design and implement efficient SQL queries and scripts for data extraction, transformation, and loading (ETL).
- Design and implement data quality checks to identify inconsistencies, resolve issues, and ensure high data integrity across all sources.
- Identify gaps and inefficiencies in the existing data pipeline and propose or implement improvements to enhance data usability and performance.
- Query and analyze the analytics data sets to extract and validate data that aligns with specific clinical trial requirements.
- Use Jupyter notebooks for advanced data exploration, complex queries, and reproducible analyses.
- Maintain clear and comprehensive documentation for data pipelines, workflows, and related processes to ensure transparency and knowledge sharing across teams.
Requirements
Do you have experience in Software development?, * A degree in Computer Science, Data Science, Statistics, Biomedical Engineering, or a related quantitative discipline.
- 3+ years of programming experience in Python, with a focus on data manipulation, analysis, automation and data visualisation.
- Strong proficiency in SQL and experience working with large, complex relational databases (preferably EMR systems).
- Solid understanding of data pipelines, ETL processes, and data modelling concepts.
- Conceptual knowledge of the software development lifecycle (e.g. unit testing, optimization, scalability, continuous integration, debugging, and documentation).
- Applied knowledge of data quality management and validation techniques.
- Hands-on experience using Jupyter Notebooks for data analysis, visualisation, and reproducible research.
- Strong analytical and problem-solving skills, with attention to detail and data accuracy.
- Effective communication skills in both English and French.
- Proficiency in the following technologies and tools: Python, Relational databases and SQL, Jupyter Notebooks, Version control systems (e.g., Git) and Containerized environments (e.g., Docker)
Desirable Requirements
- Team spirit, open mindset and the ability to collaborate effectively with people from different cultural backgrounds
- Knowledge in following: Jira, OpenSearch database
- Familiarity with clinical data and the functioning of clinical trials
- Healthcare standards, including HL7-FHIR, DICOM, and most used code systems in the healthcare field(SNOMED, ICD, LOINC)