Research Data Engineer II
Job description
With general supervision and guidance, participates in the design, implementation, and maintenance of informatics tools and solutions to support and accelerate clinical research. Primarily focuses on developing and supporting the data quality assurance and quality control applications, processes, and protocols of the URMC OMOP database and its de-identified version. Leverages advanced analytics by adopting, deploying, and improving AI/ML models (including LLMs) to enrich OMOP clinical data and enhance its research readiness. Develops, evaluates, and customizes open-source tools to support and promote the use of the OMOP database for observational healthcare studies. Works in partnership with CTSI Informatics team members and the Information System Division (ISD) at URMC to provide robust, scalable informatics solutions to the research enterprise.
- Builds, maintains, and evolves data quality evaluation standards, processes, and workflows to monitor and report data accuracy, consistency, and plausibility. Determines required changes to the Extract, Transform, and Load (ETL) pipelines for improved data quality.
- Works with research team members to convert business and technical requirements into professional software solutions. Ensures timely completion of tasks while managing multiple assignments, project timelines, and business user expectations.
- Designs and implements custom, research project-specific data workflow solutions for data collection, management, reporting, and analytics. Contributes to scientific research.
- Participates in the specification, implementation, and execution of testing procedures to ensure the quality of deliverables and the reliability of systems and data workflows.
- Adheres to defined application development life-cycle practices, including, but not limited to, requirements gathering, writing test plans, source code management, peer code review, and quality assurance through unit/system/user acceptance testing.
- Produces and maintains comprehensive technical documentation for all systems under the Engineer's responsibility.
- Keeps abreast of current application developments through continuing education, professional reading, online forums, conferences, workshops, and professional groups.
- Other duties as assigned
Requirements
- Bachelor's degree in Data Science, Biomedical Science, Computer Science, Mathematics, Statistics, or similar discipline required
- 2 years of experience in technology- and data-intensive roles and environments, or equivalent combination of education and experience required
- Programming experience in SQL and at least one other applicable language (Java, Python, C, and/or R) required
- Experience implementing and supporting analytical tools in a scientific research or clinical context (e.g., OHDSI HADES) preferred
- Experience with the OMOP Common Data Model preferred
- Familiarity with medical vocabularies (e.g., ICD-10-CM, CPT4, LOINC, RxNorm, SNOMED) and reporting clinical data from Epic databases (e.g., Clarity, Caboodle) preferred
- Experience with version control solutions (e.g., Git) preferred
- Programming experience with numeric computing software (e.g., MATLAB, Julia) preferred
- Experience with Linux, container, and cloud technologies (e.g., HPC, Docker, Kubernetes, IaaS, and PaaS) preferred
- Experience with Change Management solutions preferred
KNOWLEDGE, SKILLS AND ABILITIES
- Understanding of data analytics and statistical methods required
- Expertise in software engineering best practices such as version control and software release management required
- Strong analytical and problem-solving skills required
- Strong organizational skills required
- Ability to work with others in a matrix management environment required
- Excellent communication skills for describing progress and challenges to stakeholders required
- Attention to detail, patience, and a positive, customer-centric attitude required
- Strong technical presentation skills required
- Demonstrated ability to develop proficiency with unfamiliar toolsets required
Benefits & conditions
$77,216.00 - $115,824.00
The referenced pay range represents the minimum and maximum compensation for this job. Individual annual salaries or hourly rates will be set within the job's compensation range and will be determined by considering factors including, but not limited to, market data, education, experience, qualifications, expertise of the individual, and internal equity considerations.