Research Data Engineer II (IT@JH Univ Data Analytics & Engineering&#x29

Johns Hopkins University
Baltimore, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 175K

Job location

Remote
Baltimore, United States of America

Tech stack

Query Performance
Data analysis
Computer Programming
Databases
Data Architecture
Data Governance
Data Integration
ETL
Data Warehousing
Database Development
Database Queries
Python
Operational Databases
Raw Data
Software Systems
Unstructured Data
Web Applications
Microsoft Fabric
Data Analytics
Data Management
Epic Caboodle
Data Pipelines
Databricks

Job description

IT@JH Univ Data Analytics & Engineering is seeking a Research Data Engineer II who will support research investigators and Research Leadership across Johns Hopkins University and Johns Hopkins Medicine by designing and deploying complex data architectures and supporting data integration, curation, and analysis. The Research Data Engineer works with cloud lake house environments such as Databricks and Microsoft Fabric, as well as modern ETL tools and Python to manage and support data pipelines for enterprise research data products. Assignments are primarily project-based, involving direct engagement with end-users to understand requirements, facilitate modeling sessions, write business and technical requirements, and implement solutions. The role functions with a high degree of independence under the general supervision of the IT Director of Research Data, with work assigned through project goals and reviewed based on solution outcomes., * Contribute to the design, production, and maintenance of data pipelines for data acquisition, management, transformation, and back-end code development to power data web applications and convert raw data into usable information.

  • Write and maintain ETL/ELTs that operate on a variety of structured and unstructured sources.
  • Develop and maintain web data scraping systems for automatic data acquisition.
  • Help design data architecture and provide ongoing support.
  • Input/output data from databases and perform queries.
  • Create scripts to clean, transform, and analyze data.
  • Put into production data pipelines using data warehousing systems.
  • Create and implement production software to monitor data quality and detect data anomalies.
  • Perform daily manual data quality assurance tasks.
  • Support, maintain, and troubleshoot the software infrastructure.
  • Source data, conduct analyses, visualize data, and generate insights to support ongoing research projects and other requests across the organization.
  • Collaborate with developers, analysts, data scientists, researchers, policy experts, and other partners.
  • Communicate with Division leadership, and others on the team.
  • Collaborate with external partners, contractors, and vendors.
  • Other duties as assigned.

Requirements

  • Bachelor's Degree.
  • Five years of related work experience focused within database management and design, and business requirements gathering.
  • Additional education may substitute for required experience and additional related experience may substitute for required education permitted by the JHU equivalency formula beyond a high school diploma/graduation equivalent, to the extent permitted by the JHU equivalency formula., * Experience with data standards such as controlled vocabularies (e.g., SNOMED, LOINC, ICD) and the OMOP common data model
  • Experience working with EHR data, particularly EPIC data models such as Clarity and Caboodle
  • Experience working with a variety of data types such as semi-structured and unstructured data
  • Experience working in a highly decentralized, consensus-driven environment, such as an academic institution
  • Experience directly engaging with end users to understand requirements and implement successful architectures
  • Thorough knowledge of data warehouse and data management principles and processes and database development
  • Strong proficiency in SQL programming, query writing, query performance tuning, and database technologies

Apply for this position