Data Engineer

CGT

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Tech stack

Business Analytics Applications

Data analysis

Azure

Business Intelligence

Big Data

Health Informatics

Cloud Computing

Cloud Database

Cloud Engineering

Information Systems

Computer Programming

Databases

Data Architecture

Data Validation

Information Engineering

Data Governance

Data Integration

ETL

Data Warehousing

IBM DB2

Relational Databases

DevOps

Fault Tolerance

Healthcare Effectiveness Data and Information Set

Python

Machine Learning

Meta-Data Management

Oracle Applications

Performance Tuning

Systems Development Life Cycle

Power BI

Cloud Services

Cloudera

Software Engineering

SQL Databases

Tableau

Teradata

Unstructured Data

Enterprise Data Management

Software Organization

Data Processing

Scripting (Bash/Python/Go/Ruby)

Google Cloud Platform

Cloud Platform System

Spark

PySpark

Information Technology

Data Lineage

Enterprise Integration

Tools for Reporting

Data Pipelines

Databricks

Job description

The Senior Data Engineer is responsible for designing, developing, and maintaining enterprise data platforms, ETL pipelines, and analytics solutions that support regulatory reporting, business intelligence, and data-driven decision-making. This role plays a critical part in ensuring the efficient movement, transformation, and accessibility of data across multiple systems and cloud environments. Primary responsibilities include developing scalable data integration solutions, supporting analytics and reporting platforms, and collaborating with business and technical stakeholders to deliver high-quality data products., * Design, develop, and maintain scalable ETL and ELT pipelines using Python, PySpark, SQL, and cloud-based technologies.

Build and support enterprise data integration solutions that move and transform data across multiple platforms and applications.
Develop and optimize data pipelines utilizing technologies such as Apache Spark, Databricks, and cloud-native data services.
Design, implement, and maintain data models and architectures that support reporting, analytics, and regulatory requirements.
Create and support business intelligence solutions using Power BI and Tableau, with an emphasis on Power BI development.
Collaborate with business stakeholders, analysts, and technical teams to translate requirements, user stories, and design specifications into technical solutions.
Develop fault-tolerant, scalable, and maintainable data engineering solutions following software development best practices.
Support cloud-based data platforms and services across Google Cloud Platform (Google Cloud Platform) and Microsoft Azure environments.
Perform data validation, quality assurance, troubleshooting, and performance optimization activities.
Work with structured and unstructured data from enterprise systems, databases, and third-party sources.
Participate in all phases of the Software Development Life Cycle (SDLC), including requirements gathering, design, development, testing, deployment, and support.
Maintain technical documentation, data lineage, and operational procedures for enterprise data solutions.

Requirements

Bachelor's Degree in Computer Science, Information Systems, Data Engineering, Software Engineering, or a related technical field.
Minimum of 5 years of experience in data engineering, software engineering, or business intelligence development.
At least 3 years of experience developing ETL and data integration pipelines using Python, PySpark, and SQL.
At least 3 years of experience with Apache Spark, Databricks, or comparable big data technologies.
At least 3 years of experience developing reports and dashboards using Power BI or Tableau.
At least 3 years of experience working with enterprise databases such as Oracle, Teradata, DB2, or similar platforms.
Experience working with cloud platforms, including Google Cloud Platform (Google Cloud Platform) and Microsoft Azure.
Strong understanding of data architecture, data modeling, and data integration best practices.
Experience participating in full Software Development Life Cycle (SDLC) processes.

Preferred

Experience within healthcare, health insurance, population health, or regulatory reporting environments.
Experience supporting HEDIS reporting and quality measurement initiatives.
Experience with Informatica or similar enterprise ETL tools.
Experience with cloud-native analytics and data warehouse technologies.

Special Requirements

Strong proficiency in Python, PySpark, SQL, and Unix/Linux environments.
Ability to support cloud-based data platforms and enterprise reporting systems.
Ability to work effectively in a fast-paced, collaborative environment.
Participation in after-hours support activities may be required as needed.

Knowledge, Skills, and Abilities

Advanced knowledge of data engineering, ETL development, and enterprise data architecture.
Strong programming skills in Python, PySpark, SQL, and scripting technologies.
Experience with Apache Spark, Databricks, Dataproc, and cloud-based data services.
Strong knowledge of relational databases, data warehousing, and data modeling concepts.
Proficiency with Power BI and Tableau reporting platforms.
Experience working with Google Cloud Platform and Azure cloud environments.
Knowledge of software engineering principles, SDLC methodologies, and DevOps practices.
Strong analytical and problem-solving skills.
Excellent troubleshooting and performance optimization abilities.
Strong written and verbal communication skills.
Ability to manage multiple priorities and deliver high-quality solutions within established timelines.
Strong attention to detail and commitment to data quality and accuracy.

Additional Desired Characteristics

Experience supporting healthcare analytics, regulatory reporting, or quality measurement programs.
Familiarity with HEDIS, healthcare data models, and clinical or claims data.
Experience with enterprise data governance, metadata management, and data quality frameworks.
Exposure to machine learning, predictive analytics, or advanced data science initiatives.
Cloud certifications related to Google Cloud Platform, Azure, Databricks, or data engineering are highly desirable.

Work Environment

Hybrid or remote work environment based on organizational requirements.
Primarily office-based work with extensive use of computer systems and cloud-based technologies.
Occasional travel may be required for team meetings, training, or project activities.
Ability to support critical data processing activities during scheduled maintenance windows when necessary.

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all