Data Engineer

Jonus Group

Seattle, United States of America

2 months ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Seattle, United States of America

Tech stack

Big Data

Computer Programming

Data Architecture

Information Engineering

Data Governance

Data Infrastructure

Distributed Data Store

Python

Machine Learning

Operational Data Store

Operational Databases

Scala

SQL Databases

Data Streaming

Feature Engineering

Spark

Backend

Build Management

Data Lake

Core Data

Real Time Data

Data Management

Data Pipelines

Job description

A rapidly growing technology organization is seeking a Senior Data Engineer to help design and build the data infrastructure that powers advanced analytics and machine learning systems across the business.

This role focuses on transforming complex, large-scale datasets into reliable, production-ready data products used by analytics teams, machine learning engineers, and business stakeholders. The ideal candidate enjoys solving difficult data problems, building scalable systems, and working closely with both engineering and analytics teams.

Many of the data challenges involve high-volume transactional and operational data commonly found in regulated industries such as insurance and financial services, where accuracy, data governance, and reliability are critical., * Design and build scalable data pipelines to ingest, process, and transform large datasets

Develop distributed data processing workflows using Python, Spark, and Scala
Build data models and transformation layers that support analytics and machine learning applications
Collaborate with data scientists and ML engineers to prepare datasets for predictive modeling and advanced analytics
Improve data quality, monitoring, and reliability across the data platform
Optimize performance of large-scale data pipelines and processing frameworks
Contribute to the design of data architecture, schema standards, and governance practices
Work with cross-functional teams to integrate data from multiple internal and external sources

Requirements

6+ years of experience in data engineering or backend data platform development
Strong programming experience in Python
Experience building distributed data pipelines using Apache Spark
Working knowledge of Scala in production data environments
Experience handling large-scale datasets in cloud or distributed environments
Familiarity with machine learning data preparation, feature pipelines, or ML infrastructure
Strong SQL and data modeling skills
Experience working with modern data platforms (data lakes, distributed processing frameworks, etc.), * Experience supporting machine learning workflows or feature engineering pipelines
Exposure to insurance, financial services, or other regulated industry data environments
Experience working with cloud-based data platforms
Familiarity with streaming or real-time data processing, * Building the core data infrastructure that supports analytics and predictive modeling
Enabling machine learning teams with reliable, high-quality datasets
Designing scalable pipelines capable of processing large volumes of operational data
Improving the reliability and usability of enterprise data assets

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all