Principal Data Engineer
Job description
- Lead enterprise-level data architecture and platform strategy, establishing standards, best practices, and scalable frameworks across multiple projects and teams.
- Design and deliver robust, production-grade data pipelines supporting batch, streaming, and real-time use cases with built-in data quality, observability, and governance.
- Architect and enable data foundations for AI and LLM-driven applications, including unstructured data ingestion, embedding pipelines, and vector-based retrieval systems.
- Own end-to-end technical program execution, breaking down complex initiatives into structured deliverables while aligning cross-functional teams to business outcomes.
- Serve as the primary technical advisor in client-facing engagements, translating ambiguous requirements into clear architectural solutions and technical roadmaps.
- Define and implement cloud data strategies within AWS, including data lakes, warehouses, streaming platforms, and AI-ready infrastructure.
- Establish data contracts, governance frameworks, and performance standards to ensure reliability, scalability, and consistency across platforms.
- Drive DevOps and DataOps best practices, including infrastructure-as-code, automation, and cost optimization strategies.
- Mentor and develop engineering talent, providing technical guidance, conducting code and architecture reviews, and elevating overall team capability.
- Contribute to business development efforts by supporting solution design, technical proposals, and client presentations as a senior technical voice.
Requirements
Our client is seeking a highly strategic and hands-on Principal Data Engineer who thrives in complex, ambiguous environments and can translate business needs into scalable, production-ready data and AI solutions. This individual will operate as both a technical authority and program leader, driving enterprise data architecture, mentoring engineering teams, and delivering high-impact solutions across multiple workstreams. The ideal candidate brings a strong blend of deep technical expertise, systems thinking, and the ability to influence both technical and executive stakeholders.
- 7+ years of hands-on data engineering experience, including at least 3 years in a lead, architect, or principal-level role.
- Proven expertise designing and deploying scalable ETL/ELT pipelines, data platforms, and streaming architectures in production environments.
- Deep experience with AWS data services such as S3, Redshift, Glue, Lake Formation, Athena, EMR, Kinesis, Lambda, and Step Functions.
- Demonstrated experience supporting AI/ML or LLM data workflows, including data preprocessing, embedding generation, and vector database integration.
- Strong background in solution architecture and client engagement, with the ability to translate business needs into technical designs.
- Experience leading technical programs, including planning, estimation, and managing workstreams using tools such as Jira.
- Advanced understanding of data governance, lineage, and data quality best practices within enterprise environments.
- Proficiency with modern data tooling such as Databricks, dbt, and orchestration frameworks (e.g., Airflow or Prefect).
- Experience with infrastructure-as-code and DevOps practices using tools such as Terraform or similar frameworks.
- Bachelor's degree in Computer Science, Data Engineering, or a related field, or equivalent practical experience.