ETL Data Engineer

Eliassen Group

Tysons, United States of America

1 month ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Compensation

$ 183K

Job location

Tysons, United States of America

Tech stack

API

Artificial Intelligence

Amazon Web Services (AWS)

Business Analytics Applications

Confluence

Build Automation

Automation of Tests

Big Data

Configuration Management

Code Generation

Information Systems

Databases

ETL

Software Debugging

Memory Management

Hive

Python

Object-Oriented Software Development

Pair Programming

Performance Tuning

Query Optimization

Cloud Services

Software Engineering

SQL Databases

Data Logging

Google Cloud Platform

GitHub Copilot

Large Language Models

Multi-Agent Systems

Concurrency

Prompt Engineering

Spark

Data Lake

AI Platforms

PySpark

Information Technology

Wikis

Virtual Agents

Dataiku

Cloudwatch

Code Restructuring

GPT

Data Pipelines

Job description

Build and maintain ETL/ELT pipelines using Apache Spark, Hive, and Trino across S3-based data lakes.
Develop and optimize SQL for large-scale surveillance datasets using window functions, joins, and complex aggregations.
Engineer big data systems on EMR-on-EC2 and EMR-on-EKS and deliver solutions on analytical platforms such as SageMaker, Domino, or Dataiku.
Participate in data quality monitoring, anomaly detection, and production incident investigation.
Develop AI agent systems using AWS Bedrock and agent frameworks such as Strands Agents SDK or LangChain/LangGraph.
Design agent harnesses that combine LLM reasoning with deterministic execution including skill or RAG-based SQL generation and structured output validation.
Implement agent memory, context management, and tool integration including MCP servers, API connectors, and data catalog lookups.
Build evaluation frameworks for agent accuracy covering paraphrase robustness, routing precision, and structural consistency.
Stay informed on advances in LLM frameworks and emerging AI capabilities.
Write clean, well-tested code and contribute to CI/CD pipelines and infrastructure-as-code on AWS.
Ensure secure handling of sensitive regulatory data with auditable execution traces.
Adhere to secure development practices and technology policies.
Partner across teams, communicate at the appropriate technical level, and maintain documentation on Confluence or Wiki.
Learn from senior team members and contribute to process improvement.

Requirements

Experience building data pipelines with Apache Spark (PySpark preferred) and SQL.
Experience with SQL engines such as Hive or Trino and cloud data platforms including AWS S3, EMR, and Lambda.
Understanding of data skew, large-volume processing, and troubleshooting job failures due to resources, data quality, and scalability.
Hands-on debugging and mitigation experience.
Practical experience building LLM-powered agent systems that use tools and produce structured outputs.
Experience with agent frameworks such as LangChain, LangGraph, or AWS Strands.
Knowledge of prompt engineering, RAG architectures, and context or memory management.
Experience with foundation model APIs such as Anthropic Claude, Amazon Nova, or OpenAI.
Understanding of agent memory tiers and strategies for persistence, pruning, and retrieval.
Familiarity with harness patterns including deterministic guardrails, tool routing, and verification loops.
Hands-on experience with AI development tools such as GitHub Copilot, Q Developer, ChatGPT, or Claude.
Experience with spec-driven development for AI-assisted code generation and validation.
Ability to leverage AI pair programming for suggestions, debugging, refactoring, and automated test generation.
Experience with AWS services including S3, EMR, EMR on EKS, Lambda, Bedrock, and Step Functions.
Hands-on experience using S3 with Spark and related file format or consistency considerations.
Familiarity with AWS Bedrock guardrails, knowledge bases, and agent orchestration.
Exposure to Google Cloud Vertex AI or equivalent managed AI platforms.
Familiarity with AWS monitoring and logging tools such as CloudWatch and CloudTrail.
Proficiency in Python with clean, modular, and performant code and understanding of functional concepts.
Strong understanding of collections, concurrency, and memory management.
Proficiency with SQL window functions, joins, aggregations, and complex query optimization including edge cases., * Bachelor's degree in Computer Science, Data Science, Information Systems, or related discipline with at least two years of related experience, or equivalent training and work experience. Financial services experience preferred.
Demonstrated expertise in object-oriented and database technologies resulting in enterprise-quality solutions.
Knowledge of software engineering approaches including test automation, build automation, and configuration management.
Strong written and verbal technical communication skills and effective cross-team collaboration.
Ability to learn new skills rapidly and operate in a fast-paced environment.

Benefits & conditions

Our client seeks an ETL Data Engineer to build and maintain large-scale data pipelines and to design agentic AI systems that support regulatory analytics. The role spans Spark-based ETL on AWS data lake platforms and development of LLM-powered agents with secure, auditable outputs. The engineer will collaborate across teams, uphold secure development practices, and contribute to CI/CD and infrastructure-as-code while monitoring data quality and production performance.

We can facilitate w2 and corp-to-corp consultants. For our w2 consultants, we offer a great benefits package that includes Medical, Dental, and Vision benefits, 401k with company matching, and life insurance.

Rate: $70.00 to $88.00/hr. w2, Skills, experience, and other compensable factors will be considered when determining pay rate. The pay range provided in this posting reflects a W2 hourly rate; other employment options may be available that may result in pay outside of the provided range.

W2 employees of Eliassen Group who are regularly scheduled to work 30 or more hours per week are eligible for the following benefits: medical (choice of 3 plans), dental, vision, pre-tax accounts, other voluntary benefits including life and disability insurance, 401(k) with match, and sick time if required by law in the worked-in state/locality.

If anyone reaches out to you about an open position connected with Eliassen Group, please ensure that you are working directly with us by confirming the following:

· When you work with Eliassen Group, all email communication will come from an Eliassen.com address, never Gmail, Yahoo, etc.

About the company

Eliassen Group is a strategic consulting firm that helps organizations reach further and achieve more through our technology, business advisory, and life sciences solutions. For nearly 40 years, we have combined exceptional people, deep domain expertise, and intelligent capabilities to expand our clients' capacity and accelerate meaningful outcomes. We are driven by a purpose to positively impact the lives of our employees, clients, consultants, and the communities we serve.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all