Data Engineer II, AAE

Amazon.com, Inc.
Bellevue, United States of America
6 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 179K

Job location

Bellevue, United States of America

Tech stack

Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Big Data
Software Quality
Databases
Information Engineering
Data Infrastructure
ETL
Data Stores
Graph Database
Identity and Access Management
Python
BIG-IP Global Traffic Manager (GTM)
Spark
Electronic Medical Records
Build Management
AI Platforms
Amazon Web Services (AWS)
Real Time Data
Non-relational Database
Data Management
Virtual Agents
Amazon Web Services (AWS)
Data Pipelines
Redshift

Job description

Design and build end-to-end data platforms for new AWS AI services - defining schemas, data models, ETL pipelines, and analytics infrastructure where none exists today Build and maintain production ETL/ELT pipelines using AWS Glue, Airflow, Spark, and Python to source data from operational, commercial, and telemetry systems into unified data models Develop agentic data workflows - automated reporting pipelines that leverage AI/ML to generate business insights, WBR summaries, and anomaly detection without manual intervention Create event-driven data architectures using CDK, Lambda, SNS/SQS, and S3 event notifications to support real-time data ingestion and processing Build executive dashboards and self-serve analytics using QuickSight that serve VP/GM-level leadership across multiple service lines Own revenue data accuracy - implement and validate revenue attribution models, discount calculations, and financial data pipelines that feed CFO-mandated reporting Design data models that support both operational analytics (feature adoption, customer health, churn signals) and financial reporting (revenue, billing, forecasting) Collaborate with Product Managers, Finance, Service Engineering, GTM, and Data Science teams to translate business questions into scalable data solutions Optimize pipeline performance - reduce runtimes, eliminate redundant processing, and improve SLA compliance across production workloads Mentor engineers, contribute to team standards, and drive a culture of automation, code quality, and operational excellence

A day in the life As a Data Engineer on this team, you will design data models for newly launched AWS AI services, build and deploy ETL pipelines to onboard telemetry and revenue data, and validate data accuracy across financial reporting systems. On any given day, you may be architecting a CDK-based event-driven pipeline, collaborating with Product Managers to define launch metrics, resolving data discrepancies surfaced by Finance, or optimizing production queries that feed into VP-level weekly business reviews. Your deliverables ship to production on a regular cadence and are consumed directly by senior leadership for strategic decision-making.

About the team The AI Services Data Engineering team builds the data infrastructure behind AWS's Agentic AI portfolio - Amazon Bedrock, AgentCore, QuickSight, Q Business, Kendra, Kiro, and Transform. Our data powers the metrics and reporting that flow up to Amazon's CEO and CFO, supporting S-Team level visibility into Agentic AI revenue, adoption, and growth. We build automated WBR reporting with agent-generated summaries, revenue attribution models for multi-billion dollar pricing programs, and launch telemetry platforms for new GA services. We ship weekly, operate across multiple VP orgs, and value automation over manual work, clean data models over quick fixes, and engineers who own their domain end-to-end.

Requirements

5+ years of data engineering experience

  • 3+ years of developing and operating large-scale data structures for business intelligence analytics using ETL/ELT processes experience
  • 3+ years of developing and operating large-scale data structures for business intelligence analytics using data modeling experience
  • Experience with data modeling, warehousing and building ETL pipelines, Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, and IAM roles and permissions
  • Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
  • Experience providing technical leadership and mentoring other engineers for best practices on data engineering

Benefits & conditions

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, WA, Bellevue - 132,100.00 - 178,800.00 USD annually

About the company

AWS AI Services is one of the largest and fastest-growing business units within AWS, powering services like Amazon Bedrock, AgentCore, QuickSight, Q Business, Kendra, and Kiro. Our Data Engineering team builds the intelligence infrastructure that makes this portfolio measurable - from revenue attribution and launch telemetry to agent-generated business reviews that serve VP-level leadership weekly. We are looking for an experienced, self-driven Data Engineer to join a team that operates at the intersection of data engineering and agentic AI. In this role, you won't just build pipelines - you'll design data platforms that power AI agents, build automated reporting systems that replace manual processes, and create the data foundations that prove business impact across a multi-billion dollar service portfolio. You'll work with modern AWS-native data stacks (Glue, Redshift, Athena, QuickSight, Bedrock, SageMaker), build event-driven architectures with CDK, and contribute to agentic workflows that generate executive-level insights autonomously. You should be comfortable operating in ambiguity, designing data models from scratch for new services, and making architectural trade-off decisions that scale.

Apply for this position