Data Engineer

Amazon.com, Inc.
Stafford, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
$228K

Job location

Stafford, United States of America

Tech stack

Java
Geographic Information Systems
Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Apache HTTP Server
Bash
Big Data
Software Quality
Information Systems
Computer Programming
System Configuration
Information Engineering
Data Governance
Data Infrastructure
Data Integration
ETL
Data Security
Data Visualization
Software Debugging
Software Design Patterns
Digital Forensics
Distributed Computing Environment
Amazon DynamoDB
PostgreSQL
Metadata Repositories
MySQL
NoSQL
NumPy
Operational Databases
Performance Tuning
PostGIS
Query Optimization
DataOps
Azure
SQL Databases
Strategies of Testing
Speech Recognition
Enterprise Search
Data Processing
Cloud Platform System
Retrieval-Augmented Generation
Large Language Models
Spark
Topic Modeling
Data Strategy
GIT
Cloudformation
Pandas
Containerization
PySpark
Data Lineage
Data Analytics
Data Lakehouse
Terraform
Software Version Control
Data Pipelines
Docker

Job description

J5 Consulting is a Maryland-based company established in 2006 to provide computing and consulting services for government and commercial entities. Our services improve Information System networking performance and compliance and protect electronic assets from loss and compromise. We welcome your application to receive consideration for the following position.

Introduction

The Sponsor's office is architecting and creating a coherent, secure data ecosystem that is compatible and synchronized with the Sponsor's overall data strategy and broader mission capabilities. These solutions and capabilities include a platform to conduct enterprise search, digital forensics, and data analytics. The Sponsor fuses modern, data-centric tradecraft and capabilities with other components in pursuit of extracting maximum value from existing data.

Requirements

  • Demonstrated experience with Agile/Scrum development methodologies in a fast-paced, collaborative team environment.
  • Demonstrated experience working effectively in high-performing, cross-functional teams with multiple concurrent projects.
  • Demonstrated experience working directly with stakeholders to gather requirements, understand needs, and translate them into technical solutions with minimal oversight.
  • Demonstrated experience in self-directed work with a strong ownership mentality and commitment to code quality, testing, and documentation.
  • Demonstrated experience context-switching between projects and systems as priorities demand.
  • Data Engineering
  • Demonstrated experience building production data pipelines and ETL/ELT workflows at scale.
  • Demonstrated experience with Apache Spark and PySpark for distributed data processing.
  • Demonstrated advanced Python programming skills, including data manipulation libraries (Pandas, NumPy) and data engineering best practices.
  • Demonstrated understanding of data security, privacy, governance, and compliance principles.
  • Demonstrated experience with workflow orchestration tools such as Step Functions and Airflow.
  • Demonstrated experience with containerization such as Docker or Podman, and deploying data applications in cloud environments.
  • Demonstrated experience with AWS services (in particular S3, Lambda, and Step Functions).
  • Demonstrated experience with PostgreSQL and MySQL in production environments, including performance tuning and schema design.
  • Demonstrated experience with SQL and query optimization for complex analytical workloads.
  • Demonstrated experience with version control (Git) and CI/CD practices for data pipelines.
  • Demonstrated experience working with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight.
  • Demonstrated strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks.

Highly Desired Skills and Demonstrated Experience:

  • Data Engineering
  • Demonstrated experience with data lakehouse architectures using Apache Iceberg.
  • Demonstrated experience configuring, deploying, and integrating data platform components: Apache Ranger (access control and data governance); Trino (distributed SQL query engine); Data catalogs (Unity Catalog OSS, Apache Polaris, etc.); Apache Superset (data visualization and dashboarding).
  • Demonstrated experience with Bash scripting for automation and data processing tasks.
  • Demonstrated experience with Infrastructure as Code (Terraform or CloudFormation) for data infrastructure.
  • Demonstrated experience with tracking data lineage and associated tooling such as OpenLineage.
  • Demonstrated experience with Java.
  • Demonstrated experience with data quality frameworks, testing methodologies, and validation strategies.
  • Demonstrated experience or background with large-scale data migrations or platform modernization efforts.
  • Demonstrated experience integrating AI/ML services and models (translation, OCR, speech-to-text, NLP, language detection, topic modeling), LLMs, and RAG (retrieval-augmented generation) pipelines.
  • Demonstrated experience with geospatial data processing (H3, PostGIS, or similar).
  • Demonstrated experience contributing to data engineering documentation, best practices, or design patterns.
  • Demonstrated experience with NoSQL databases (DynamoDB, etc.).
  • Demonstrated excellent written and verbal communication skills with both technical and non-technical audiences.

Benefits & conditions

  • This position requires US citizenship, which will be verified to meet federal government security requirements.

Security Clearance:

  • The successful candidate must have an active U.S. Government Top Secret Security Clearance with a Full Scope Polygraph.
  • Clearance Verification: This position requires successful verification of the stated security clearance to meet federal government customer requirements. You will be asked to provide clearance verification information prior to an offer of employment.

Travel:

  • This position is expected to be onsite and will be located within the Washington Metropolitan Area (WMA). Local travel by privately owned vehicle (POV) will be on an as-needed basis, within the local place of performance.

Join J5 Consulting and Grow Your Cybersecurity Career

At J5, we're a team of innovators protecting organizations from evolving cyber threats. With 18+ years of success in government and commercial sectors, we offer meaningful opportunities to grow your career. Enjoy comprehensive benefits, including:

  • 100% employer-paid health coverage
  • a 6% 401(k) match
  • PTO
  • tuition reimbursement
  • bonuses
  • professional development, and more.
