Data & Software Engineer

Avalore, LLC

Chantilly, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Chantilly, United States of America

Tech stack

Java

Airflow

Amazon Web Services (AWS)

Apache HTTP Server

Bash

Big Data

Computer Programming

System Configuration

Information Engineering

ETL

Data Security

Software Debugging

Software Design Patterns

Amazon DynamoDB

Python

PostgreSQL

Metadata Repositories

MySQL

NoSQL

NumPy

Operational Databases

Performance Tuning

PostGIS

Query Optimization

Azure

Software Deployment

SQL Databases

Data Streaming

Systems Integration

Data Processing

Cloud Platform System

Spark

Git

CloudFormation

Pandas

Containerization

PySpark

Data Lineage

Terraform

Software Version Control

Data Pipelines

Docker

Job description

Work with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight
Leverage strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks
Leverage a background in large-scale data migration or platform modernization efforts
Contribute to data engineering documentation, best practices, and design patterns.

Requirements

The Data & Software Engineer works with a small team to build complex data flows for a custom application. The successful candidate will have advanced Python programming skills, familiarity with Java, an understanding of data security, privacy, governance, and compliance principles, and a demonstrated history of building production data pipelines and ETL workflows at scale. The candidate must have experience:

Building end-to-end data pipelines leveraging Python
Using orchestration tools to deploy data pipelines, including configuring and updating Spark Jobs
Containerizing and deploying applications in cloud environments like AWS.
Working with MySQL and PostgreSQL including performance tuning, schema design, and query optimization for complex, analytical workloads.
Leveraging industry-standard tools for code control (Git, IaC control, etc.)
Working with data catalogs, tracking data lineage, and handling a variety of data formats, including geospatial.
Using Bash scripting for automation and data processing tasks
Integrating AI/ML services and models

Minimum of 5 years' experience with:
Apache Spark & PySpark
Advanced Python skills (including Pandas & NumPy)
Docker, Podman
AWS S3, Lambda & Step Functions
Apache Iceberg, Airflow, etc.
SQL (with Trino)
NoSQL, DynamoDB
Unity Catalog OSS, Apache Polaris
Apache Superset
Terraform or CloudFormation
OpenLineage
H3, PostGIS

Benefits & conditions

AD&D insurance
401(k)
Health insurance
401(k) matching
Paid time off
Vision insurance
Dental insurance

Employer-Paid Health Care Plan (Medical, Dental & Vision)
Retirement Plan (401k, IRA) with a generous matching program
Life Insurance (Basic, Voluntary & AD&D)
Paid Time Off (Vacation, Sick & Public Holidays)
Short Term & Long Term Disability
Training & Development
Employee Assistance Program
