Data & Software Engineer

Avalore, LLC
Chantilly, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Chantilly, United States of America

Tech stack

Java
Airflow
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Apache HTTP Server
Bash
Big Data
Computer Programming
System Configuration
Information Engineering
ETL
Data Security
Software Debugging
Software Design Patterns
Amazon DynamoDB
Python
PostgreSQL
Metadata Repositories
MySQL
NoSQL
NumPy
Operational Databases
Performance Tuning
PostGIS
Query Optimization
Azure
Software Deployment
SQL Databases
Data Streaming
Systems Integration
Data Processing
Cloud Platform System
Spark
GIT
Cloudformation
Pandas
Containerization
PySpark
Data Lineage
Terraform
Software Version Control
Data Pipelines
Docker

Job description

  • Work with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight
  • Leverage strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks
  • Leverage a background in large-scale data migration or platform modernization efforts
  • Contribute to data engineering documentation, best practices, and design patterns.

Requirements

Do you have experience in Version control?, The Data & Software Engineer works with a small team to build complex data flows for a custom application. Successful candidate will have advanced Python programming skills, familiarity with Java, an understanding of data security, privacy, governance and compliance principles and a demonstrated history of building production data pipelines and ETL workflows at scale. Candidate must have experience:

  • Building end-to-end data pipelines leveraging Python
  • Using orchestration tools to deploy data pipelines, including configuring and updating Spark Jobs
  • Containerizing and deploying applications in cloud environments like AWS.
  • Working with MySQL and PostgreSQL including performance tuning, schema design, and query optimization for complex, analytical workloads.
  • Leveraging industry standard tools for code control (Git, IaaC control, etc.)
  • Working with data catalogs, tracking data lineage and handling a variety of data formats, including Geospatial.
  • Using Bash scripting for automation and data processing tasks
  • Integrating Al/ML services and models, Minimum of 5 years' experience with:
  • Apache Spark & PySpark
  • Advanced Python skills (including Pandas & NumPy)
  • Docker, Podman
  • AWS S3, Lambda & Step functions
  • Apache Iceberg, Airflow, etc.
  • SQL (with Trino)
  • NoSQL, DynamoDB
  • Unity Catalog OSS, Apache Polaris
  • Apache Superset
  • Terraform or CloudFormation
  • OpenLineage
  • H3, PostGIS

Benefits & conditions

Pulled from the full job description

  • AD&D insurance
  • 401(k)
  • Health insurance
  • 401(k) matching
  • Paid time off
  • Vision insurance
  • Dental insurance, * Employer-Paid Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k, IRA) with a generous matching program
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation, Sick & Public Holidays)
  • Short Term & Long Term Disability
  • Training & Development
  • Employee Assistance Program

Apply for this position