Data Engineer

Oasys Inc.
Fairfax, United States of America
16 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Remote
Fairfax, United States of America

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Cloud Computing
Cloud Storage
Information Systems
Databases
System Configuration
Continuous Integration
Data Auditing
Information Engineering
Data Governance
Data Integration
Data Integrity
ETL
Data Transformation
Data Stores
Data Warehousing
Relational Databases
DevOps
Github
Python
PostgreSQL
NoSQL
Oracle Applications
Performance Tuning
Query Optimization
Ansible
Shell Script
SQL Databases
Data Streaming
Data Processing
Scripting (Bash/Python/Go/Ruby)
Delivery Pipeline
Electronic Medical Records
GIT
Cloudformation
Amazon Web Services (AWS)
PySpark
Information Technology
Amazon Web Services (AWS)
Data Analytics
Functional Programming
Redshift

Job description

The Data Engineer will be pivotal in designing, developing, and maintaining robust ETL (Extract, Transform, Load) processes to ensure seamless data flow between our diverse data sources and target data stores. You will be responsible for building and optimizing automated pipelines, ensuring data quality, and accommodating future data format changes. This position requires a strong technical foundation and a proactive approach to problem-solving., * Pipeline Design & Development: Design, develop, and implement scalable and efficient ETL pipelines using modern data integration tools and technologies.

  • Data Transformation: Transform and cleanse data from various sources (databases, APIs, cloud storage, etc.) to ensure accuracy, consistency, and compliance with data governance policies.
  • Data Store Management: Develop and maintain optimized data models and data warehousing solutions utilizing platforms like Oracle, PostgreSQL, Redshift, and EMR. Focus on performance tuning and query optimization.
  • Automation & Monitoring: Build and maintain automated ETL jobs, incorporating robust monitoring and alerting mechanisms for proactive issue detection and resolution.
  • Data Quality Assurance: Implement data quality checks and validation rules throughout the ETL process to guarantee data integrity.
  • Documentation: Create and maintain comprehensive documentation for ETL processes, data models, and system configurations.
  • Communication & Presentation: Effectively communicate complex technical concepts to both technical and non-technical stakeholders. Develop and deliver clear, concise presentations, reports, and data insights to support decision-making and drive business outcomes.
  • Collaboration: Work closely with business stakeholders and other teams to understand data requirements and deliver effective solutions.
  • Future-Proofing: Proactively assess and implement changes to data integration processes to accommodate evolving data formats, sources, and business needs. Ensuring designs accommodate potential future data changes.
  • All other duties as assigned by leadership.

Requirements

  • ETL development with Glue ETL, Python, Pyspark, RDS.
  • Solid understanding of data governance principles and data quality best practices.
  • Ability to work independently and as part of a collaborative team in an Agile environment.
  • Excellent problem-solving, analytical, and communication skills.

Required Education and Experience:

  • Bachelor's degree in Computer Science, Information Systems, or a related field.
  • 10+ years of experience in data integration, ETL development, and data warehousing.
  • Strong proficiency in SQL and experience with relational databases (e.g. Oracle, PostgreSQL) and NoSQL databases.
  • Experienced with scripting languages such as Python or Shell scripting for automation and data manipulation.
  • Experienced with cloud technologies, including AWS Glue, Lambda, CloudFormation/Ansible, S3, Redshift, and EMR.
  • Experienced with Git, GitHub, CI/CD pipelines for DevOps and data engineering.

Required Certifications:

  • AWS certification (Minimum - Cloud Practitioner, AI Practitioner)

Clearance Requirement:

  • Must be a U.S. Citizen (No dual citizenship will be accepted)
  • Ability to obtain a favorable Public Trust investigation

About the company

Who We Are: Oasys International LLC (Oasys) is a fast-growing federal government contractor delivering high-quality technology consulting and professional services to civilian, defense, and homeland security agencies. We have been recognized on Inc. 5000's list of the fastest-growing companies in America for five consecutive years and named a Best Places to Work in Virginia for the past two years. Our success is driven by a talented team of technologists, consultants, engineers, and subject-matter experts who support complex federal missions with integrity and excellence. At Oasys, we foster a collaborative, merit-based culture that values continuous learning, professional growth, and work-life balance. We are committed to creating an inclusive, engaging environment where employees are recognized for their contributions and empowered to build meaningful, long-term careers.

Apply for this position