Data Engineer

Nastech Global, Inc.
Washington, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Remote
Washington, United States of America

Tech stack

Clean Code Principles
Java
API
Amazon Web Services (AWS)
Audit Trail
Cloud Computing
Computer Programming
Databases
Data Architecture
Information Engineering
ETL
Data Manipulation Languages
Data Security
Data Systems
Data Warehousing
R
Hadoop
Python
NoSQL
Systems Development Life Cycle
Scala
Software Construction
Software Engineering
SQL Databases
Data Storage Technologies
Spark
SC Clearance
Data Lake
Kafka
Data Management
Data Pipelines

Job description

The work involves data layering and data applications and is considered surge work taken over from other contractors the customer was unhappy with. Funding for the program is currently guaranteed only through February at this time.

Requirements

Education: 8 years experience with a BS/BA, 6 years with a MS/MA, 3 years with a PhD or 12 years of experience in lieu of a degree Demonstrates strong expertise in AWS architecture, security in the SDLC, and communication of technical risk and architecture decisions to executive audiences. Experience designing and building Data Lakes, Data Warehouses, and scalable data platforms in the cloud Experience with programming skills like Java, SQL, Scala, Python, R, defining schema and software engineering best practices including secure, testable, and maintainable code and database (eg SQL, NoSQL, Hadoop, Spark, Kafka, Kinesis), Experience designing and building Data Lakes, Data Warehouses, and scalable data platforms in the cloud The successful candidate should have of experience in data engineering, including building data pipelines and warehouses Supports technical activities across the contract, and drives continuous improvement and innovation into program operations. Advises on technology alternatives related to processing, data storage, data access, application development, and enterprise architecture decisions as needed. Establish coding best practices and align complex data architectures with overarching business goals. Use APIs to push and pull data from various data systems and platforms Excellent listening, interpersonal, communication and problem solving skills. General data manipulation skills: read in data, process and clean it, transform and recode it, merge different data sets together, reformat data between wide and long, etc. Build and optimize ETL/ELT pipelines using glue and python to move and transform data across fabric. Package, curate, and version datasets for LM fine-tuning and model training jobs on Bedrock and Sagemaker. Enforce CUI handling requirements, automate PII detection and maintain Audit trails, e.g. Amazon Macie.

What you'll need:

Education: 8 years experience with a BS/BA, 6 years with a MS/MA, 3 years with a PhD or 12 years of experience in lieu of a degre Clearance: active Secret clearance Demonstrates strong expertise in AWS architecture, security in the SDLC, and communication of technical risk and architecture decisions to executive audiences. Experience designing and building Data Lakes, Data Warehouses, and scalable data platforms in the cloud Experience with programming skills like Java, SQL, Scala, Python, R, defining schema and software engineering best practices including secure, testable, and maintainable code and database (eg SQL, NoSQL, Hadoop, Spark, Kafka, Kinesis) Demonstrate a deep understanding of data engineering principles and techniques. Ability to work effectively in teams, in both a lead and support role. Ability to learn new techniques and troubleshoot code without support, ex. find answers to common programming challenges on Google etc. Ability to work effectively in teams, in both a lead and support role as needed Must be local to the Washington DC Metro area

Apply for this position