Data Engineer
Role details
Job location
Tech stack
Job description
The work involves data layering and data applications and is considered surge work taken over from other contractors the customer was unhappy with. Funding for the program is currently guaranteed only through February at this time.
Requirements
Education: 8 years experience with a BS/BA, 6 years with a MS/MA, 3 years with a PhD or 12 years of experience in lieu of a degree Demonstrates strong expertise in AWS architecture, security in the SDLC, and communication of technical risk and architecture decisions to executive audiences. Experience designing and building Data Lakes, Data Warehouses, and scalable data platforms in the cloud Experience with programming skills like Java, SQL, Scala, Python, R, defining schema and software engineering best practices including secure, testable, and maintainable code and database (eg SQL, NoSQL, Hadoop, Spark, Kafka, Kinesis), Experience designing and building Data Lakes, Data Warehouses, and scalable data platforms in the cloud The successful candidate should have of experience in data engineering, including building data pipelines and warehouses Supports technical activities across the contract, and drives continuous improvement and innovation into program operations. Advises on technology alternatives related to processing, data storage, data access, application development, and enterprise architecture decisions as needed. Establish coding best practices and align complex data architectures with overarching business goals. Use APIs to push and pull data from various data systems and platforms Excellent listening, interpersonal, communication and problem solving skills. General data manipulation skills: read in data, process and clean it, transform and recode it, merge different data sets together, reformat data between wide and long, etc. Build and optimize ETL/ELT pipelines using glue and python to move and transform data across fabric. Package, curate, and version datasets for LM fine-tuning and model training jobs on Bedrock and Sagemaker. Enforce CUI handling requirements, automate PII detection and maintain Audit trails, e.g. Amazon Macie.
What you'll need:
Education: 8 years experience with a BS/BA, 6 years with a MS/MA, 3 years with a PhD or 12 years of experience in lieu of a degre Clearance: active Secret clearance Demonstrates strong expertise in AWS architecture, security in the SDLC, and communication of technical risk and architecture decisions to executive audiences. Experience designing and building Data Lakes, Data Warehouses, and scalable data platforms in the cloud Experience with programming skills like Java, SQL, Scala, Python, R, defining schema and software engineering best practices including secure, testable, and maintainable code and database (eg SQL, NoSQL, Hadoop, Spark, Kafka, Kinesis) Demonstrate a deep understanding of data engineering principles and techniques. Ability to work effectively in teams, in both a lead and support role. Ability to learn new techniques and troubleshoot code without support, ex. find answers to common programming challenges on Google etc. Ability to work effectively in teams, in both a lead and support role as needed Must be local to the Washington DC Metro area