Data Scientist - Cleared

GRVTY, LLC

Chantilly, United States of America

17 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

Remote

Chantilly, United States of America

Tech stack

API

Artificial Intelligence

Amazon Web Services (AWS)

Data analysis

Information Engineering

ETL

DevOps

Design of User Interfaces

Information Extraction

Python

Linux System Administration

Machine Learning

Natural Language Processing

Open Source Technology

Systems Integration

Workflow Management Systems

Large Language Models

Spark

Reliability of Systems

Kubernetes

Kafka

Data Pipelines

Docker

Job description

Our team is charged with taking commercial and academic innovation high-side, in domains of Artificial Intelligence / Machine Learning (AI / ML), Natural Language Processing (NLP). We also bring the best ideas, tools, and approaches in technology infrastructure (AWS, DevOps, etc.) to the IC. The tech stack used is extremely broad - anything cutting edge in the commercial market, the open source community, or the academic research community is likely involved: and if something isn't being looked at yet, you can make that happen.

This effort supports ALL missions of the Intelligence Community, including cyber-related data science missions. A seamless group of contractor and customer personnel work to create innovations that supply customer groups with the data sets, models, algorithms, software, and infrastructure they need to increase their mission success. Management is hands off, gives the team the freedom to explore new approaches, and markets the best ideas and results to all the other IC customers.

This project regularly needs various types of people - Data Scientists, Data / ETL Engineers, Analytic Software Engineers, Full Stack Developers, UI/UX Developers, and AWS/DevOps experts. We're particularly interested in people with any of the following experience, GRVTY is seeking a Data Engineer to join one of our top projects in Chantilly, VA. This role, requires working with a team of developers, data scientists, SMEs, and cyber analysts to design, develop, build, and analyze data management systems. The data engineer will work with and analyze our client's challenges and provide solutions by designing and implementing batch and streaming data pipelines.

Design, develop, and maintain Python-based data processing pipelines and workflow orchestration solutions for large-scale text ingestion, transformation, and enrichment.
Develop and implement AI-powered agentic workflows and LLM-integrated applications to automate data triage, classification, analysis, and processing tasks.
Build, enhance, and maintain reusable AI capabilities, prompt frameworks, and agent-based services that support enterprise data analysis platforms.
Integrate and operationalize Large Language Models (LLMs) to deliver retrieval, reasoning, summarization, information extraction, and decision-support capabilities.
Troubleshoot, optimize, and scale text-processing pipelines to ensure data quality, system reliability, and efficient AI-driven workflows.
Design and develop APIs and backend services that connect AI models, data pipelines, and mission applications.
Collaborate with cross-functional teams including data scientists, software engineers, and product stakeholders to prototype, test, and deploy AI-enabled solutions.
Develop Python-based automation tools and services supporting data engineering, workflow orchestration, model integration, and operational efficiencies.
Support the deployment and maintenance of production-scale AI and machine learning solutions in mission-focused environments.
Evaluate emerging AI technologies and recommend enhancements that improve analytical capabilities and operational outcomes.

Requirements

Active TS/SCI with Polygraph Clearance
Develop and perform ETL on large unstructured datasets.
Experience with Python
Experience with services including Apache Kafka, Apache Spark, and Prefect
Experience containerizing applications using Docker and deployments on Kubernetes
Building and maintaining CI/CD pipelines for data and platform services
Familiarity with Linux-based systems
Solid understanding of DevOps principles (automation, monitoring, reliability)

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all