Data Scientist - Cleared

GRVTY, LLC
Chantilly, United States of America
17 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Remote
Chantilly, United States of America

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Data analysis
Information Engineering
ETL
DevOps
Design of User Interfaces
Information Extraction
Python
Linux System Administration
Machine Learning
Natural Language Processing
Open Source Technology
Systems Integration
Workflow Management Systems
Large Language Models
Spark
Reliability of Systems
Kubernetes
Kafka
Data Pipelines
Docker

Job description

Our team is charged with taking commercial and academic innovation high-side in the domains of Artificial Intelligence / Machine Learning (AI / ML) and Natural Language Processing (NLP). We also bring the best ideas, tools, and approaches in technology infrastructure (AWS, DevOps, etc.) to the IC. The tech stack is extremely broad: anything cutting edge in the commercial market, the open source community, or the academic research community is likely involved, and if something isn't being looked at yet, you can make that happen.

This effort supports ALL missions of the Intelligence Community, including cyber-related data science missions. A seamless group of contractor and customer personnel works to create innovations that supply customer groups with the data sets, models, algorithms, software, and infrastructure they need to increase their mission success. Management is hands-off, gives the team the freedom to explore new approaches, and markets the best ideas and results to all the other IC customers.

This project regularly needs various types of people: Data Scientists, Data / ETL Engineers, Analytic Software Engineers, Full Stack Developers, UI/UX Developers, and AWS/DevOps experts. We're particularly interested in people with any of the experience described below. GRVTY is seeking a Data Engineer to join one of our top projects in Chantilly, VA. This role requires working with a team of developers, data scientists, SMEs, and cyber analysts to design, develop, build, and analyze data management systems. The data engineer will analyze our client's challenges and provide solutions by designing and implementing batch and streaming data pipelines.

  • Design, develop, and maintain Python-based data processing pipelines and workflow orchestration solutions for large-scale text ingestion, transformation, and enrichment.
  • Develop and implement AI-powered agentic workflows and LLM-integrated applications to automate data triage, classification, analysis, and processing tasks.
  • Build, enhance, and maintain reusable AI capabilities, prompt frameworks, and agent-based services that support enterprise data analysis platforms.
  • Integrate and operationalize Large Language Models (LLMs) to deliver retrieval, reasoning, summarization, information extraction, and decision-support capabilities.
  • Troubleshoot, optimize, and scale text-processing pipelines to ensure data quality, system reliability, and efficient AI-driven workflows.
  • Design and develop APIs and backend services that connect AI models, data pipelines, and mission applications.
  • Collaborate with cross-functional teams including data scientists, software engineers, and product stakeholders to prototype, test, and deploy AI-enabled solutions.
  • Develop Python-based automation tools and services supporting data engineering, workflow orchestration, model integration, and operational efficiencies.
  • Support the deployment and maintenance of production-scale AI and machine learning solutions in mission-focused environments.
  • Evaluate emerging AI technologies and recommend enhancements that improve analytical capabilities and operational outcomes.

Requirements

  • Active TS/SCI with Polygraph Clearance
  • Experience developing and performing ETL on large unstructured datasets
  • Experience with Python
  • Experience with services including Apache Kafka, Apache Spark, and Prefect
  • Experience containerizing applications using Docker and deployments on Kubernetes
  • Experience building and maintaining CI/CD pipelines for data and platform services
  • Familiarity with Linux-based systems
  • Solid understanding of DevOps principles (automation, monitoring, reliability)
