Senior Principal Platform Engineer - AI Automation

Clarity
Jessup, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Jessup, United States of America

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Automated Storage and Retrieval Systems
Azure
Bash
Cloud Computing
Encodings
Computer Security
Continuous Delivery
Continuous Integration
Information Engineering
Data Infrastructure
DNS
Identity and Access Management
OSI Models
Python
PostgreSQL
MySQL
Open Source Technology
OpenID
Package Management Systems
TensorFlow
Prometheus
Security Assertion Markup Language (SAML)
Search Technologies
Software Engineering
Management of Software Versions
Software Vulnerability Management
AI Infrastructure
Data Processing
Scripting (Bash/Python/Go/Ruby)
Load Balancing
Computer Network Technologies
Okta
PyTorch
Istio
Delivery Pipeline
Large Language Models
Grafana
Amazon Web Services (AWS)
Backend
Gitlab
Containerization
AI Platforms
Gitlab-ci
Templating
Kubernetes
HuggingFace
Build Tools
Machine Learning Operations
Api Design
REST
Terraform
Data Pipelines
Automation Anywhere
Devsecops
Docker
Ci Server

Job description

  • Data Engineering: Experience maintaining data pipelines or high-throughput data infrastructure.
  • Cybersecurity: Background in threat modeling, vulnerability management, or SOC operations.
  • API Design: Experience designing or maintaining RESTful or RPC APIs.

Requirements

Do you have experience in Tooling?, * Professional Experience: 7+ years of combined experience in DevSecOps, Platform Engineering, or SRE.

  • Kubernetes Mastery: Deep expertise in Administration and Development of Kubernetes clusters.
  • Containerization: Advanced knowledge of Docker or equivalent container build tools.
  • Cloud & IaC: Experience with Azure or AWS architecture and Infrastructure as Code (Terraform/Crossplane).
  • Software Development: Proficiency in at least one backend or scripting language-ideally Python, Go, or Bash-to drive systems automation.
  • Core Networking: Solid understanding of OSI Layer 4-7, including VPC/VNET configuration, DNS, Load Balancing, and SSL/TLS management.
  • AI Ecosystem: Practical experience with modern AI/ML frameworks and tooling such as PyTorch, Hugging Face, LangChain, vLLM, Ray, MLflow, or equivalent open-source ecosystems.
  • LLM & AI Infrastructure: Hands-on experience deploying, scaling, and securing AI/ML workloads on Kubernetes, including GPU-enabled clusters, model-serving platforms, and distributed inference/training systems.
  • AI Platform Operations: Experience building internal AI platforms or developer enablement tooling that supports model lifecycle management, experimentation, inference endpoints, and reproducible AI workflows.
  • MLOps & AI Delivery Pipelines: Familiarity with MLOps concepts and tooling, including automated model deployment, versioning, evaluation, observability, rollback strategies, and CI/CD integration for AI systems.Vector & Retrieval Systems: Working knowledge of vector databases, embedding pipelines, retrieval-augmented generation (RAG), and semantic search architectures.
  • AI Security & Governance: Understanding of AI security concerns including model isolation, data handling controls, prompt injection risks, supply-chain security, and governance requirements for sensitive or regulated environments.

Primary Technical Focus Heavy emphasis is placed on the GitLab ecosystem and GitOps workflows:

  • GitLab Expert: Mastery of GitLab CI/CD (including Runners, Templates, and Security Scanners).
  • GitOps & Continuous Delivery: Hands-on experience with ArgoCD or equivalent declarative CD tools.
  • Package Management: Proficiency with Helm for templating and deploying Kubernetes applications.

Preferred Skills & Certifications

  • Security Compliance: Possession of DoD 8570 certifications (e.g., Security+ or CASP+/SecurityX).
  • Advanced Networking: Experience with Service Mesh technologies (e.g., Istio, Cilium) and CNI plugins.
  • Identity & Access: Configuration and management of Keycloak or OIDC/SAML providers.
  • Observability: Experience with the Grafana / Prometheus stack or similar.
  • Database Administration: Functional knowledge of PostgreSQL and MySQL.

About the company

Clarity Innovations is a trusted national security partner, dedicated to safeguarding our nation's interests and delivering innovative solutions that empower the Intelligence Community (IC) and Department of Defense (DoD) to transform data into actionable intelligence, ensuring mission success in an evolving world. Our mission-first software and data engineering platform modernizes data operations, utilizing advanced workflows, CI/CD, and secure DevSecOps practices. We focus on challenges in Information Warfare, Cyber Operations, Operational Security, and Data Structuring, enabling end-to-end solutions that drive operational impact., We are committed to delivering cutting-edge tools and capabilities that address the most complex national security challenges, empowering our partners to stay ahead of emerging threats and ensuring the success of their critical missions. At Clarity, we are people-focused and set on being a destination employer for top talent, offering an environment where innovation thrives, careers grow, and individuals are valued. Join us as we continue to lead innovation and tackle the most pressing challenges in national security.

Apply for this position