Software Engineer - TS/SCI with Poly
Job description
We are seeking a skilled and mission-driven Software Engineer to support the ML Frameworks team. In this role, you will contribute to the design, development, and deployment of a Retrieval-Augmented Generation (RAG) solution within a high-performance computing (HPC) Linux environment. This position is ideal for someone with a strong Linux background and a passion for AI/ML technologies, especially Large Language Models (LLMs) and secure AI systems.
Primary Responsibilities:
- Contribute to the development and deployment of secure RAG pipelines in an HPC Linux environment
- Work with cutting-edge AI/ML technologies, including LLM orchestration frameworks, embedding models, and inference platforms
- Develop and maintain services using Python and Golang
- Build and manage containerized services using Docker, Podman, and containerd
- Deploy services using orchestration tools like Kubernetes and Docker Compose
- Automate and monitor workflows with GitLab CI/CD pipelines, using Git for version control
- Integrate monitoring tools such as Prometheus and Grafana for performance and reliability
- Participate in Linux system administration, including command-line work, shell scripting, and general support tasks
Requirements
Clearance Required: TS/SCI with Polygraph
Experience Level: 7+ years of experience
Education: Bachelor's degree in Computer Science, Engineering, or related field
- Active TS/SCI clearance with Polygraph
- Bachelor's degree in Computer Science, Engineering, or a related technical field
- 7+ years of professional software engineering experience
- Strong experience with Linux system administration, CLI, and shell scripting
- Proficiency in Python and Golang
- Familiarity with RAG pipelines, LLMs, and knowledge retrieval systems
- Experience with containerization technologies and orchestration tools (Docker, Podman, Kubernetes, Docker Compose)
- Solid understanding of CI/CD principles and related tools (GitLab CI)
- Experience with metrics and monitoring tools (e.g., Prometheus, Grafana)
- Experience using Git for version control
Desired Skills & Technologies:
- Experience with GPU-enabled applications and debugging tools
- Familiarity with LLM orchestration frameworks and OpenAPI
- Experience with distributed processing frameworks such as Spark, Dask, or Ray
- Familiarity with SQL, Elasticsearch, and vector databases
- Experience with HTMX or Hyperscript
- Understanding of multi-node, multi-GPU AI training environments
- Knowledge of AI inference platforms such as NVIDIA NIM/Triton, vLLM, or Ray-based deployment
- Experience with the Atlassian suite (Confluence, Jira)
Benefits & conditions
Our client offers a highly competitive and comprehensive benefits package designed to support your personal and professional growth, while promoting a healthy work-life balance. Benefits include:
- 100% Employer-Paid Health, Dental, and Vision Insurance - Full coverage for employees
- Zero Vesting 401(k) Plan with 10% Company Contribution - Immediate access to all contributions
- 31 Days of Paid Time Off - Includes vacation, personal time, and all federal holidays
- Student Loan Repayment Assistance - Helping you pay down your educational debt
- Unlimited Certification & Training Support - Invest in your professional development
- Flexible Work Environment - Remote work options and flexible scheduling available
- Multiple Incentive Bonuses - Performance-based rewards throughout the year
- Exclusive Company Memberships - Access to curated memberships and employee perks
This package reflects our client's commitment to empowering their employees with meaningful benefits and recognizing outstanding performance.