AI/ML Platform Engineer
Role details
Job location
Tech stack
Job description
implement, and test AI/ML based analytic tools using one or more of the following frameworks. Candidate should be able to establish and maintain AI/ML enabling infrastructure using COTS, GOTS, and FOSS tools. Key Responsibilities * Design, develop, and maintain AI/ML sandbox type enviornments
- Develop and optimize applications utilizing:
- Kubernetes & Docker; KubeFlow; MLFlow; Bedrock; FastAPI; Ray Train; Tune; Safe Tensors; ONNX
- Support application packaging, versioning, and release management processes
- Collaborate with platform, infrastructure, and security teams to support system requirements
- Implement best practices for scalability, reliability, and performance
- Troubleshoot application, database, and deployment issues in distributed environments
- Support continuous integration and deployment (CI/CD) pipelines
- Participate in system accreditation, compliance, and audit activities
- Document system designs, configurations, and operational procedures
Requirements
-
Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
-
Entry level to 10+ years of experience in software or AI/ML development roles
-
Experience with the above mentioned tools and applications
-
Ability to operate in secure, compliance-driven environments Preferred Qualifications * Experience developing distributed or data-intensive applications
-
Experience supporting DoD or Intelligence Community programs
-
Experience with CI/CD tools and automation frameworks
-
Familiarity with Infrastructure as Code tools
-
Experience with performance tuning and database optimization
-
Experience working in hybrid or air-gapped environments Altamira is an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, national origin, disability, or protected veteran status.