Senior Platform Engineer - ML Systems (Infrastructure Focus)

Computer Enterprises, Inc.
Kissimmee, United States of America
12 days ago

Role details

Contract type
Temporary to permanent
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 166K

Job location

Remote
Kissimmee, United States of America

Tech stack

Amazon Web Services (AWS)
Amazon Web Services (AWS)
Application Layers
Computer Vision
Computer Programming
Continuous Integration
Distributed Systems
Amazon DynamoDB
Monitoring of Systems
Python
Machine Learning
Object Detection
Data Streaming
Cloud Platform System
Real Time Systems
Delivery Pipeline
Backend
Kubernetes
Rancher
Machine Learning Operations

Job description

We are seeking a Senior Platform Engineer to support the development and operation of ML-driven systems in production. This role is infrastructure-first, with a strong emphasis on building, deploying, and scaling systems that support machine learning workloads, particularly in computer vision and object detection environments., * Design, deploy, and operate containerized applications in Kubernetes environments (Rancher preferred)

  • Build and maintain cloud-native infrastructure in AWS (S3, DynamoDB, SageMaker)
  • Develop backend services, automation, and tooling using Python
  • Support deployment and scaling of ML models in production environments
  • Collaborate with ML and backend engineers to ensure reliable data and inference pipelines
  • Troubleshoot and resolve production issues across infrastructure and application layers
  • Contribute to system design decisions around scalability, reliability, and performance, This role offers the opportunity to work on production-grade ML infrastructure supporting advanced computer vision systems. You'll have meaningful ownership across the full lifecycle of scalable, real-world platforms.

Requirements

  • Strong hands-on experience with Kubernetes in production environments
  • Experience with Rancher or similar cluster management tools
  • Solid experience with AWS, including S3, DynamoDB, and SageMaker (or similar ML infrastructure tools)
  • Strong programming experience in Python
  • Experience building and operating distributed systems
  • Comfort troubleshooting systems across infrastructure and application layers
  • Experience working with or supporting data or ML pipelines
  • Understanding of how data flows through systems (ingestion ? processing ? output)
  • Familiarity with event-driven or real-time systems
  • Ability to collaborate effectively with backend and ML teams

Preferred Skills

  • Experience with model deployment and inference systems
  • Exposure to object detection and computer vision workflows
  • Familiarity with vLLM or similar inference frameworks
  • Experience with CI/CD and infrastructure-as-code
  • Familiarity with monitoring and observability tools
  • Infrastructure-first mindset without being infrastructure-only
  • Strong problem-solving skills in production environments
  • Ability to bridge infrastructure and application layers
  • Comfort working in fast-moving, ambiguous environments
  • Ownership mindset from build ? deploy ? operate

About the company

As a trusted technology partner, CEI delivers solutions that help our customers transform their business and achieve meaningful results. From strategy and custom application development through application management, our technology and digital experience services are tailored to meet each unique need of our customers. Our staffing solutions bring specialized skills to complement our customers' workforce and project requirements.

Apply for this position