Software Infrastructure Engineer
Role details
Job location
Tech stack
Job description
- Build and maintain the AI Factory dashboard and the yield/triage data pipeline (ingest, scheduled processing, uptime).
- Support the image-inspection pipeline: factory image ingestion and automated end-to-end processing.
- Operate on production-grade infra (Tupperware, S3) -- deployments, monitoring, on-call reliability.
- Onboard new factories onto the platform and scale capacity as we expand.
- Partner with factory-side engineers on camera/AOI and station-data integration.
Requirements
- Strong software / firmware engineering with data-pipeline experience
- Cloud storage and service deployment (e.g., S3, containerized services)
- Track record owning reliability / infrastructure for a production tool
Nice-to-Have Skills:
- Image / vision pipeline experience
- Familiarity with factory / manufacturing data
- Python and ML tooling
Additional Information:
Story behind the Need: We build AI tooling that root-causes factory failures, auto-generates EE validation SOPs, and AI-inspects assembly images; it runs on production-grade infrastructure, is deployed live at Ceres2, and is scaling to Coop and across the AI-devices portfolio. This role builds and maintains the software/infrastructure backbone of that tooling.
Purpose/Size of the Team: A small, dedicated AI Factory pod within V+V / Wrist Hardware. This position owns the infrastructure and reliability backbone the tooling runs on.
How will performance be measured? Tool/dashboard uptime and pipeline reliability; on-time delivery of infra and pipeline features; smooth factory-side data integration.
Required YOE: 8+ Years
Degree/Certifications Required: BS in Computer Science, EE, or equivalent
Candidate Disqualifiers: Front-end-only profiles, or candidates with no production infrastructure / reliability background. No experience with production deployments / on-call reliability; unable to support occasional cross-time-zone factory bring-up.
Difficult Aspects of Job: Fast-moving 0 1 environment; production reliability expectations (uptime/on-call); coordination across US China time zones during factory bring-up.