Sr. System Engineer
Role details
Job location
Tech stack
Job description
We are seeking an experienced Sr. Systems Engineer to join our Kubernetes AI team. Job functions include Linux & Kubernetes certifications, AI software stack validation & reference architectures, systems benchmarking (MLPerf) & tuning. The ideal candidate will have hands on experience in a server lab environment. This role provides a degree of autonomy, where the candidate should be able to operate and produce results without guidance; but we will also make sure your properly technically trained or tutored. In the fast evolving world of AI, the candidate must be willing, and a fast learner. The candidate will be a key contributor to our Kubernetes AI team. We develop products, take them to market, support sales to architect bespoke HW + SW stacks for customers, and deploy as a turn key solution., * Architect & develop reference architectures for full AI Factory stacks, from NVAIE, to Kubernetes, to Storage.
- Support sales on customer engagements to provide technical subject matter expertise on AI solutions.
- Architect & BOM SW+HW AI stacks for customers.
- Run Linux and Kubernetes certifications on Supermicro servers
- Run benchmarks (MLPerf) on GPU servers and perform systems tuning & optimization support.
Requirements
Do you have experience in Technical troubleshooting support?, * Bachelor's degree in Electrical Engineering, Electronics, Computer Engineering, Mechanical Engineering, or related field
- 8+ years of experience in an engineering, product development, validation, or test engineering environment.
- Strong understanding of hardware/software integration and performance tuning.
- Experienced with Linux and Kubernetes environments.
- Experience with scripting or automation tools such as Python, Bash, or PowerShell.
- Strong troubleshooting and analytical skills.
- Ability to read schematics, engineering drawings, and technical documentation.
- Good written and verbal communication skills.
Preferred Qualifications
- Experience with physical server hardware installation and deployment
- Experience with automated test systems and data acquisition tools.
- Familiarity with embedded systems or firmware validation.
- Knowledge of networking concepts and interfaces.
- Experience with Jira, Confluence, or similar engineering workflow tools.
- Understanding of manufacturing test processes or product qualification.