Staff Software Engineer
Role details
Job location
Tech stack
Job description
- Design and optimize infrastructure systems for machine learning workloads at scale and drive reliability and efficiency improvements across Snapchat's ML Infrastructure
- Develop high-performance embedding generation / batch inference systems to improve model performance
- Develop high-performance data storage/compute systems to improve the efficiency of our ML infrastructure
- Integrate state of the art ML data quality system to assure model performance
- Build comprehensive data management systems for scalable data collection, labeling, processing, and evaluation
- Work closely with ML engineers to deploy cutting-edge models into production
- Utilize AI tools and high velocity engineering workflows to design and ship scalable services while upholding rigorous standards for code correctness, security, and production ready quality code
Requirements
- Strong programming skills in Python, Java, Scala, or C++
- Strong problem-solving skills with a focus on system performance, scalability, and efficiency
- Good understanding of distributed systems and the infrastructure components of large-scale ML
- Ability to collaborate and work well with others
- Proven track record of operating highly-available systems at significant scale
- Ability to proactively learn new concepts and apply them at work
- Proficiency in, or a strong aptitude for, leveraging AI tools to streamline development, paired with the critical judgment to audit generated output for architectural integrity, performance bottlenecks, and security risks.
- Adaptability in learning and applying evolving AI systems and tools to remain at the forefront of engineering trends and modern development practices, * Bachelor's degree in a technical field such as computer science or equivalent experience
- 9+ years of post-Bachelor's software development experience; or Master's degree in a technical field + 5+ years of post-grad software development experience; or PhD in a relevant technical field+ 2+ years of post-grad software development experience
- Experience building large scale production machine learning systems, distributed systems or big data processing
Preferred Qualifications:
- Masters/PhD in a technical field such as computer science or equivalent industry experience
- Experience with big data processing frameworks such as Spark, Flink, or Ray
- Experience with large scale feature store or embedding system
- Familiarity with ML frameworks such as Pytorch, Tensorflow
Benefits & conditions
In the United States, work locations are assigned a pay zone which determines the salary range for the position. The successful candidate's starting pay will be determined based on job-related skills, experience, qualifications, work location, and market conditions. The starting pay may be negotiable within the salary range for the position.These pay zones may be modified in the future.
Zone A (CA, WA, NYC): The base salary range for this position is $229,000-$343,000 annually.
Zone B: The base salary range for this position is $218,000-$326,000 annually.
Zone C: The base salary range for this position is $195,000-$292,000 annually.
This position is eligible for equity in the form of RSUs.