Machine Learning Data Engineer
Role details
Job location
Tech stack
Job description
As an ML Data Engineer, you will be responsible for researching, developing and implementing data and machine learning pipelines and systems to support applications that enable operations teams to manage and optimize the manufacturing process as effectively as possible. Applications for these systems include continuous predictions of quality, cost and machine failure, delivering live recommendations for process optimization, and identifying anomalies for real-time alerting etc.
As part of developing these systems, you will work with manufacturing data, including real-time sensor readings and measurements; context about this data collected from external systems; outcomes such as quality/cost/yield; and knowledge about the process from structured and unstructured sources. You will be expected to build systems and models that can learn from these high dimensional and heterogeneous timeseries datasets, and that can be adaptive to changing process conditions and explicit and implicit feedback. You will work with the latest techniques from deep learning, AI for timeseries, reinforcement learning, and recommendation systems to enable the applications.
You will work closely with Oden's Product team to build these systems as foundations for our product, and are expected to participate in the end-to-end engineering, carrying projects from conceptualization to development to deployment. You will also work closely with our customer success teams to develop an understanding of the process and deliver and validate results in production environments., * Developing and maintaining systems that manage the full ML lifecycle - from data preparation and model training to deployment and monitoring.
- Working with our data scientists to create and deploy ML/AI models, algorithms, representations and methods that capture properties of complex manufacturing processes
- Enable applications of these models to different problems for prediction, optimization, and monitoring and alerting.
- Working with the engineering, product and customer success teams to Implement and deploy solutions
- Validating results in real-world deployments with our customers
- Generalizing solutions and innovating to create the next generation of product features
- Engaging with the technical community to present results externally, keep up to date on recent advances, and advance the state of the art
Requirements
- 3-5 years professional experience as an ML Engineer/Data Engineer and advanced degrees (M.S. or PhD. in Machine Learning, Distributed Systems, Computer Science with ML/AI focus, or related fields)
- Proficient with distributed data processing systems e.g. Apache Beam/Dataflow, Apache Spark
- Proficient with ML workflows and tools - including open source tools like MLFlow, cloud platforms like Vertex or Sagemaker
- Proficient with Python, including most common ML and Optimization libraries
- Experience with SQL and GCP is a plus.
- Experience with designing, building and deploying performant ML products
- Familiarity with unstructured data and Generative AI - especially around enterprise applications is a plus.
- Enthusiasm to own projects end-to-end; from experimentation to customer delivery
- This person should be based in the New York metro-area and willing to come to an office at least a several times a month
We anticipate the base salary band for this role will be between $120,000 and $190,000, in addition to equity and benefits. This role may be hired at the L3 Engineer level ($120,000 to $170,000) or the Senior ML Engineer level ($140,000 to $190,000). The salary at offer will be determined by factors such as the candidate's experience, knowledge, skills, and abilities, as well as internal equity among our team.
At this time, we cannot offer visa sponsorship. Candidates must be authorized to work in the United States without current or future employer sponsorship