Senior ML Data Engineer

Intuition Machines, Inc.

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Shift work

Languages

English

Experience level

Senior

Job location

Remote

Tech stack

Amazon Web Services (AWS)

Azure

Big Data

Continuous Integration

Information Engineering

Data Stores

Distributed Data Store

Python

Machine Learning

NoSQL

TensorFlow

Software Engineering

SQL Databases

Data Processing

Data Storage Technologies

Feature Engineering

PyTorch

Backend

Kubernetes

Kafka

Front End Software Development

Data Pipelines

Job description

As a Senior ML Data Engineer, you will help shape and expand the pipelines that power our products and research efforts. You'll work across teams to design, maintain, and improve high-performance data pipelines, ensuring that data is accessible, reliable, and scalable to meet the needs of our users and internal stakeholders.

Maintain, extend, and improve existing data/ML workflows, and implement new ones to handle high-velocity data.
Provide interfaces and systems that enable ML engineers and researchers to build datasets on demand.
Influence data storage and processing strategies.
Collaborate with the ML team, as well as frontend and backend teams, to build out our data platform.
Reduce time-to-deployment for dashboards and ML models.
Establish best practices and develop pipelines and software that enable ML engineers and researchers to efficiently build and use datasets.
Work with large datasets under performance constraints comparable to those at the largest companies.
Iterate quickly, with a focus on shipping early and often, ensuring that new products or features can be deployed to millions of users.

Requirements

Minimum of 3 years of experience in a data role involving designing and building data stores, feature engineering, and building reliable data pipelines that handle high loads.
At least 2 years of professional software development experience in a role other than data engineering.
Proficiency in Python and experience working with Kafka infrastructure and distributed data systems.
Deep understanding of SQL and NoSQL databases (preferably ClickHouse).
Familiarity with public cloud providers (AWS or Azure).
Experience with CI/CD, containerization, orchestration platforms such as Kubernetes, and microservice design.
Proven ability to make independent decisions regarding data processing strategy and architecture.
Thoughtful, self-directed individual who is able to operate effectively in a fast-paced environment.
Experience collaborating across ML, backend, and frontend teams.
Understanding of machine learning fundamentals, including model training, inference, and frameworks such as PyTorch or TensorFlow.

Benefits & conditions

Fully remote position with flexible working hours.
An inspiring team of colleagues spread all over the world.
Pleasant, modern development and deployment workflows: ship early, ship often.
High impact: lots of users, happy customers, high growth, and cutting-edge R&D.
Flat organization, direct interaction with customer teams.

We celebrate equality of opportunity and are committed to creating an inclusive environment for all team members.

Join us as we transform cybersecurity, user privacy, and machine learning online!

Please note that all positions require pre-employment screening, including third-party verification of work history, education, and identity, as well as a final in-person interview and identity verification step, which will be conducted in your country of residence.