Data Engineer for Software Engineering Group

Mobileye
12 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Tech stack

Amazon Web Services (AWS)
Amazon Web Services (AWS)
Apache HTTP Server
Big Data
Information Engineering
Data Infrastructure
Data Integrity
ETL
Data Retrieval
Data Warehousing
Relational Databases
Hadoop
Python
PostgreSQL
Machine Learning
MongoDB
MySQL
NoSQL
Unstructured Data
Workflow Management Systems
Parquet
Data Processing
Spark
GIT
Containerization
Data Lake
Kubernetes
Information Technology
Cassandra
Software Version Control
Data Pipelines
Docker

Job description

We are seeking a skilled and motivated Data Engineer to join our dynamic team. The ideal candidate will have extensive experience in building and optimizing data pipelines, handling large datasets, and working within cloud-based and on-premise environments. You will be responsible for designing and implementing efficient, scalable solutions to manage, process, and analyze high volumes of data, ensuring the reliability and performance of our data infrastructure.

What will your job look like:

  • Design, build, and maintain scalable data pipelines to process large datasets efficiently
  • Develop and implement data models and architectures that support both real-time and batch data processing
  • Ensure data integrity, security, and accuracy across all systems
  • Collaborate with data scientists, analysts, and other engineers to ensure data availability and quality
  • Optimize data retrieval and storage processes to handle large volumes of data seamlessly
  • Work with structured, semi-structured, and unstructured data, integrating various data sources
  • Troubleshoot and resolve data issues, ensuring continuous operation of the data infrastructure
  • Maintain and enhance ETL processes, ensuring scalability and performance in handling large datasets
  • Stay up-to-date with industry best practices and emerging technologies related to big data engineering

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • At least 4 years experience in data engineering, with a focus on large-scale data processing and big data technologies
  • Strong proficiency in Python
  • Experience with data pipeline and workflow management tools
  • Hands-on experience with large-scale data processing frameworks like Apache Spark, Hadoop, or similar
  • Familiarity with data modeling, ETL processes, and data warehousing concepts like Table formats. Apache Iceberg, or similar
  • Good knowledge of relational databases (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., Cassandra, MongoDB)
  • Experience with AWS-based data lakes, including working with Amazon S3 for storage, querying data using Amazon Athena, and managing datasets stored in Parquet format
  • Significant ownership ability

Nice to have:

  • Knowledge of machine learning frameworks and integrating data pipelines for model training and deployment
  • Experience with version control systems (e.g., Git), CI/CD pipelines, and automation tools
  • Experience with containerization (Docker, Kubernetes)

Apply for this position