Data Engineer

Intone Networks
Bentonville, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Bentonville, United States of America

Tech stack

Artificial Intelligence
Airflow
Big Data
Google BigQuery
Cloud Storage
Directed Acyclic Graph (Directed Graphs)
Information Engineering
Data Infrastructure
ETL
Data Security
Data Warehousing
Database Queries
Data Flow Control
Query Optimization
SQL Databases
Data Streaming
Feature Engineering
Spark
Build Management
Data Lineage
Apache Flink
Kafka
Data Pipelines
Apache Beam

Requirements

Do you have experience in Spark implementation?, Must Haves: GCP SPARK Airflow SQL GCP Data services AI/ML IS A super nice to have Job description: KEY RESPONSIBILITIES * Design and build scalable ETL/ELT pipelines using Apache Airflow, Apache Spark, and GCP Dataflow * Develop and maintain BigQuery data models, schemas, and performance-optimized SQL queries * Build and maintain data pipelines feeding AI/ML feature stores and forecasting models * Collaborate with AI Developers to ensure high-quality, low-latency data access for model training * Manage and optimize Cloud Composer DAGs and pipeline orchestration * Implement data quality monitoring, alerting, and lineage tracking * Participate in data platform architecture decisions and documentation REQUIRED QUALIFICATIONS * 3+ years (Intermediate) or 5+ years (Specialist) of data engineering experience * Hands-on experience with Apache Airflow for pipeline orchestration * Proficiency in Apache Spark for large-scale data processing * Strong SQL skills including complex query optimization and BigQuery-specific capabilities * Experience with GCP data services: BigQuery, Cloud Storage, Pub/Sub, Dataflow * Solid understanding of ETL/ELT patterns and data warehousing principles PREFERRED QUALIFICATIONS * GCP Professional Data Engineer certification * Experience supporting ML/AI data infrastructure (feature engineering, training datasets) * Familiarity with real-time streaming (Kafka, Dataflow/Flink) * Retail or large-scale consumer data experience

Apply for this position