Data Engineer

Raas Infotek LLC
Texas City, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Texas City, United States of America

Tech stack

Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Azure
Big Data
Google BigQuery
Cloud Computing
Data Architecture
Information Engineering
Data Governance
ETL
Data Systems
Data Warehousing
Database Design
DevOps
Distributed Computing Environment
Hadoop
Python
Metadata
Operational Databases
Query Optimization
SQL Databases
SQL Server Integration Services
Data Streaming
Talend
Google Cloud Platform
Informatica Powercenter
Delivery Pipeline
Snowflake
Spark
Database Performance
Infrastructure as Code (IaC)
GIT
Event Driven Architecture
Data Lake
Information Technology
Kafka
Machine Learning Operations
Terraform
Stream Processing
Data Pipelines
Redshift
Databricks

Job description

  • Design, develop, and optimize large-scale data pipelines and ETL/ELT processes.
  • Build and maintain data warehouses, data lakes, and lakehouse architectures.
  • Develop scalable solutions for batch and real-time data processing.
  • Collaborate with business stakeholders, architects, data scientists, and analysts to gather requirements and deliver data solutions.
  • Implement data quality, governance, security, and compliance standards.
  • Optimize database performance, query tuning, and data processing workflows.
  • Develop and maintain data models, metadata, and technical documentation.
  • Monitor and troubleshoot production data pipelines and resolve performance bottlenecks.
  • Mentor junior engineers and promote best practices in data engineering.
  • Support cloud migration and modernization initiatives.

Requirements

We are seeking a highly skilled Senior Data Engineer with 10+ years of experience in designing, developing, and maintaining enterprise-scale data solutions. The ideal candidate will have strong expertise in data warehousing, ETL/ELT development, cloud platforms, big data technologies, and data modeling. The candidate will play a key role in building scalable data pipelines and supporting analytics, reporting, and AI/ML initiatives.

  • Strong expertise in SQL and Python.
  • Hands-on experience with Apache Spark, Kafka, Hadoop, and distributed data processing frameworks.
  • Extensive experience with ETL/ELT tools such as Informatica, Talend, SSIS, Airflow, or dbt.
  • Experience with cloud platforms (AWS, Azure, or Google Cloud Platform).
  • Strong knowledge of Snowflake, Redshift, BigQuery, Synapse, or similar data warehouse technologies.
  • Expertise in data modeling, data architecture, and database design.
  • Experience with real-time streaming and event-driven architectures.
  • Knowledge of CI/CD pipelines, Git, and DevOps practices.
  • Strong understanding of data governance, security, and compliance standards.

  • Experience with Databricks and Delta Lake.
  • Exposure to AI/ML data pipelines and MLOps.
  • Experience with Terraform or Infrastructure as Code (IaC).
  • Cloud certifications (AWS, Azure, or Google Cloud Platform).
  • Experience working in Agile/Scrum environments.

  • Bachelor's or Master's degree in Computer Science, Information Technology, Engineering, or a related field.

  • Healthcare, Banking, Insurance, or Energy domain experience.
  • Experience leading enterprise data transformation projects.
  • Strong stakeholder management and leadership skills.
