Senior IT Data Engineer
Role details
Job location
Tech stack
Job description
This role involves participating in the planning and execution of policies, practices, and projects designed to acquire, control, protect, and enhance the value of organisational data assets., Included but not limited to:Data Acquisition: Facilitate obtaining data from variety of sources in correct formats while adhering to quality standards.Pipeline Development: Build robust data pipelines that clean, transform, and aggregate unorganized data into databases.Platform Operations: Develop, maintain, monitor, and manage the long-term operations of data pipelines or processing systems within the Cloudera Data Platform.Issue Resolution: Resolve information flow and content issues as they arise.CI/CD Implementation: Implement and maintain continuous integration and continuous delivery (CI/CD) pipelines and manage data platforms.Strategic Planning: Participate in the planning of practices and projects to enhance data asset value
Requirements
General Experience: 5+ years of experience in application/data development, specifically with Python.Specialized Experience: 5+ years of experience with data integration and ingestion tools, such as Apache NiFi.Methodology: Experience working with Scrum and Kanban methodologies.Platform Experience: Proficiency in the long-term operations of data pipelines or processing systems running in the Cloudera Data Platform.**ship is a requirement for this position, Data Processing & Engineering: Proficiency in PySpark, pandas, or dbt.Data Ingestion: Expertise in Apache NiFi.Languages & Databases: Advanced knowledge of SQL, Java, and Microsoft SQL Server.Distributed Computing: Experience with platforms including Hadoop, MapReduce, Hive, HBase, Kafka, and Spark.DevOps & Tools: Understanding of git and DevOps-enabled technologies.Systems: Proficiency in UNIX/Linux, including basic commands and shell scripting.Technical Operations: Knowledge of data extraction, transformation, loading (ETL), and performance tuning.