Data Engineer (Databricks, Neo4j)

Datamatics Technologies

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Tech stack

Amazon Web Services (AWS)

Azure

Cloud Computing

Information Engineering

Data Governance

ETL

Data Systems

Distributed Data Store

Python

Meta-Data Management

Neo4j

Performance Tuning

DataOps

SQL Databases

Teradata

Scripting (Bash/Python/Go/Ruby)

Spark

Data Lake

PySpark

Real Time Data

Kafka

Data Management

Data Pipelines

Databricks

Job description

We are looking for an experienced Data Engineer with strong hands-on expertise in Databricks, Teradata, and Neo4j to join a leading technology-driven team in Sweden. This is a remote role, but we require candidates who are currently residing in Europe due to project compliance and collaboration needs., Data Engineering & Development

Design, develop, and optimize scalable data pipelines using Databricks (PySpark/Spark).
Build, maintain, and enhance ETL/ELT processes across multiple data environments.
Integrate structured and unstructured datasets for downstream analytics and consumption.
Develop and optimize data models on Teradata for performance and reliability.
Implement graph-based data solutions using Neo4j.

Solution Design & Architecture

Collaborate with solution architects and business teams to understand data needs and design robust solutions.
Participate in system design sessions and contribute to architecture improvements.
Ensure data quality, validation, and governance throughout the data lifecycle.

Performance & Optimization

Troubleshoot and optimize Spark jobs, Teradata SQL queries, and data workflows.
Ensure highly available and high-performance data pipelines.
Monitor data operations and automate workflows where possible.

Collaboration & Communication

Work with cross-functional teams including BI, Data Science, and Platform Engineering.
Document technical designs, pipelines, and solutions clearly and thoroughly.
Communicate effectively with remote stakeholders in a multicultural environment

Requirements

Do you have experience in Teradata?, The ideal candidate will have a solid background in building scalable data pipelines, integrating complex data sources, and working with modern data platforms., * 5-7 years of experience as a Data Engineer.

Strong, hands-on experience with Databricks (Spark, PySpark, Delta Lake).
Mandatory expertise in Neo4j (graph modeling, Cypher queries).
Solid experience with Teradata (SQL, performance tuning, data modelling).
Strong scripting and coding experience in Python.
Experience working with cloud platforms (Azure/AWS/GCP) is preferred-Azure is a plus.
Strong understanding of ETL/ELT concepts, data modelling, and distributed data processing.
Excellent analytical, problem-solving, and communication skills.
Ability to work independently in remote, cross-cultural teams.

Preferred Qualifications

Experience with CI/CD pipelines for data workflows.
Knowledge of data governance, data quality frameworks, and metadata management.
Exposure to real-time data processing technologies (Kafka, Event Hub, etc.) is an advantage.