Engineer, Data Engineering

Propertyvalue

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Tech stack

Clean Code Principles

Agile Methodologies

Airflow

Amazon Web Services (AWS)

Automation of Tests

Code Review

Computer Programming

Serialization

Electronic Data Interchange (EDI)

Protocol Buffers

Gradle

Monitoring of Systems

Java Virtual Machine (JVM)

Python

Maven

E2e Testing

Software Engineering

Data Streaming

Management of Software Versions

Test Driven Development

Spark

FastAPI

Avro

Kafka

REST

Terraform

Docker

Microservices

Job description

defined test scenarios, while ensuring full compliance through detailed validation. After release, the role includes monitoring system performance, alerts, and SLOs to ensure optimal functionality and reliability. Deliver well-documented, maintainable code following Test-Driven Development (TDD) principles, ensuring comprehensive unit, integration, and end-to-end testing. Design, operate, and document versioned RESTful APIs using FastAPI and JVM-based frameworks, ensuring scalability, reliability, and backward compatibility. Implement and enforce data schema evolution and versioning strategies to support reliable data exchange across systems. Develop and maintain batch and streaming data pipelines using technologies such as Kafka and Spark, handling backpressure, orchestration, retries, and data quality controls. contribute to CI/CD pipelines, automated testing, and participate in incident response to ensure system resilience and SLO adherence. Partner closely with Product and

Requirements

cross-functional teams to translate requirements into high-quality technical solutions that deliver business outcomes. Adhere to clean code and architectural standards through code reviews, testing, and Agile development practices, ensuring maintainable and compliant solutions. 3-5 years of experience in data-focused software engineering roles. ~ Good programming skills in Scala (or JVM) experience with Python preferred. ~ Good understanding of data modeling, schema evolution, and serialization technologies such as Avro or Protobuf. ~ Experience building and maintaining batch or streaming data systems, with knowledge of streaming patterns and reliability concerns. ~ Familiarity CI/CD pipelines, and modern monitoring and alerting practices. ~ Proficiency with Git-based workflows, code reviews, and Agile development methodologies. ~ Good sense of ownership, with pragmatic problem-solving skills, constructive critique and the ability to deliver end-to-end solutions. ~ Excellent communication skills and fluency in English, with the ability to collaborate across product and engineering teams. Experience with Apache Airflow for workflow orchestration. Exposure to cloud platforms (preferably AWS) and infrastructure as code using Terraform. Experience with Docker and Kubernetes in production environments. Hands-on knowledge of Kafka and event-driven or microservices architectures. Familiarity with JVM build and tooling ecosystems such as Gradle or Maven. We believe that different perspectives lead to better ideas, and better ideas allow us to better understand the needs and interests of our diverse, global community. Research shows that women and people of color are less likely than others to apply if they feel like they don't match 100% of the job requirements.

About the company

By simplifying global trade information and providing valuable insights, we empower organisations to make informed decisions in commodities, energy, and maritime sectors. Our team of over 700 experts from 35+ countries works tirelessly to transform intricate data into actionable strategies, ensuring our clients stay ahead in a dynamic market landscape. Join us to leverage cutting-edge innovation for impactful results and experience unparalleled support on your journey to success. Build and maintain Kpler's core datasets (vessels characteristics, companies, geospatial data). You will be responsible for creating and maintaining REST APIs, streaming pipelines (Kafka Stream), and Spark batch pipelines. The individual designs and builds functionality-including APIs and data processing components-ensuring code is deployed to development environments and reviewed through peer and product testing. They are responsible for writing and executing unit, integration, and functional tests aligned with

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Good distractions

Moments

Videos View all