Data Observability Engineer

COCUS AG
Düsseldorf, Germany
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English, German

Job location

Düsseldorf, Germany

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Big Data
Business Process Modeling
Configuration Management
Scientific Data Archiving
ETL
Linux
Elasticsearch
Python
KNIME
Linux System Administration
Operational Data Store
Software Maintenance
Ansible
DataOps
System Availability
Gitlab
GIT
Infrastructure Automation Frameworks
Apache Nifi
Firewall Services Module
Terraform
Splunk
Software Version Control
Data Pipelines

Job description

We are developing a modular, technology-independent data platform for operational data, built on a plug-and-play principle. The platform provides a one-stop solution for operational data management, empowering teams to independently design and manage data pipelines from source to destination. At its core, the platform focuses on automation-first workflows that minimize manual effort and streamline processes. This is complemented by long-term data archiving and AI-driven analytics that support informed, real-time decision-making, all running on a flexible and location-independent infrastructure. You will be:

  • Building and managing a complex Splunk and Elastic setup for the platform
  • Taking full ownership of the end-to-end data onboarding processes for Splunk & Elastic
  • Building and managing end-to-end monitoring pipelines with OpenTelemetry (OTel)
  • Scaling and automating infrastructure based on AWS and Infrastructure as a Code (e.g. Terraform for provisioning and Ansible for configuration management and orchestration)
  • Managing and tuning Linux-based environments (e.g. Amazon Linux) for high-performance applications like Splunk
  • Building and extending internal tools and APIs using Python and/or GO, moving beyond basic automation to professional software maintenance
  • Acting as a bridge within the team and as reliable partner for internal and external customers
  • Owning of the full lifecycle of our platform, including regular maintenance, patching, and security management
  • Act as a trusted consultant for internal teams and external customers. You help them to optimize their queries, understand their data and build meaningful knowledge objects like dashboards.

Requirements

Do you have experience in Terraform?, * Deep expertise in Splunk: You are at least Splunk Admin Certified and have optional a proven track record of managing self-managed environments (preferably on AWS).

  • Hands-on experience with AWS or other hyper scalers and Infrastructure as a Code (e.g. Terraform for provisioning and Ansible for configuration management and orchestration). You know how to scale and automate infrastructure.
  • You have experience with OpenTelemetry (OTel) and understand how to build end-to-end monitoring pipelines.
  • Solid experience in managing and tuning Linux-based environments (e.g. Amazon Linux), especially for high-performance applications like Splunk.
  • Ability to build and extend internal tools and APIs using Python and/or GO, moving beyond basic automation to professional software maintenance
  • Proficient with Git (GitLab) and strong understanding of CI/CD best practices and version control
  • Excellent communication skills (German and English) to act as a bridge within the team and as reliable partner for internal and external customers. You can explain complex technical concepts to different stakeholders
  • You take ownership of the full lifecycle of our platform. This includes regular maintenance, patching, and security management (e.g. managing firewall rules, security groups, certificates) to ensure high availability and compliance
  • You take full ownership of the end-to-end data onboarding processes for Splunk & Elastic.
  • You act as a trusted consultant for internal teams and external customers. You help them to optimize their queries, understand their data and build meaningful knowledge objects like dashboards etc.

What will be a plus:

  • Experience with Apache Nifi or similar ETL/ELT tools (Knime, dbt etc) for managing and routing large-scale data streams
  • You have a strong interest in shaping the technical roadmap and participating in architectural decision-making for our observability platform.
  • Experience with Elasticsearch/Elastic Cloud is a strong benefit.
  • Certifications in OTel, Splunk (in addition to Admin9 and Elastic).

Benefits & conditions

  • Permanent employment contract and a competitive, market-aligned salary based on your experience
  • Two pet-friendly offices in Germany
  • Company phone for personal use
  • Up to 30 days of vacation
  • Employee Assistance Program (EAP)
  • Company ticket for public transportation and available parking spaces
  • Benefit from a company pension scheme
  • Bicycle leasing option with a company subsidy
  • Hybrid work model with 2 days of home office per week
  • Referral program with a bonus - invite a friend to join the team.

Apply for this position