Data Observability Engineer
Job description
We are developing a modular, technology-independent data platform for operational data, built on a plug-and-play principle. The platform provides a one-stop solution for operational data management, empowering teams to independently design and manage data pipelines from source to destination. At its core, the platform focuses on automation-first workflows that minimize manual effort and streamline processes. This is complemented by long-term data archiving and AI-driven analytics that support informed, real-time decision-making, all running on a flexible and location-independent infrastructure.
You will be:
- Building and managing a complex Splunk and Elastic setup for the platform
- Taking full ownership of the end-to-end data onboarding processes for Splunk & Elastic
- Building and managing end-to-end monitoring pipelines with OpenTelemetry (OTel)
- Scaling and automating infrastructure based on AWS and Infrastructure as Code (e.g. Terraform for provisioning and Ansible for configuration management and orchestration)
- Managing and tuning Linux-based environments (e.g. Amazon Linux) for high-performance applications like Splunk
- Building and extending internal tools and APIs using Python and/or Go, moving beyond basic automation to professional software maintenance
- Acting as a bridge within the team and as a reliable partner for internal and external customers
- Owning the full lifecycle of our platform, including regular maintenance, patching, and security management
- Acting as a trusted consultant for internal teams and external customers, helping them optimize their queries, understand their data, and build meaningful knowledge objects such as dashboards
Requirements
- Deep expertise in Splunk: You are at least Splunk Admin certified and, optionally, have a proven track record of managing self-managed environments (preferably on AWS).
- Hands-on experience with AWS or other hyperscalers and Infrastructure as Code (e.g. Terraform for provisioning and Ansible for configuration management and orchestration). You know how to scale and automate infrastructure.
- You have experience with OpenTelemetry (OTel) and understand how to build end-to-end monitoring pipelines.
- Solid experience in managing and tuning Linux-based environments (e.g. Amazon Linux), especially for high-performance applications like Splunk.
- Ability to build and extend internal tools and APIs using Python and/or Go, moving beyond basic automation to professional software maintenance
- Proficient with Git (GitLab) and strong understanding of CI/CD best practices and version control
- Excellent communication skills (German and English) to act as a bridge within the team and as a reliable partner for internal and external customers. You can explain complex technical concepts to different stakeholders
- You take ownership of the full lifecycle of our platform. This includes regular maintenance, patching, and security management (e.g. managing firewall rules, security groups, certificates) to ensure high availability and compliance
- You take full ownership of the end-to-end data onboarding processes for Splunk & Elastic.
- You act as a trusted consultant for internal teams and external customers. You help them optimize their queries, understand their data, and build meaningful knowledge objects such as dashboards.
What will be a plus:
- Experience with Apache NiFi or similar ETL/ELT tools (KNIME, dbt, etc.) for managing and routing large-scale data streams
- You have a strong interest in shaping the technical roadmap and participating in architectural decision-making for our observability platform.
- Experience with Elasticsearch/Elastic Cloud is a strong benefit.
- Certifications in OTel, Splunk (in addition to Admin), and Elastic.
Benefits & conditions
- Permanent employment contract and a competitive, market-aligned salary based on your experience
- Two pet-friendly offices in Germany
- Company phone for personal use
- Up to 30 days of vacation
- Employee Assistance Program (EAP)
- Company ticket for public transportation and available parking spaces
- Benefit from a company pension scheme
- Bicycle leasing option with a company subsidy
- Hybrid work model with 2 days of home office per week
- Referral program with a bonus - invite a friend to join the team.