DevOps Engineer (holding a valid NATO Security Clearance) - Deadline 21/05/26

AlmavivA de Belgique

Brussels, Belgium

1 month ago

Role details

Contract type

Permanent contract

Employment type

Part-time (≤ 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Brussels, Belgium

Tech stack

API

Amazon Web Services (AWS)

Azure

Bash

Code Reuse

Complex Networks

Computer Networks

DevOps

Disaster Recovery

Python

Network Monitoring

Role-Based Access Control

Ansible

Prometheus

Zero Trust Network Access

vSphere

Data Logging

Google Cloud Platform

SUSE Linux

Istio

Grafana

Kubernetes Helm Charts

Multi-Cloud

Gitlab

Kubernetes

Infrastructure Automation Frameworks

Rancher

Bare Metal

Hardware Infrastructure

Nutanix

Job description

Design, build, and evolve highly scalable, secure, and resilient Kubernetes platforms utilizing SUSE Rancher as the central management plane.
Handle the end-to-end lifecycle of RKE/RKE2 clusters (provisioning, scaling, patching and upgrading) across hybrid, multi-cloud or bare-metal environments with minimal to zero downtime.
Develop and maintain automated CI/CD pipelines and implement GitOps workflows (using tools like Gitlab, ArgoCD or Rancher Fleet) to streamline continuous application delivery.
Monitor platform health using tools like Prometheus and Grafana, respond to system alerts and troubleshoot complex containerized workload issues.
Enforce security policy's, including network policies, Rancher RBAC configurations and automated container image scanning.
Proactively monitor resource utilization (CPU, memory, persistent storage) to right-size clusters, scale nodes dynamically and optimize cloud or on-prem infrastructure.
Create and maintain technical documentation, including runbooks and architecture diagrams.

Requirements

Do you have experience in vSphere?, Do you have a Bachelor's degree?, * Minimum of 4 years of dedicated, hands-on experience in DevOps or Platform Engineering roles.

Over 4 years of sustained experience designing, deploying and maintaining highly available production Kubernetes clusters, heavily utilizing the SUSE Rancher ecosystem.
A solid track record of leading complex migrations, moving legacy virtualized applications into containerized, microservice-based architectures.
Expertly provision, secure, and lifecycle-manage large-scale, multi-cluster Kubernetes environments using SUSE Rancher UI/API and Rancher Manager.
Write modular, reusable code to automate infrastructure provisioning across hybrid environments using Ansible.
Architect and maintain sophisticated, zero-downtime deployment pipelines using GitOps principles with Gitlab, ArgoCD or Rancher Fleet.
Author/manage advanced Helm charts and manage complex stateful and stateless deployments.
Engineer comprehensive monitoring and logging stacks (Prometheus, Grafana, ELK) to establish SLIs/SLOs and configure automated remediation. Experience with Rancher Observability is a plus.
Develop robust automation scripts in Python or Bash.
Rapidly diagnose and resolve high-severity incidents across the entire stack, from kernel-level container issues to complex network routing failures.
Comprehensive understanding of the underlying architectures of RKE, RKE2, K8S /K3s and knowing exactly when to apply each distribution.
Knowledge of zero-trust models, Rancher RBAC, Pod Security Admissions, container scanning and enforcing governance.
CNIs tools (Calico, Cilium, Canal) deep understanding of Software Defined Network and Container Networking, network policies and managing complex Ingress and Service Mesh (Istio) architectures.
Expert knowledge of Container Storage Interfaces (CSI), disaster recovery planning and managing highly available storage solutions like SUSE Longhorn.
Extensive practical experience operating distributed Kubernetes environments across a mix of bare-metal, on-premises hypervisors (vSphere, Nutanix), and public clouds (AWS, Azure, GCP).
Experience acting as a technical escalation point, guiding junior team members, and establishing DevOps best practices across development teams.

Preferences

Engineer comprehensive monitoring and logging stacks (Prometheus, Grafana, ELK) to establish SLIs/SLOs and configure automated remediation. Experience with Rancher Observability is a plus.

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all