DevOps Engineer (holding a valid NATO Security Clearance) - Deadline 21/05/26

AlmavivA de Belgique
Brussels, Belgium
yesterday

Role details

Contract type
Permanent contract
Employment type
Part-time (≤ 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Brussels, Belgium

Tech stack

API
Amazon Web Services (AWS)
Azure
Bash
Code Reuse
Complex Networks
Computer Networks
DevOps
Disaster Recovery
Python
Network Monitoring
Role-Based Access Control
Ansible
Prometheus
Zero Trust Network Access
vSphere
Data Logging
Google Cloud Platform
SUSE Linux
Istio
Grafana
Kubernetes Helm Charts
Multi-Cloud
Gitlab
Kubernetes
Infrastructure Automation Frameworks
Rancher
Bare Metal
Hardware Infrastructure
Nutanix

Job description

  • Design, build, and evolve highly scalable, secure, and resilient Kubernetes platforms utilizing SUSE Rancher as the central management plane.
  • Handle the end-to-end lifecycle of RKE/RKE2 clusters (provisioning, scaling, patching and upgrading) across hybrid, multi-cloud or bare-metal environments with minimal to zero downtime.
  • Develop and maintain automated CI/CD pipelines and implement GitOps workflows (using tools like Gitlab, ArgoCD or Rancher Fleet) to streamline continuous application delivery.
  • Monitor platform health using tools like Prometheus and Grafana, respond to system alerts and troubleshoot complex containerized workload issues.
  • Enforce security policy's, including network policies, Rancher RBAC configurations and automated container image scanning.
  • Proactively monitor resource utilization (CPU, memory, persistent storage) to right-size clusters, scale nodes dynamically and optimize cloud or on-prem infrastructure.
  • Create and maintain technical documentation, including runbooks and architecture diagrams.

Requirements

Do you have experience in vSphere?, Do you have a Bachelor's degree?, * Minimum of 4 years of dedicated, hands-on experience in DevOps or Platform Engineering roles.

  • Over 4 years of sustained experience designing, deploying and maintaining highly available production Kubernetes clusters, heavily utilizing the SUSE Rancher ecosystem.
  • A solid track record of leading complex migrations, moving legacy virtualized applications into containerized, microservice-based architectures.
  • Expertly provision, secure, and lifecycle-manage large-scale, multi-cluster Kubernetes environments using SUSE Rancher UI/API and Rancher Manager.
  • Write modular, reusable code to automate infrastructure provisioning across hybrid environments using Ansible.
  • Architect and maintain sophisticated, zero-downtime deployment pipelines using GitOps principles with Gitlab, ArgoCD or Rancher Fleet.
  • Author/manage advanced Helm charts and manage complex stateful and stateless deployments.
  • Engineer comprehensive monitoring and logging stacks (Prometheus, Grafana, ELK) to establish SLIs/SLOs and configure automated remediation. Experience with Rancher Observability is a plus.
  • Develop robust automation scripts in Python or Bash.
  • Rapidly diagnose and resolve high-severity incidents across the entire stack, from kernel-level container issues to complex network routing failures.
  • Comprehensive understanding of the underlying architectures of RKE, RKE2, K8S /K3s and knowing exactly when to apply each distribution.
  • Knowledge of zero-trust models, Rancher RBAC, Pod Security Admissions, container scanning and enforcing governance.
  • CNIs tools (Calico, Cilium, Canal) deep understanding of Software Defined Network and Container Networking, network policies and managing complex Ingress and Service Mesh (Istio) architectures.
  • Expert knowledge of Container Storage Interfaces (CSI), disaster recovery planning and managing highly available storage solutions like SUSE Longhorn.
  • Extensive practical experience operating distributed Kubernetes environments across a mix of bare-metal, on-premises hypervisors (vSphere, Nutanix), and public clouds (AWS, Azure, GCP).
  • Experience acting as a technical escalation point, guiding junior team members, and establishing DevOps best practices across development teams.

Preferences

  • Engineer comprehensive monitoring and logging stacks (Prometheus, Grafana, ELK) to establish SLIs/SLOs and configure automated remediation. Experience with Rancher Observability is a plus.

Apply for this position