Site Reliability Engineer

Aerospacelab

3 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English, French

Job location

Tech stack

Kubernetes Security

Java

Amazon Web Services (AWS)

Big Data

C Sharp (Programming Language)

Cloud Computing

Databases

Continuous Delivery

Continuous Integration

DevOps

Distributed Systems

Github

Identity and Access Management

Python

Key Management

PostgreSQL

Linux System Administration

MongoDB

NoSQL

Open Web Application Security

Performance Tuning

Reliability Engineering

Software Reliability Testing

Ansible

Prometheus

Secure Coding

SQL Databases

Policy as Code

Data Logging

Load Balancing

Azure

Grafana

Gitlab

GIT

Containerization

Tanzu

Git Flow

Kubernetes

Information Technology

Amazon Web Services (AWS)

Hardware Infrastructure

Terraform

Software Version Control

Apache Beam

Docker

Jenkins

Programming Languages

Job description

Provide SRE expertise across relevant projects, contributing to reliability, performance, and operational excellence.
Ensure clear and efficient communication between all relevant stakeholders (engineering, security, business teams, etc.).
Offer SRE guidance and best practices within internal R&D processes when required.
Continuously expand knowledge, supported by internal resources and learning opportunities.
Collaborate closely with Security, Development, and Operations teams to integrate reliability and observability best practices into the entire service lifecycle.
Implement SRE principles such as SLIs/SLOs, error budgets, incident management, tooling automation, and proactive reliability improvements.
Contribute to incident response, on-call rotation, root-cause analysis, and long-term remediation to strengthen production environments.

Requirements

Do you have experience in Terraform?, Do you have a Master's degree?, * Master's, PhD, or Bachelor's degree in Computer Science, Engineering, or a related field.

Creative problem-solver with strong knowledge of modern reliability, DevOps, and infrastructure technologies.
Ability to drive improvements, implement process changes, and introduce new tools and automation into the organization.
Strong organization, communication, and documentation skills, with the ability to convey complex concepts to non-technical audiences.
Experience collaborating with a variety of stakeholders across the organization.
Understanding of secure coding practices and modern security frameworks (e.g., NIST, OWASP).
Familiarity with SRE methodologies such as observability, automation, performance engineering, system design, and reliability metrics.

Technical skills

Experience with cloud and on-premise infrastructure environments.
Experience with container platforms such as Kubernetes; experience with Tanzu is a plus.
Experience with Docker and containerization.
Experience with packaging and config (e.g., Helm, Kustomize, etc).
Experience with Backup/DR for clusters and workloads (e.g., Velero, Kasten, etc).
Experience with policy as code(e.g., OPA/GateKeeper, Kyverno, etc).
Experience with Performance and reliability testing is a plus (e.g., k6).
Understanding and application of GitOps principles and tooling (e.g., declarative configurations, automated reconciliation, and continuous delivery, Flux, ArgoCD, etc).
Experience with cloud providers (certifications are an asset).
Knowledge of databases (NoSQL, SQL, Postgres, MongoDB, etc.) is a plus.
Experience with programming languages such as Python, Java, or C/C++/C# is a plus.
Experience with CI/CD tools (e.g., Jenkins, GitLab, GitHub Actions).
Experience with Git, branching strategies, and version control best practices.
Experience with Infrastructure as Code tooling (e.g., Terraform, Ansible).
Experience with Linux system administration, troubleshooting, and performance tuning.
Experience with big data technologies (e.g., Azure Data Factory, AWS Data Pipeline, GCP Dataflow) is a plus.
Knowledge of distributed systems, load balancing, networking, and storage.
Experience with monitoring/observability, alerting, logging, and automation tools (e.g., Prometheus, Grafana, Alloy, Mimir, Loki, etc).
Understanding of container security, IAM, and secrets management (e.g., Vault, AWS Secrets Manager).
Knowledge of French and English.

Benefits & conditions

Salary package consistent with your experience

About the company

About Aerospacelab Our mission is supported by an ambitious vertically integrated approach: design & manufacture of small satellites combined with development of earth observation services. Since its creation in 2018, the company has already grown substantially. Our team gathers more than 25 engineering expertise, from hardware design to software development & data science. In 2026, Aerospacelab counts offices in Louvain-la-Neuve (Belgium), Toulouse (France), Lausanne (Switzerland) and US West Coast with more than 350 full-time employees, with the ambition to position itself as the European leader in small satellites.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all