Site Reliability Engineer

Aerospacelab
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English, French

Tech stack

Kubernetes Security
Java
Amazon Web Services (AWS)
Big Data
C#
Cloud Computing
Databases
Continuous Delivery
Continuous Integration
DevOps
Distributed Systems
GitHub
Identity and Access Management
Python
Key Management
PostgreSQL
Linux System Administration
MongoDB
NoSQL
Open Web Application Security
Performance Tuning
Reliability Engineering
Software Reliability Testing
Ansible
Prometheus
Secure Coding
SQL Databases
Policy as Code
Data Logging
Load Balancing
Azure
Grafana
GitLab
Git
Containerization
Tanzu
Git Flow
Kubernetes
Information Technology
Hardware Infrastructure
Terraform
Software Version Control
Apache Beam
Docker
Jenkins
Programming Languages

Job description

  • Provide SRE expertise across relevant projects, contributing to reliability, performance, and operational excellence.
  • Ensure clear and efficient communication between all relevant stakeholders (engineering, security, business teams, etc.).
  • Offer SRE guidance and best practices within internal R&D processes when required.
  • Continuously expand knowledge, supported by internal resources and learning opportunities.
  • Collaborate closely with Security, Development, and Operations teams to integrate reliability and observability best practices into the entire service lifecycle.
  • Implement SRE principles such as SLIs/SLOs, error budgets, incident management, tooling automation, and proactive reliability improvements.
  • Contribute to incident response, on-call rotation, root-cause analysis, and long-term remediation to strengthen production environments.

Requirements

  • Master's, PhD, or Bachelor's degree in Computer Science, Engineering, or a related field.
  • Creative problem-solver with strong knowledge of modern reliability, DevOps, and infrastructure technologies.
  • Ability to drive improvements, implement process changes, and introduce new tools and automation into the organization.
  • Strong organization, communication, and documentation skills, with the ability to convey complex concepts to non-technical audiences.
  • Experience collaborating with a variety of stakeholders across the organization.
  • Understanding of secure coding practices and modern security frameworks (e.g., NIST, OWASP).
  • Familiarity with SRE methodologies such as observability, automation, performance engineering, system design, and reliability metrics.

Technical skills

  • Experience with cloud and on-premise infrastructure environments.
  • Experience with container platforms such as Kubernetes; experience with Tanzu is a plus.
  • Experience with Docker and containerization.
  • Experience with packaging and configuration tooling (e.g., Helm, Kustomize).
  • Experience with backup and disaster recovery (DR) for clusters and workloads (e.g., Velero, Kasten).
  • Experience with policy as code (e.g., OPA/Gatekeeper, Kyverno).
  • Experience with performance and reliability testing (e.g., k6) is a plus.
  • Understanding and application of GitOps principles (declarative configuration, automated reconciliation, continuous delivery) and tooling (e.g., Flux, Argo CD).
  • Experience with cloud providers (certifications are an asset).
  • Knowledge of databases (NoSQL, SQL, Postgres, MongoDB, etc.) is a plus.
  • Experience with programming languages such as Python, Java, or C/C++/C# is a plus.
  • Experience with CI/CD tools (e.g., Jenkins, GitLab, GitHub Actions).
  • Experience with Git, branching strategies, and version control best practices.
  • Experience with Infrastructure as Code tooling (e.g., Terraform, Ansible).
  • Experience with Linux system administration, troubleshooting, and performance tuning.
  • Experience with big data technologies (e.g., Azure Data Factory, AWS Data Pipeline, GCP Dataflow) is a plus.
  • Knowledge of distributed systems, load balancing, networking, and storage.
  • Experience with monitoring/observability, alerting, logging, and automation tools (e.g., Prometheus, Grafana, Alloy, Mimir, Loki).
  • Understanding of container security, IAM, and secrets management (e.g., Vault, AWS Secrets Manager).
  • Knowledge of French and English.

Benefits & conditions

Salary package consistent with your experience

About the company

Our mission is supported by an ambitious, vertically integrated approach: the design and manufacture of small satellites combined with the development of Earth observation services. Since its creation in 2018, the company has grown substantially. Our team brings together more than 25 areas of engineering expertise, from hardware design to software development and data science. As of 2026, Aerospacelab has offices in Louvain-la-Neuve (Belgium), Toulouse (France), Lausanne (Switzerland), and on the US West Coast, with more than 350 full-time employees and the ambition to position itself as the European leader in small satellites.
