Site Reliability Engineer (SRE) - Database and Monitoring
Role details
Job location
Tech stack
Job description
Infomaniak is the company behind SwissTransfer and a trusted partner for leading organisations: international institutions such as the United Nations, media outlets like France Télévisions, iconic events such as the Montreux Jazz Festival and the Annecy Festival, as well as central banks, major cities and security organisations across Europe.
An independent company, B Corp certified and awarded for its data centres that push the limits of efficiency and energy recovery, Infomaniak is living proof that it is possible to build a different digital world: sovereign, sustainable and beneficial for the local economy. Here, your passion will become meaningful work: you will work autonomously, take on real responsibilities and contribute to projects that impact millions of people.
We are looking for a:
Site Reliability Engineer (SRE) - Database and Monitoring
You are a proactive, responsible, curious geek, passionate about Web technologies. Join our team to develop our high value-added solutions and much more, * Ensure the stability of existing services and infrastructures
- Maintain but above all evolve (updates / capacity planning / design) infrastructures
- Automate recurring tasks (interventions / deployments), implement automatic remediation to aim for self-healing systems, not just automated ones (self-healing)
- Participate in the on-call rotation
Environment:
You will join the "Core services" team which manages and evolves our infrastructures related to data flows. These infrastructures support all other teams and services by primarily providing robust and redundant database, messaging, log archiving and monitoring services.
Requirements
- 3 years or more of personal or professional experience in Linux system administration, self-taught candidates welcome
- Good knowledge of Linux systems, git, databases (MySQL, MariaDB, PostgreSQL, Clickhouse)
- Production experience with high availability databases is a plus
- Experience with one or more messaging tools (Kafka, RabbitMQ) is a plus
- Production experience with Elastic Cloud on Kubernetes (ECK) is a plus
- Good knowledge of the Zabbix monitoring system
- Comfortable with configuration managers (Ansible, Terraform, Puppet)
- Comfortable with Python and shell scripting is a plus
- Experience with Kubernetes is a plus
- Excellent oral and written communication in French and fluent English
- Good organisational skills, dynamism, ability to work autonomously
- Have a sense of humour, * 3 years experience in Linux administration
- Production experience with at least one of the following areas:
- High availability databases
- Zabbix monitoring
- Elasticsearch
- Knowledge of containerisation