Linux System Engineer
Role details
Job location
Tech stack
Job description
- The System Engineer provides advanced operational support for Linux-based IT infrastructure, with a strong focus on system monitoring, observability, and rClientbility. They ensure platform stability, performance, and availability through proactive monitoring, alerting, and continuous analysis of system health.
- The engineer is responsible for timely resolution of incidents and service requests, leveraging monitoring data and diagnostics to identify and address OS-level issues. They perform routine maintenance and implement system changes in line with established processes and standards, ensuring minimal service disruption.
- A key aspect of the role is the design, implementation, and continuous improvement of monitoring solutions, including metrics collection, logging, alerting, and dashboards. The engineer uses these capabilities to detect anomalies early and improve overall service quality.
- The engineer contributes to problem management by performing root cause analysis based on monitoring data, logs, and system behavior, and by proposing and implementing long-term corrective actions. They collaborate with cross-functional teams to ensure systems are secure, compliant, and effectively monitored end-to-end.
- In addition to operational duties, the engineer participates in R&D activities, evaluating new technologies and observability tooling, and contributing to solution prototypes. They drive automation through scripting and Infrastructure-as-Code to enhance monitoring coverage, reduce manual intervention, and increase platform reliability., * Telework: Client's remote work policy is applicable to the delivery of the service.
- Travel: Exceptional travel to Berlin (max 2 days per month) - Travel expenses will be paid by CLIENT.
STANDBY: After the training period, the consultant will be assigned to standby duties as agreed in advance with the manager. While on standby, the consultant is not actively working but must remain reachable and ready to respond if needed. The consultant may stay at home or elsewhere, provided they can respond promptly and intervene using their CLIENT laptop. If necessary, the consultant will return to the site during the standby period to perform the intervention.
- The consultant must be available and able to act immediately if contacted.
- Standby periods will be scheduled and agreed upon beforehand.
- Important: if the consultant will work in Belgium, for non-EU candidates, please present candidates who comply with the following criteria:
- Possess a work permit allowing the individual to work in Belgium.
- Hold a valid residence permit confirming the right of residence in Belgium.
Requirements
-
You are able to speak, read and write fluently English and French or Dutch.
-
Experience
-
Minimum 4 years of experience in IT operations, managing containerized, virtualized, and/or physical Linux-based infrastructure in large-scale environments (enterprise, governmental, or supranational).
-
Monitoring & Observability
-
Proven experience in the design, implementation, and operation of monitoring and alerting solutions
-
Hands-on experience with PRTG is strongly preferred
-
Solid understanding of:
-
Metrics collection, log aggregation, and alerting strategies
-
Event correlation, anomaly detection, and performance baselining
-
Incident detection and reduction of MTTD/MTTR
-
Linux Platform Engineering
-
End-to-end lifecycle management of Red Hat Enterprise Linux (or equivalent) environments
-
Experience with enterprise tooling, including:
-
Red Hat Satellite 6.x (provisioning, patching, lifecycle management)
-
System performance tuning, troubleshooting, and OS-level diagnostics
-
Automation & Infrastructure as Code
-
Strong experience with Ansible Automation Platform (playbooks, roles, automation workflows)
-
Familiarity with Infrastructure-as-Code principles and pipeline integration
-
Experience integrating automation with monitoring and operational workflows
-
CI/CD & Version Control
-
Practical experience with Git-based workflows (GitLab, Azure DevOps, or similar)
-
Understanding of CI/CD pipelines for infrastructure and configuration deployment
-
Container Technologies
-
Experience with Red Hat container ecosystem, including:
-
Podman / OCI containers
-
Understanding of container monitoring and lifecycle operations
-
IT Service Management (ITSM)
-
Experience working within structured ITIL-aligned processes:
-
Incident, problem, and change management
-
Service request handling
-
Strong focus on monitoring-driven operations, diagnostics, and forensics
-
Languages
-
Dutch or French: Full professional / native proficiency
-
English: Professional working proficiency
Preferred experience and skills
- Infrastructure & Platform Ecosystem
- Exposure to hardware lifecycle management (server, firmware updates, lifecycle planning)
- Experience with VMware-based virtual environments
- Familiarity with NetApp storage systems
- Security & Detection
- Experience with endpoint detection and protection tools:
- Trend Micro Deep Security, ClamAV, or equivalent
- Understanding of integrating security signals into monitoring/alerting pipelines
- Network & Platform Integration
- Basic operational knowledge of network/security platforms:
- Palo Alto, Check Point, F5, InfoBlox
- Ability to correlate infrastructure and network events for incident analysis
- Scripting & Automation
- Solid scripting skills in Bash and/or Python
- Ability to develop automation for monitoring, remediation, and reporting
- Vendor & Lifecycle Management
- Experience with vendor coordination, support processes, and licensing management
Other
- Must be a team player with organizational skills
- Customer minded, accountable, not a 9-to-5 worker
- Must have a Driver license + car + mobile phone
- Willingness to participate in 24/7 standby/on-call duties